Gene Acid345_2387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2387 
Symbol 
ID4071385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2820853 
End bp2822007 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content59% 
IMG OID637984403 
Productglycoside hydrolase family protein 
Protein accessionYP_591462 
Protein GI94969414 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2273] Beta-glucanase/Beta-glucan synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.286598 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.735718 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAATC AACTAAAACG CTCCTCGCTA GTTTTCCTGT TGAGCGGAAA GCGCAGCGCT 
GCGGCTGCTT TTTCGCTTTT GTGCGTCATC ATCACCGCAG GCTGCGGCGG AGGCAGCAGT
GGCGGTGGCA CAACCCCGCC CCCACCGGCG CAGGTGGCTA CGCCGACGGT CACCATGACA
GCCGCTCAGA ACGGCGGCCA GGTGGTCACG ATGGCCGATA CGACATCGGG TGCGAGCATC
TTTTTCACGA CCGACGGATC GACGCCGACG TCCACCTCAC AGCAATACCT GACGCCGGTA
CTGATTGCTT CCAACGTTAC GATCAAGGTC GTTGCGACTG TGAGTGGCAT GACGGACAGC
GATGTGACCT CGCAAAGCGT CTCCGGGATT GCGAGCGGTG CATTGGTTTG GAGCGACGAA
TTTTCAAACT CGAGCGGCTC GAATGTGCAG CCCGATGCGT CAGTATGGAC CTACGACACC
GGGGCCGGCG GCTGGGGTAA TGCTGAACTC GAGACCTATT GTGCGTATGG CTCGACAACG
TCGCCGTGCG ATACCGCTCT GCCAAATACG TACGTTGGTA CCGATGGTTA TCTGCACGTT
GTAGCGCGCA ACGCGAGCGG CGGTGCGTAC ACTTCGGCGC GGCTAAAGAC CCAGGGCCTG
TTCAGTACGT CTTACGGGCG CGTTGAGGCG CTGATCAAAA TGCCGGAAGG GCAGGGGCTA
TGGCCGGCGT TCTGGATGTT GGGGAACAAC ATCACGACCG TGAACTGGCC AGCGTGCGGC
GAGCACGACA TCATGGAACA CATCAATGCG CCTTTGCCGG ATTGGTTTGC GGGCTCGATC
CATGGCGCCG GCAACCTGAA CGGCTCGGTA ACCTGGCCTA CAGCGGCCCA AACTTATACG
GCGTCAGATT GGCATATTTA CGGCATGATC TGGTCCAAGG GGAAGGTGCA GTACTACATC
GATGATCCGT CGAATGTGTA TGCATCGTTC GATTCGACGA CAATCACTAC CGGCACTTGG
CCGTTTGATT CGGATGGTGG AGCGTTCATC ATTCTCAACA TGGCTGTCGG TGGAAGCTGG
CCGGGTGCAC CAGATGCGAC GACCCCGTTT CCGTCGGAGA TGCTCGTGGA TTACGTCCGC
GTGTACGCGA ACTGA
 
Protein sequence
MTNQLKRSSL VFLLSGKRSA AAAFSLLCVI ITAGCGGGSS GGGTTPPPPA QVATPTVTMT 
AAQNGGQVVT MADTTSGASI FFTTDGSTPT STSQQYLTPV LIASNVTIKV VATVSGMTDS
DVTSQSVSGI ASGALVWSDE FSNSSGSNVQ PDASVWTYDT GAGGWGNAEL ETYCAYGSTT
SPCDTALPNT YVGTDGYLHV VARNASGGAY TSARLKTQGL FSTSYGRVEA LIKMPEGQGL
WPAFWMLGNN ITTVNWPACG EHDIMEHINA PLPDWFAGSI HGAGNLNGSV TWPTAAQTYT
ASDWHIYGMI WSKGKVQYYI DDPSNVYASF DSTTITTGTW PFDSDGGAFI ILNMAVGGSW
PGAPDATTPF PSEMLVDYVR VYAN