Gene Acid345_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0038 
Symbol 
ID4071743 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp34756 
End bp35844 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content56% 
IMG OID637982038 
ProductTonB-like protein 
Protein accessionYP_589117 
Protein GI94967069 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACAC TTCTCTGCAT TTTCTTGTTG TCGTTAGCTG CGATTGCGCA AAATCTGCCG 
GATGCACCGT CAGCGCCGAA ACCCGCACCA TTCGGGAAGC CTGCTCCGAA GCCTGCGGTG
TCCTCGAGTT CCGTCACAGT GAATGAGACC GACGCGCTCT CGCACATTTT GACCCGGCCC
TTGCAGATCT ATCCCGCACT CGCCTCGGTA AATAAAATCG AGGGCGACGT TGTGATTGAA
GCGACGATCG ACACCGACGG CAATGTCGCC TCGACCAAAG TAGTTTCAGG GCATCCAGCA
CTGGCGCCAA CGGCTGTCGC ATTCGTGAAA CAGTGGCTCT TCCGGCCCTT CTATTCTGGC
GAGACGCGCG TGCCGGCGGT CACACAACTG ACGGTCCACT ATTCCCTCTT TGCGTCTGAA
GCGGAACGCG AGTTAGAAAA ACATTTCCAG GAAACATACT GGCCGGCGTG GCGCGCTGGC
GAAGAGGCGC TTGCAAAACA AGACTACGCA ACCGCGAAAC AACAGTTCGA GATCGGTCGC
AGCGAGGCTT CTAAGCTAGG CCAGGCAAAC TGGCAGGAAT TGGCGAATGC GCTGTCGAGA
CTGGGATCTG TTGAGTACCG CCAGAAGAAC TATTCCGCGG CCGAGCCATA TTTGATGCAG
GCGATCCAGA TCCAGCAAAA CCACCGCGAA GCCGACTCTT CGGAGATCGC AGACGCAATC
GGAAATCTCG CGCAGGTATA CCTTGCCGAG AACAACTTCA GCAAGGCGGA GCCGCTGTTC
CTCAAAGCAG TTGAGATCTA CGAAAAGCGA TTACAGGATC CGACTTCGAA GACGCAATAC
ACCAACGACC GCCGTCACCG GGTAATGAAT CTCTTCATGC TTGCGTCGCT GAACCAGGAG
ATGGGTGCGG GCGAAGAGGC CCTGAAGTAC TGCGATCAGG CTGCCGGCGA TGCCGGCCAG
GCAATGGCGA AGGATGAAGC GATCATCGTG CTGCGCACCT GTGAAACCGT GTATCGCAAG
AACCTGAAGT ACTCGCGTGC TCGGGAAGTC GAAGGGCTCG CGCAAGACCT GGAAAAGCAG
GCTCAATAA
 
Protein sequence
MKTLLCIFLL SLAAIAQNLP DAPSAPKPAP FGKPAPKPAV SSSSVTVNET DALSHILTRP 
LQIYPALASV NKIEGDVVIE ATIDTDGNVA STKVVSGHPA LAPTAVAFVK QWLFRPFYSG
ETRVPAVTQL TVHYSLFASE AERELEKHFQ ETYWPAWRAG EEALAKQDYA TAKQQFEIGR
SEASKLGQAN WQELANALSR LGSVEYRQKN YSAAEPYLMQ AIQIQQNHRE ADSSEIADAI
GNLAQVYLAE NNFSKAEPLF LKAVEIYEKR LQDPTSKTQY TNDRRHRVMN LFMLASLNQE
MGAGEEALKY CDQAAGDAGQ AMAKDEAIIV LRTCETVYRK NLKYSRAREV EGLAQDLEKQ
AQ