Gene Acid345_3133 details

Gene Information

Locus tag: Acid345_3133
Symbol:
ID: 4070248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3725814
End bp: 3726842
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 66%
IMG OID: 637985153
Product: TonB-like protein
Protein accession: YP_592208
Protein GI: 94970160
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0810] Periplasmic protein TonB, links inner and outer membranes
TIGRFAM ID: [TIGR01352] TonB family C-terminal domain
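
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the variable names are made up here, and it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 3725814
end_bp = 3726842

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases.
# Assumption: the last codon is a stop codon and contributes no amino acid.
codon_count, leftover = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, leftover, protein_length)  # prints: 1029 0 342
```

Under these assumptions the result matches the listed Gene Length (1029 bp) and Protein Length (342 aa).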


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.162217
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAGCG CCGCCCTGTA CGACCAACTC GACCAAGCCG TGGAGCGCAT TCTCACCGGC 
GACCAACTCG CCGTCGAAGA ATTCGACCCG CTCGTTCGCG AACTGCTTCC CATCGCCGAC
GATTTGCACG TCGCGCCTCG CCCCGACTTC CGCGCGTCGT TGCGAGCCGA ACTTGAACGC
CCGCGCCGTA GCGAGGTGAT CCCTATCGCG CCCGCCGTCC TGCCTTTCCT GTTCGCCGAG
CACGCTTCGA TGAGGCGCAG CCCGTTCGGC GCCTCGGCCG CACTGCACGC CGCGGCGTTT
CTGCTCATCG CAACTTCGAG CCTCTGGATG GCGCAACATC CCGTCGCGAA AAAACAAACC
ACCGCGCTGC TCACCGATGT CGGTACCTTC ACGCTGCCGC CGTCGAAGAC CATCGCTGGC
GGAGGTGGTG GAGGCGGAGA TCGCGACAAG TTCGACGCCT CTCGCGGCGA CGCACCGCGC
TTCGCCCGCG AGCAGATCAC GCCACCTGCC ATCGTCGTGC GCAATGAAGC CCCGAAACTT
GCGGTTGATC CGACCGTAGT CGGTCCGCCG GACGTAAAGC TCTCGAATCT CGGCGTAACC
GGCGACCCGC TTTCGAAGAT GCTGAGCGCC TCGAACGGCA CCGGCGCGGG CGGCGGGATT
GGCAGCGGCT ACGGTGGCGG CGTCGGTTCT GGCTATGGTC CTGGCGTCGG GCCGGGCTGG
GGCGGGGGGT ACGGGGGAGG GGTGTATCGG GTAGGCAGCG GCGTCAGCGC ACCGCGGGCG
ATCTACGCAC CCGACCCGCA ATACTCCGAA GAAGCCCGCA AAGCCAAGAT GCAAGGCGTA
GTGGTCCTCG CACTCGTAGT CGGCGCCGAT GGCCGCACCC ACGACGTCAA AATCGCCCGC
ACCCTGGGCA TGGGCCTCGA CGAGAAAGCC ATCGAAGCAG TAAAGACCTG GAAGTTCGAG
CCCGCCCTCA AAGACGGCCA CCCCGTCTCT GTCCTGGTCA GCGTCGAAGT CAACTTCCAC
CTCTACTAA
 
Protein sequence
MSSAALYDQL DQAVERILTG DQLAVEEFDP LVRELLPIAD DLHVAPRPDF RASLRAELER 
PRRSEVIPIA PAVLPFLFAE HASMRRSPFG ASAALHAAAF LLIATSSLWM AQHPVAKKQT
TALLTDVGTF TLPPSKTIAG GGGGGGDRDK FDASRGDAPR FAREQITPPA IVVRNEAPKL
AVDPTVVGPP DVKLSNLGVT GDPLSKMLSA SNGTGAGGGI GSGYGGGVGS GYGPGVGPGW
GGGYGGGVYR VGSGVSAPRA IYAPDPQYSE EARKAKMQGV VVLALVVGAD GRTHDVKIAR
TLGMGLDEKA IEAVKTWKFE PALKDGHPVS VLVSVEVNFH LY
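
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following sketch shows one way such a block-formatted gene sequence could be cleaned, translated, and checked for GC content. It is a minimal illustration rather than the method used to produce this record: the helper names `clean`, `gc_percent`, and `translate` are hypothetical, and the codon table is the standard one, whose amino-acid assignments translation table 11 shares (table 11 differs in which start codons it permits).

```python
from itertools import product

# Standard codon table in the conventional TCAG ordering.
# Translation table 11 uses the same amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons are shown as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 9 bases of the gene sequence listed above.
first_bases = "ATGTCTAGCG CCGCCCTGTA CGACCAACTC"
last_bases = "CTCTACTAA"

print(translate(first_bases))  # prints: MSSAALYDQL
print(translate(last_bases))   # prints: LY*
```

The two translated fragments agree with the start (`MSSAALYDQL`) and end (`LY`) of the protein sequence above, with the trailing `*` marking the stop codon. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.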