Gene Acid345_2101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2101 
Symbol 
ID4069700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2512630 
End bp2514093 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content56% 
IMG OID637984116 
ProductTonB-like protein 
Protein accessionYP_591176 
Protein GI94969128 
COG category[R] General function prediction only 
COG ID[COG0666] FOG: Ankyrin repeat 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATAGAAA TCATAGTTGT TCTTGCTTGG GCATCCGTCG GCTTCGGTCA GATCGCTCAT 
CAGCGAGTGC TGCTTTCGGC AGAGGACGTG CATTACCCGG CCCTCGCACG ACAGGCGCGC
ATTCAAGGCG ATGTTGAAAT TGCATTCCAC ATCGGAGACG ACGGTGTTCC AATCGGCACG
GAGCCGATCT CAGGTCATCC GATGTTGACC CAAGCCGCGC TCGACAATGT CGCCACCTGG
AGATTCAAGC CAGTTTCTGC GCACGATGCC CACACCTACA AAACCACGTT TCGCTTCGAG
CTTGACGCGC TGAATGGGCC CTATGACTAC GGCCCGAAGA CAATCAGACA CGGATTGGCG
TTAGTGGAGG TACAGGTTGT CTCAACTTGG ATTGGGGGCG GGCCTCCGCG CGCTCTGAAT
TGCCCGAAAG AGAAGGTCCG AGCGCCAGCC TCGACGACCG CCGATTTTCT TGAGCTTGAC
GAGAGCGACT ACCAAATCCG TCTTGGTGCG GATGGCTCGG TGATCTGGAG CAACACCGAT
GGGGAGCACG CATTCACCGT CGATTCCCGT CGTGCTCGAA AGTTGGTGGC GTCGTTCAAC
ACCGAGCAAA TCTGGAATTT GTGTGGGTTG TATGAAGGCT TTGGCAAAAC CGTGCGCCTG
ACTCTTCATA CAGGTGTTGC AACGAAAAAA GTCGAGGACG CTGGGTATAT ATCCCCTCAT
CCATTACCGG AGTTGGAAGA CGAAGTGAAC TTACTTGCGC GCACGCATGA ATGGCGCCAC
GGCGATCCTT CCGCTGAGCC AATCTGGAAC ATCCGCGACG AACGCGTAAA GCCGGACTTG
CCTCCTCTGG CCGCGGCGGC GCGCGACTGT AATTCCGATC GCGTCAAACA GTTGATCGCC
GCGGGAGAGA ATCTCTCTGC AGCGGACGCG AGCGGATGGA CGCCGTTAAT GTACGCAACG
CAGTGCGGAA ACGATACCGA AAAGCGTTTG CTGAAGGCCG GCGCTGATCC GAACCAATCT
TCCTATCGCG GGGATACCGC GCTCATGGTC CGTGCACTCC AGGGCTATCT TGATGAGGAT
CTTTTGCGCG CAGGGGCTAA CATCAACTTG CAAGACAACG ATGGACGCAC AATCCTGATG
TTTCTCGCCT CGCAGGGATC AGGAGAAATC GAGTACGCAA TCGAAGCTGG GGCCAATTCC
ACAATGAAGG ATACCCGCGG ATTGACGGCG TTCGACTACC TGAAGATTGC GAATTGTCGC
GCGAACCCGA TGGTGCAGGG ACTTGAGGCG ATCCCCGCAC CGGCTCCTCC CACACAAGAG
GTGGATGAAA AGGATTCGTC GGAAGAGACT TCGTGCAATC CGTTTCCAGA CGAGTGGATG
ACCAAGGCAC TGCAGCAGTT GAGCGCGCCT TACGACCCGC TACGCAGACC GCACATCCCA
AAGATTCTTC GCATGAGCGA GTAA
 
Protein sequence
MIEIIVVLAW ASVGFGQIAH QRVLLSAEDV HYPALARQAR IQGDVEIAFH IGDDGVPIGT 
EPISGHPMLT QAALDNVATW RFKPVSAHDA HTYKTTFRFE LDALNGPYDY GPKTIRHGLA
LVEVQVVSTW IGGGPPRALN CPKEKVRAPA STTADFLELD ESDYQIRLGA DGSVIWSNTD
GEHAFTVDSR RARKLVASFN TEQIWNLCGL YEGFGKTVRL TLHTGVATKK VEDAGYISPH
PLPELEDEVN LLARTHEWRH GDPSAEPIWN IRDERVKPDL PPLAAAARDC NSDRVKQLIA
AGENLSAADA SGWTPLMYAT QCGNDTEKRL LKAGADPNQS SYRGDTALMV RALQGYLDED
LLRAGANINL QDNDGRTILM FLASQGSGEI EYAIEAGANS TMKDTRGLTA FDYLKIANCR
ANPMVQGLEA IPAPAPPTQE VDEKDSSEET SCNPFPDEWM TKALQQLSAP YDPLRRPHIP
KILRMSE