Gene Acid345_0810 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0810 
Symbol 
ID4068689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1004249 
End bp1006093 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content62% 
IMG OID637982817 
ProductTonB-like protein 
Protein accessionYP_589889 
Protein GI94967841 
COG category 
COG ID 
TIGRFAM ID[TIGR01352] TonB family C-terminal domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.904031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCCG GCACGCAGTT GAATTTAAGT CGCACGCCAG AGCCGAGTTC TACGGAACCC 
ACGCTCGCGC TTCTGCCTGC CATCGAATCC GGGCGCACCG TCTTCTTCCG CAACCTCGGC
GACTTCCTCT GTTTCCGCGA TTGGCCTCGC CCTTTCGATC GCAGTCGGTA CGAAGGCTGG
CTCACCAACT ACGTCCTGAC CGCGCCCGCA TGGACGCGCA TCCGCGAATC GTATTGGGGA
CACGTCCTCG TCGTCGTCGC CACCTTCGCG ATCTGCACCT CTGCCTGGTT CCTTGAACCG
CCCGTCAGGC CAATCAGCCC GTTTGAGCAC GCGCACATCC AGTACTACCC GACTTCCGAT
TACCTGCCCG CAATCAACAG CAAGGCGCCG CGTCCGCACA CGTCGAAGAA AGCCGATCCG
GTTTACGCGA AGCAGGAAAT CATCTCGGTA CGACCGAACG CCGACAATCA CACCCAGACC
GTCATCACAC CACCAAACGT GAAGCTCCAG CACGATGTCC CGCTTCCCAA CATGGCGGTT
TGGACGAGCG ACCCGGTCGC GCCTACTGCG GCTTCGCCTG CCGTGCAACC GAAGATGACC
TTCCCACTCG CGGCCCAGGT CGTTGAGCCA ACTCCGGACA TCAGCAAACT CCGCGACAAG
CGCACAATCA ACCTGCAAAG CTCAGCGGTC GAGCCGGTCG CGAACGATCG TACGCTGCGT
CCCAACGGTC AACTCAACGT CTCGGCACTG CAACCTTCAG TCGTCGCGCC TGCGCCGTCG
CTCCCTGTTC CCGCGCAACG CGCCAGCGGA CTCACCGTCG GCGAAGTCGT GCCACCCGCG
CCCACTGTTT CCGCAAAGAA GAGTGGTGTG ACCGGAATCG GCAGCCTGCA ACCGCAAGCT
ATACCCCCGG CACCAAATGC TCCGGGCATC GCCCAGCACG GGGTGCCGAA CGGAACCCCG
CAGCCCCAAG TGGTTCCACC CCAACCCAGT GTCGGCGGCA TCGCCGGATC GAACGGCAAA
CCAACTGGCC AGATCATCGC CCTCAGCGTT CACCCCACCG ACGTCCACGG TCCCATCAGC
GTTCCCGCCG GTAATCGCAA CGGTGAATTC TCCGCTGGCC CCAGCGGCCG TCCCGGTGCC
ACAGGACAGC CCGAATCTTC TGGAAACGAG AGCGGCCCTG GAGCAGGTAA GAACGGGAAC
TCTAGTGGCG CTGGAAACGG CTCCGGGAAA GAGAACGGCC CTGCGGGCAT TTACGTCAGC
GCGCCGCCCG AAGGCGCGAA TCCCGCGCCC GTCGTAGGCA AGACGCCGTC CCCCGCTCCC
GCAACAACCG AGATGGCCAA GCTGCAGTTC CCCAAGATGC AGCACGCCAC TGTCGCCGAT
CTCGCGAAAG CGACCAAGCC CATGCCCGCG ACAACCGCGC CCGAAGCGCG CAACCCGCTC
GCTGACAAGG TTTTCGCCGG CAAGCGCTAC TACGCCCTTA CGCTCAACAT GCCGAACCTA
AACTCTTCGA CCGGAAGCTG GGTGGTGCGC TTCGCCGAAC TCAACGATCG TCGCGATGGT
ATCCCCGTGT TAGCGCCTGT CGCGACCAGC AAACTCGATC CGGTTTACCC GCAGGCACTT
GTCCACTACC ACATCGAAGG CACGGTAACG CTCTACGCTG TCATCCGCCA GGACGGCACT
GTCGCTGACA TCAAAGTTTT ACGCAGTCTC GACAAGGATC TCGACTACAG CGCGATGCGC
GCCCTCGCGG GCTGGAGATT TGTTCCTGGA ATGAAGAATG GGACGGCGGT GGATTTAGAA
GCGATCGTGG ATATCCCATT CCACTTGAAG CCGATCAACC CGTAG
 
Protein sequence
MASGTQLNLS RTPEPSSTEP TLALLPAIES GRTVFFRNLG DFLCFRDWPR PFDRSRYEGW 
LTNYVLTAPA WTRIRESYWG HVLVVVATFA ICTSAWFLEP PVRPISPFEH AHIQYYPTSD
YLPAINSKAP RPHTSKKADP VYAKQEIISV RPNADNHTQT VITPPNVKLQ HDVPLPNMAV
WTSDPVAPTA ASPAVQPKMT FPLAAQVVEP TPDISKLRDK RTINLQSSAV EPVANDRTLR
PNGQLNVSAL QPSVVAPAPS LPVPAQRASG LTVGEVVPPA PTVSAKKSGV TGIGSLQPQA
IPPAPNAPGI AQHGVPNGTP QPQVVPPQPS VGGIAGSNGK PTGQIIALSV HPTDVHGPIS
VPAGNRNGEF SAGPSGRPGA TGQPESSGNE SGPGAGKNGN SSGAGNGSGK ENGPAGIYVS
APPEGANPAP VVGKTPSPAP ATTEMAKLQF PKMQHATVAD LAKATKPMPA TTAPEARNPL
ADKVFAGKRY YALTLNMPNL NSSTGSWVVR FAELNDRRDG IPVLAPVATS KLDPVYPQAL
VHYHIEGTVT LYAVIRQDGT VADIKVLRSL DKDLDYSAMR ALAGWRFVPG MKNGTAVDLE
AIVDIPFHLK PINP