Gene Acid345_4189 details


Gene Information

Locus tag: Acid345_4189
Symbol:
ID: 4072148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4955811
End bp: 4959128
Gene Length: 3318 bp
Protein Length: 1105 aa
Translation table: 11
GC content: 57%
IMG OID: 637986220
Product: TonB-dependent receptor
Protein accession: YP_593263
Protein GI: 94971215
COG category:
COG ID:
TIGRFAM ID:
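
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS includes a single terminal stop codon (the gene sequence listed under Sequence ends in TAA). The function names are hypothetical and chosen for illustration only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4955811
END_BP = 4959128
GENE_LENGTH_BP = 3318
PROTEIN_LENGTH_AA = 1105


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare a derived value with the value stated in the record.
    print("gene length consistent:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("protein length consistent:",
          expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons hold for the values in this record: the coordinate span equals the stated gene length, and the codon count minus the stop codon equals the stated protein length.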


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.316829
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.196574
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATCAAA AAATCGGAAC GCGAAGTGTG GCAATCGTTA GCCTGATGTT GCTCATACTC 
ACCTTCGCCG TTACCGGATT GGGCCAAACT AGCAAGGGCA TCATAGGTGG CACAGTTACC
GATAACAGCG GAGCAGTGGT CGTAGGAGCA CAAATTACAG CAACCAATCT CGACACCGGC
GATTCTCGCA CGGTGCAATC CGGACCGACC GGAGCCTTCC GTATGGAGGC GTTGAATCTC
GGGAAATACA AAGTGACCGT CGGGTACCAG GGCTTCCAAA CGCAGACCGT GACGGGTCTG
GAAGTGGCTG GCTCGGTCAT GACCCCGCTC GATATCAGAC TACAGATTGC GAGCGGCACC
GCGACGGAAA TTACAGTTTC CGCGGATACC AACCAGGTTC AGACGGAAAA CGGCGAACTT
TCCGGCAGCA TCGGTTCAAA AGAACTTTCG GATTTGCCGA TCAGCAGTCT CAACCCAATC
CAACTGGCGT TGACGCAGCC TGGAGTAATC GACAACAACG GCCGTGGCAG CACGAACGGC
CAGGGTTTCT CCGTCGCCGG CGGGCGTCCG CAGAGCAACA ACTTCCTGAT CGATGGTCAA
GACAACAACG ACAACAGCAT CCAAGGTCAG GCGTTCCAGC CGCAGAATCC CAATGCGGTC
CAGGAAGTCG CGATCATGAC GAACTCTTAC TCGGCAGAAT TCGGCCGCGG TGGATCATCG
GTCACCAACG TTATCTTCAA AAGCGGCACG AACCAGTACC ATGGCACGCT GAGCGAACTC
TATTCCGGTT CCGGATTAGA TGCGATCGAT GCGGCCAACG GCCTTGCCGG CATGAAAGAT
GGCGGTGATT GCAACCGGGC CCCGAATTAC GCTCCCTGTA AAGGCCGCTA CGATACGCAC
ACCTTCGGAT TTACCGTGGG TGGCCCGATC GTTAAGGACA AGCTGTTTGC GTTCGGCAGC
GGATTGTGGA ACCGCACTTA CGGCAATGAA GTCTCGAGCA CCTTCACCAT TCCCACGGCG
AACGGAGCGG CGCAGCTATC AAGTTATGGT TCGACCAATG CGAACCTGAT GCTGCAGTAT
CTCGGTAACA TTCGCGGGGG CTCAAACATT CAGAACGTGG CCACGGGCAT TGCGAGCATG
CCGTTCGTCG AAATGGGCGA CGCGACCCGC ATTGTTCCGG AGCAGAGCAC GGACACGCAG
TGGAATGTGA AAGTGGATTA CCTGCCCCAC CAGAGCGACA GCATTACCTT CCACTACCTG
CATGATCGTG GCTATTTCTC GCCCGATTGG TTTGCAAACT CGAGTTCGGT TCTGCCGAAC
TTTGAAACCT ACCAGGGAGG ACCGTCCTGG ATCTCGGGTG GCTCCTGGAC GCACACATTC
AGTTCCAACA AGGTGAATGA GTTCCGTGTC TCTTACGGCC ATCTCGGATT TACTTTCGCT
CCAACAGCGG GAACGACTGC AAATCCTCTT TATCCATTGC CTTATCTCTC TCTGAGCAGC
CCAAGCACGT TCCCGCTTCT CGGCACGGAT TCCGGTTTCC CGCAAGGGCG TAGCCATCGT
ACTTTGCAAC TTCAGGAGGC GTTCTCGATC ACGAAGGGAG CGCACACCGT CAAGATGGGT
GTCGACATCG CCCATATCTC GGTGACCGAC GACATTCCGA TCAACTCTCG CGGATGGATC
ACCTTCTCGG CGGGTGGCGG CTACAGCGCT CTCGGCAATT TCCTCGACAA CTACACAGGG
CGCAGCGGAC AGGCCCTCGA CATCCAGATT GGCAATCCGC GCGTGGAACC GACTCTCCTG
CAGAGCGGAT ACTATGTGCA AGACAATTGG AAGATCAAGT CGAACCTGAC CTTGAACCTC
GGTCTTCGCT ACGAATATCA GACAAATCCC GAGAATTCGC TGACGTACCC AGCCGTGAAG
CCGATCATGG GCGGCGAAGT CGCGTTCCCC ACCGTGGTGA AGGCAGACCA GCAGTACATG
CACTTCGCGC CACGTATCGG GTTTGCCTAC ACGCCTGATT TCTGGCCGAG CCTTTTCGGT
GACGGCAAAA CAGTCATTCG CGGCGGTTAT GGAATTTTCT ACGACGCGCT GTACACCAAC
ATTCTCGACA ATACGGCGTC CTCTTCACCA AACTCGATCG ATGTGCCCTT GTACGGCCGT
AATGGCGGCG CGCGCGGTTA TGCAAGCGCG ACCGACCTCT TCAACTCGCT TGATCCGGTC
GTCAGCCCTT TCAACACTGT CACCAGCGTT TCGCAACGCA TGACCAATCC GCGTACCACG
CAGTGGAACC TGGACGTCCA GCGCGAACTG CCGTGGAACC TGCTCGCGAC CGTCGCATAT
ATCGGCAGTC GCGGTCAGAA GCTGCTGGTA AACGACGACT ACAACCCCTT CGGCGGATAT
GACGCCACGA CGGGCGCTTA CATTCCCCGC TATAACTCGG ATCGCGGAGC CATGGCAATC
CGCACCAACG GCGGCGACTC GTACTATCAC GGCCTCGCCT TCACGGTGGA GCGCAAGTTC
AACAAGGGCC TCATGCTGCG CAGCGCGTAT ACCTTCTCGA AGTCGATCGA CGACAGTTCG
AACATTTTCG TGATCACGGG TGGATCGTCG TACGCGCAAA ACGTGTGGGA CCGCCAAGCC
GATCGTGGCT TGTCTGCTTT CAATGCATTC CAGCGCTGGG CGTTTACTTA CGTTTGGGAC
GTTCCGGGCT TTAAGTCCGA AAACAAGGCC CTCGACGTGC TTGGATACAT TTCACGTCAC
TGGCAGTGGA CCGGGACCAC CAGTCTGCAG TCTGGTTTGC CCGACACGAT CTATGTCGGC
TCACTCGACA GCACTGGCGA GGGTCACGGC TACAGCGGAC GTCCGGACGT GCTTAGCAGC
AGAGCGCCGA TGACGAACGT GGCGATCTCC GGCCAATACT CTTATTGCTG GCCGGGCGAC
GCCCAAACGG CACCCGCGTA CGACTGGGCT ACTTGCGCCC CGATCAGCCA GAGCGACCTG
AATGGATACC ACTGGTTTAT TCCATTCGGT CGTCCAGGAA ACGAAGGACG TAACAGCTAC
ATACTACCGG GACAGATCAA CTTCAACTTC GGCATCAATC GCAACATTCC GATCCCGAAG
CATGAGTCGC AGTTCCTGCA GCTCCGCGTC GAGATGTACA ACCCGTTCAA TCACCCGAAT
GAGTCGGCGA ACCCTGGCGG TTTCTGGACG ACGGACGTGA ACACGATCGC GCCCGACAAT
CCAAGCCACC TCTTCGATAA GTTCTGGGCA CGCCAGGGAG GTCGCAGTAT TCGCCTCTCG
GCCAAGTACC AGTTCTAA
 
Protein sequence
MYQKIGTRSV AIVSLMLLIL TFAVTGLGQT SKGIIGGTVT DNSGAVVVGA QITATNLDTG 
DSRTVQSGPT GAFRMEALNL GKYKVTVGYQ GFQTQTVTGL EVAGSVMTPL DIRLQIASGT
ATEITVSADT NQVQTENGEL SGSIGSKELS DLPISSLNPI QLALTQPGVI DNNGRGSTNG
QGFSVAGGRP QSNNFLIDGQ DNNDNSIQGQ AFQPQNPNAV QEVAIMTNSY SAEFGRGGSS
VTNVIFKSGT NQYHGTLSEL YSGSGLDAID AANGLAGMKD GGDCNRAPNY APCKGRYDTH
TFGFTVGGPI VKDKLFAFGS GLWNRTYGNE VSSTFTIPTA NGAAQLSSYG STNANLMLQY
LGNIRGGSNI QNVATGIASM PFVEMGDATR IVPEQSTDTQ WNVKVDYLPH QSDSITFHYL
HDRGYFSPDW FANSSSVLPN FETYQGGPSW ISGGSWTHTF SSNKVNEFRV SYGHLGFTFA
PTAGTTANPL YPLPYLSLSS PSTFPLLGTD SGFPQGRSHR TLQLQEAFSI TKGAHTVKMG
VDIAHISVTD DIPINSRGWI TFSAGGGYSA LGNFLDNYTG RSGQALDIQI GNPRVEPTLL
QSGYYVQDNW KIKSNLTLNL GLRYEYQTNP ENSLTYPAVK PIMGGEVAFP TVVKADQQYM
HFAPRIGFAY TPDFWPSLFG DGKTVIRGGY GIFYDALYTN ILDNTASSSP NSIDVPLYGR
NGGARGYASA TDLFNSLDPV VSPFNTVTSV SQRMTNPRTT QWNLDVQREL PWNLLATVAY
IGSRGQKLLV NDDYNPFGGY DATTGAYIPR YNSDRGAMAI RTNGGDSYYH GLAFTVERKF
NKGLMLRSAY TFSKSIDDSS NIFVITGGSS YAQNVWDRQA DRGLSAFNAF QRWAFTYVWD
VPGFKSENKA LDVLGYISRH WQWTGTTSLQ SGLPDTIYVG SLDSTGEGHG YSGRPDVLSS
RAPMTNVAIS GQYSYCWPGD AQTAPAYDWA TCAPISQSDL NGYHWFIPFG RPGNEGRNSY
ILPGQINFNF GINRNIPIPK HESQFLQLRV EMYNPFNHPN ESANPGGFWT TDVNTIAPDN
PSHLFDKFWA RQGGRSIRLS AKYQF