Gene Acid345_0663 details


Gene Information

Locus tag: Acid345_0663
Symbol:
ID: 4069755
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 817417
End bp: 820947
Gene Length: 3531 bp
Protein Length: 1176 aa
Translation table: 11
GC content: 57%
IMG OID: 637982669
Product: TonB-dependent receptor
Protein accession: YP_589742
Protein GI: 94967694
COG category:
COG ID:
TIGRFAM ID:
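
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(817417, 820947)
print(length)                  # 3531
print(protein_length(length))  # 1176
```

Both values agree with the Gene Length and Protein Length fields reported in the record.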


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00550591
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.789281
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTCG GTCTAAAAAG CTTTGGACGG TTCAGTCTGG CGGTATTTGT TTTCCTGGCA 
ACAATGAGTT TGTTCGCGCA GAAGGACGCC GGCACAATTG CCGGTGTGGT GCGCGATCCA
TCGGGCGCCG TTGTCGCCGG TGCGCAAGTT TTGGTACGCG ACATCGATCG GGGGGGCGAA
AGCAAGCTCA CCACCAATGC AAATGGCGAA TACGTCGCCA GCCCGCTGCG AATTGGTCAC
TACACGGTCG AGGTAAACCA TCCGGGTTTC CGCGGCGTGA AGGCGGGGCC GATAGAGCTG
CAAGTCCAGC AGCGCGCCGT CCTCGACCTG CAACTTCAAG TCGGTGATGT CACCGAGAAA
GTCGAAGTCG TCGCCGCGGC TCCACGTCTT GAGACTGAAA CCTCAGAACT CGGGCAAGTT
GTGAGTCAGC GTCAAGTCTC GCAATTGCCT CTTAACGGAC GCAACTTCGC TCAGCTTGCA
CAATTGAGCG CTGGCGTCGC GCCCAGCGAA CCCGGCTCCC GCGACGAGGG CGGCTACGGA
TTCAGTTCCA ACGGCGCGCG TTCACTGCAG AACAACTTCC TGCTCGATGG CGTGGACAAT
AACTCGAACC TCCCCGACCT GTTGAATGAA ACCAACTTCG TCATTCAGCC ACCGATTGAC
GCGTTGCAAG AATTCAAGGT CCAGACCAGC GCTTATAGTG CGGAATTCGG TCGCGGCAAC
GGCGCGATCA TCAATGCCGT CATCAAGTCA GGTACCAACC AGTTCCATGG TGGGGCATGG
GAGTTCTTCC GTAATGAAGC GCTCGATGCG CGCTACTACT ACGACACGGA TCGTCAACCG
TATAAGCAAA ACCAGTATGG CGTCATGCTC GGCGGTCCGA TCATCAAGGA CCGGACCTTC
TTCTTCGCCG ATTTCGAAGG CCTCCGCCTC AACCAGGCCC AGCCGCAGAC TGCCCTCGTT
CCAACTCAGG ACATGCGCAA CGGCGACTTC TCGTCGTTTC TCGATACCTC TACCCAGGTC
TACGGAACCA ACGCCGCTGG GAATCTCGCG CCCATTCTCG ATTGCAACGG CATGCCGACC
TACTCGGGCG AAATCTTCAA TACGCGCCTG ACGCAAGCCT ATGCCGGCAA TCCCACCGGC
ATCTGCGGTA TGCCCTTCGG CTATTCAAAC GGATTGCCGG TGAACATCAT GCCTGGGGGA
GTGATCGATC CACTGGCGCA GCGCCTCTCG GCGCTCTACC CGCTGCCGAA CAGCAATAAC
AATGCCGGCT TTAATTACAT CGCCGACCCA GTACAAACTA CGCACCGTGC CAATTTTGAT
GTTCGCATTG ACCATAAGTT CTCCGAGAGG AACAACATCT TTGGCCGCTT CAGCTATGAA
GACCAGCCGA GCTTCTTCCC GCCGACCTTC AGCACTGGCG GCGACGGCGG CGGTTTCTTC
AGCGGCATCG AAGACAATGC TTATAAGAGC GTCGCGATCA GTGACATTCA TACCTTCTCG
CCGACGTTTA TCAACGAATT TCGCCTCGGT TACAACCGCA TCAATTCGCA CCGCTTCCAG
CAGAACTACA ACGTGGACGT CTCCGGTGCG ATTGGTTTCC CAGGCGTTCC GTTCACGCCT
ATCAACGGCG GTCTTCCGCA ACTGACCTTC AGTGATGTCT CGACACTGGG CAGCCCAACG
TTTCTACCTT CCGTCGAGTT GCAGAACACC TACGTGCTTG ATGAAAACGT TACATGGGTG
AAAGGCCGTC ACACCTGGAA GTTCGGAACG GAAATCCGCA AGGAAGAGTT CACCATCAAT
CAGCCCGCGG AATCGCGCGG CACGCTGAAT TTCGGCAATG ACTTCACCAG CAACCCCGGT
GCACAGGACG CGCAGGACCT CAGCGGCAAC GCGCTCGGCA GCGGTTCGGG CTACGCATCG
TTCCTGCTCG GCGCCACCGA TGGCGGCGGA ATCAACAACA TCCACAACGT GGATTACCAC
CGACCCATCT ACTCGTTCTT CGCGCAAGAT GATTGGAAAG TGAACGGACG ACTGACCGTG
AACCTTGGTC TGCGTTACGA ATTGTTCACG ACCGTAAAAT CGCGCCACAA TGAGCAGGGC
ACCTTCGATC TTGCCACCGC GACACTGATT CTTCCCAAGG GCCAGACGGC GCAGCTTACG
CCGTATCTCT CCACAATCAT TCCGGTTTCG GCCACGGGAA GCGAAGGTCT GATCAAGCCG
GACCTCAATA ACTTCGCACC GCGCATCGGC TTTGCGTTCC TCGTTGATCA GAACCTTGTA
CTGCGCGCGG GATACGGCAT TTTCTACGGT GGCCAGGAGA ACGGCCCGTA TTCCAATCCG
AGCCCCGGCT TCAACCCGCC GTATTTCGTG ACGCAATCTT TCAACACGCC GTGCGGATTG
GCTTCATTGA ATCCGAATTT AGCGGGGAGC GGTCAATACT GCGGCATAGA TGGCTTGGAG
TTCCTGCAGA ACGGGTTCCC GGCATCCGCA CTGACGGACC CGAACACGCC GATCCTGTAT
TCGGTGGATC CGGCACTGCG CACGCCATAC ATGCAGAATT GGCACATCGG GTTCCAGCAG
CAGATAGGAG CCAACACAGT ATTTGAGCTT GGTTACGCGG GTTCACGTGG ACTGAAGCTT
TTCACGTTCC TCAACGGCAA TCAAGCGACG CCGACAGCAG ATCCAAATAG TCCTTTTGCC
GATCGCCGTC CCATCCCGCA GATTGACTCC TCCGTCCAAT GGTTCCGTTC CGGTGGACAG
TCTAACTACA ACTCGCTACA AGCAAGTCTC GAACGCCACT TCGCAAACGG TTTCACCTAT
CACATCAATT ACACGTGGGG CCACTCGCTG GATACGGCGT CGAACGCGAA CCTCGGCGCA
CAGAATGGCG GCGACTTCCG CGACATGCGC TTCCCCAACG CGGAGTATGG TAACTCCGAT
TTCGATGTCC GCCACCACGC TGTATTCAGC GCACTGTATG ACCTACCCTT CGGCATCGGA
CGCAAGTATG CGACTGATAT TTCGAAGCCG CTCGACTACG TGATTGGCGG CTGGCAGGTT
GGCGGGATCG CCAGCTTCTC CACCGGCAAC TGGTACACAG TGACCAGCAA CGCCGGCGTC
TCCAATGCCG ATGGCGGCGG AAACGTCGGT TCTTCCGACC GTCCTGACCA GATCGGGAAT
CCCAACGCTG CGCCTTGCCA GCCGGGCACG TGGTTCAACA CCTGCGCGTT TACCGTCGCG
ACCCCGGGAA CGTTCGGCGA CGTGGGCCGC AATACGATTC AAGGTCCGGG GTACGAGATT
GTTGATTTCT CGCTCTATAA GGATTTCGCA GTGACCGAAC GCAGCCACTT CGAATTCCGC
GCCGAGATGT TCAATTCTCT CAATCACTAC AATCCGTTGT TCGCGAAGTC CGGTCCGCAG
AACGGGAACA ACGCAACCGT CTACGACCCA TCAAATCCCG GCCTGTTTGG CGTGATCACG
GCAGCTCGCT CGCCGCGGCA AATCCAGTTG GCGTTGAAGT TCTTGTTCTA G
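
The gene sequence is printed in blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation; the helper names are hypothetical, and only the first line of the sequence is used here to keep the example short. Pasting all lines would be needed to compare against the GC content field reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste every line for the full gene.
first_line = "ATGAAGGTCG GTCTAAAAAG CTTTGGACGG TTCAGTCTGG CGGTATTTGT TTTCCTGGCA"
seq = clean_sequence(first_line)
print(len(seq))                     # 60
print(seq[:3])                      # ATG
print(round(gc_fraction(seq), 3))   # GC fraction of this one line only
```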
 
Protein sequence
MKVGLKSFGR FSLAVFVFLA TMSLFAQKDA GTIAGVVRDP SGAVVAGAQV LVRDIDRGGE 
SKLTTNANGE YVASPLRIGH YTVEVNHPGF RGVKAGPIEL QVQQRAVLDL QLQVGDVTEK
VEVVAAAPRL ETETSELGQV VSQRQVSQLP LNGRNFAQLA QLSAGVAPSE PGSRDEGGYG
FSSNGARSLQ NNFLLDGVDN NSNLPDLLNE TNFVIQPPID ALQEFKVQTS AYSAEFGRGN
GAIINAVIKS GTNQFHGGAW EFFRNEALDA RYYYDTDRQP YKQNQYGVML GGPIIKDRTF
FFADFEGLRL NQAQPQTALV PTQDMRNGDF SSFLDTSTQV YGTNAAGNLA PILDCNGMPT
YSGEIFNTRL TQAYAGNPTG ICGMPFGYSN GLPVNIMPGG VIDPLAQRLS ALYPLPNSNN
NAGFNYIADP VQTTHRANFD VRIDHKFSER NNIFGRFSYE DQPSFFPPTF STGGDGGGFF
SGIEDNAYKS VAISDIHTFS PTFINEFRLG YNRINSHRFQ QNYNVDVSGA IGFPGVPFTP
INGGLPQLTF SDVSTLGSPT FLPSVELQNT YVLDENVTWV KGRHTWKFGT EIRKEEFTIN
QPAESRGTLN FGNDFTSNPG AQDAQDLSGN ALGSGSGYAS FLLGATDGGG INNIHNVDYH
RPIYSFFAQD DWKVNGRLTV NLGLRYELFT TVKSRHNEQG TFDLATATLI LPKGQTAQLT
PYLSTIIPVS ATGSEGLIKP DLNNFAPRIG FAFLVDQNLV LRAGYGIFYG GQENGPYSNP
SPGFNPPYFV TQSFNTPCGL ASLNPNLAGS GQYCGIDGLE FLQNGFPASA LTDPNTPILY
SVDPALRTPY MQNWHIGFQQ QIGANTVFEL GYAGSRGLKL FTFLNGNQAT PTADPNSPFA
DRRPIPQIDS SVQWFRSGGQ SNYNSLQASL ERHFANGFTY HINYTWGHSL DTASNANLGA
QNGGDFRDMR FPNAEYGNSD FDVRHHAVFS ALYDLPFGIG RKYATDISKP LDYVIGGWQV
GGIASFSTGN WYTVTSNAGV SNADGGGNVG SSDRPDQIGN PNAAPCQPGT WFNTCAFTVA
TPGTFGDVGR NTIQGPGYEI VDFSLYKDFA VTERSHFEFR AEMFNSLNHY NPLFAKSGPQ
NGNNATVYDP SNPGLFGVIT AARSPRQIQL ALKFLF
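
As a consistency check between the two sequences, the start and end of the gene can be translated and compared with the protein. This is a sketch under stated assumptions: the codon table is built by hand from the standard amino-acid assignments, which translation table 11 shares (table 11 differs only in which codons may act as starts), and the `translate` helper is hypothetical. Only the first 30 and last 21 bases of the gene are used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, trailing partial codons dropped."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


gene_head = "ATGAAGGTCG GTCTAAAAAG CTTTGGACGG"  # first 30 bases of the gene
gene_tail = "GCGTTGAAGT TCTTGTTCTA G"            # last 21 bases of the gene

print(translate(gene_head))  # MKVGLKSFGR
print(translate(gene_tail))  # ALKFLF*
```

The output matches the first ten residues (MKVGLKSFGR) and the last six residues (ALKFLF) of the protein sequence, followed by the stop codon. Ambiguous bases such as N are not handled and would raise a `KeyError`.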