Gene Acid345_1545 details


Gene Information

Locus tag: Acid345_1545
Symbol:
ID: 4072936
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1888252
End bp: 1891569
Gene Length: 3318 bp
Protein Length: 1105 aa
Translation table: 11
GC content: 59%
IMG OID: 637983554
Product: TonB-dependent receptor
Protein accession: YP_590621
Protein GI: 94968573
COG category:
COG ID:
TIGRFAM ID:
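
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical; the numbers are copied from the fields above.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1888252
end_bp = 1891569
gene_length_bp = 3318
protein_length_aa = 1105

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons,
# and the final codon is a stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 3318 1106 1105
```

Under these assumptions the reported gene length (3318 bp) and protein length (1105 aa) agree with the coordinates.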


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.222059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACCT TACGATCAGT TGGGATCGCT GTATTCCTGT TTTTCTTGTC TACGTTTGCG 
ATGGGACAGA GCTATCGCGG ATCGATACGC GGGGTGGTGA CAGACGCTAG CGGGGCGGTG
ATACCCAGTG CATCGGTGAC GGTAAAGAGC TCGGCCACTG GACTGGAGCG TAGTGCAGTC
ACCGACGGTG AAGGACTTTA TGTGATCGCT GAGCTGCCTG CCGGCGAATA TCGGCTCTCC
GTCCCCGTGA CGGGCTTCCG AACCTTCGCA CGCAATGTGT TGGTTGACGT CGGTCACGAC
AGTACCGTGG ATATCACAAT GATGGTCGCC GGTGGAGATA CGGTAGAGGT CAACGAGTCC
ACGGCTCCCC TTGTGGAAGA CACTCGCGAT GTTCTTGGCC AGATCGTGGA CAACAAGCTC
GTCGTCGAAC TGCCGCTGAA TGGCCGCGAC TTCGGCAAAC TCGTCGCGCT CACACCGGGC
GTGACGGTCG AAGGCTCCGG CGTGGCGGGA ACCGAGAAGG GCTTTGGCCA GTTCAACATC
AATGGCAACC GCGACCGCTC GAACAACTAC ATGCTTGACG GCACGGACAA CAACGATCCG
TTCTTCAACA ACTCCGCGTT GAACCAGGTG GGTATCACTG GCGCGCCGGC TTCCCTGCTA
CCGATTGACG CCATCCAGGA ATTCAACCTG CAAACGCAGT ACGGCGCGGA GTATGGACGC
AACTCCGGCG GTGCGGTGAA CGTGCTGACG AAGTCTGGTA CCAACGCGTT CCACGGCAGC
GTGTTTTATT TCCTGCGCAA CTCGGCACTC GACGCGCGCA ACTACTTCGA TCCCACGACG
AATCCTGACG GCAGTCCGAA CCCGAAGGGC GGCTTTAAGA ACAACCAGTA CGGCGCTTCG
ATCGGCGGCC CGATTGTGAA GGACAAAACG TTCTTCTTCG CCGCCTACGA AGGCCAGCGC
GAGCGCGTGA CATCGAGCTA CACGCTGTTT GTCCCGACGG AGATGCAGAA GGCCAACGCG
CGCGCGGCGG CACTGGCAGC GACGACTTCG GATGGCGAGT CAGAAGTGCC GGTGATCAAC
GCGATCAATC CGGGGATTGA CGCACTGCTC GGCTACTTCC CCGCCGCAAC GGGCTGCAGT
AATGGCGGCA CGCCGGCGGC CACCGGTTGC ATTGGAGGCG CCGGAACCGT GGCGGGCGCA
GTGGAAGACC GCAACGACCT CGACAACGGC ATTATTAAGG TAGATCACTA CTTCACGCAG
ACGGAGCAGT TCTCGGCACG CTACGCCATC AGCAATAGCG ACCAGGTCTT TCCGCTCGGC
GGGCTCGGCA CCTATGGCAA TGGATCGCGA CTGGCGGGAT TCGCACAGAC TTCGCCTACG
CGGGTGAATG TCGTCTCCGC AAGTTTGCTT TCAACCTTCA GTCCGACGTT CCTGAACGAA
CTGCGCTTCG GCTACTCGCG CTATAACACT TCGTTCAACA CGCTCGACGG CACGGTCGAT
CCGAACAGCG CCTTCGGACT GAACATGGGC ACGGGCAAGA CGGGCGTCCC GGAAATTGAC
TTCTTCGCGC TGTACGACAA TTTGGGCGCG TCGGCTTACA GCATTCCGCG CGGACGCACG
AGCCAGACCT ACCAGGTGCT CGACAACCTT ACGAAGATCC ACGGCGCGCA TACCTTCAAA
TTCGGTGGCG AGTTCCGCCG CGCGACGATC GAGAACTTCA ACGATAACCT CGAGCGCGGA
TTGCTCGCGC TGGATCCGTA CCAACTCACC AACGGCCCAT GGCCGGGCGA CGACCAAACG
GCGATGTTGA CGAATTTCTA CCTTGGCATT TTCGACTGGG GCACCGCGGC CAACACCGGC
AACACGCAGC GCAATACCTT TAACAACGGC TTCAGCTTCT TCGCGCAGGA TGATTGGCGC
GCGACTAAGA AGCTCACTTT GAATCTCGGC GTTCGCTGGG AATACTTTGG ACCGCTTGGC
GAGAGCAATG GGTTGATCTC GAACCTCGGC ACCGATGGTC TGCTGCACAT GACCGACCAG
CCATACAACA AAGACTGGAA CAACGTGGCG CCTCGCGTTG GGCTGGCGTG GAACGTGTTC
AGTGGCACCG TAGTTCGCAT GGGATATGGC GTGTACTTCG ACTACGTTCC GCAGAACAAC
ATGATCGCCA ACTACACCAA TACCGCCGGA CTGGTGACGA ACCCGATCGG GCCGAAGGCG
GTCACGTCGA TGGACTATAA CCAGTCGGCG TTCAACGGCA GCGATGCGGG CGCGGCGGTC
TTCACGCCCA GTACCGGCGC GCAGAGCATC TTCGCGGTAC CGCAGAACTT TGCTACGCCT
TACACGCAGA GCTGGAACGT GAATGTGGAG CAGGAACTCG GCAAAGCTGC CTCCATGCAA
ATTGGCTACG TGGGCAGCAA GGGTACGCGG CTGACGCGGC TGTACGACGC GAACCAGGAC
TACACCAATT CGAACTACAA CGCGATTGAT GTGCTGGCAA CGATCTCCGA TTCCACCTAC
AACGCGCTGC AGGCGACACT GACGGCACGC TCGTGGAAGG GGATTTCGGG ATTCGCAAAT
TACACTTGGG CGAAGTCGCT GGATGATGCG TCGGACGGCA TCGACTTCAA CTTCGCGTCG
GCGGCGTTCC CACAGAACTC GGATTGCCCT GTGGCGTGCG AGCATGGGCC CTCGACCTTT
GATACGCGGC ATCGCTTTAC TGGCTCGATG AATTATGCGG TGCCGCAGTG GAAGGCACTG
CCTCCGGTGC TCGGCAAAGG ATGGGAGTTG AATACCATTG CGACTTTCCA GTCCGGGCGA
CCGATTCCGA TTCTGACTTC GAACGACACC AGCGGAACCT ACAACTATCA CCAGCGGCCG
GATCGTGTGC CCGGCGTGAA CCCGGTACTC GACCACTGGA ATCCGGTGAC CGGCTACCTC
AACCCGCTCG CGTTCCAGCA ACCTGCGGAC GGAACTTTCG GCAATTTGCA GCGTAACTCG
ATCTACGGTC CGCACTATAC GAATGTGGAT TTCTCCATCA CGAAGAACAT GCCGATCACC
GAGAAGGTGA ACGTGCAGTT CCGCGCGGAG TTCTTCAACA TCTTTAACCA CCCGAACTTC
GCATTGCCGG GTGGCACTTT GAACCCTGCG TATTTGGCGG ATGGCACGCT TGATCCGTCG
GTCGTGGATC CGGCGAGCCA TGCGATCCTG ACGCCTGCGG GACAGGTAAC ACAGACGCCG
GATGTGGCGC AAGGTAACCC TGGCTTGGGC GGCGGCGGAC CGCGCGTGAT TCAGTTTGGG
CTGCGGTTCT CGTTCTAA
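
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing anything from it. The following is a minimal sketch of how the GC fraction could be computed from such text. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here, so its result is not expected to match the 59% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCACCT TACGATCAGT TGGGATCGCT GTATTCCTGT TTTTCTTGTC TACGTTTGCG"

seq = clean_sequence(first_line)
print(len(seq))
print(f"{gc_fraction(seq):.0%}")
```

Applying the same two functions to the full sequence text would give a value to compare with the reported GC content.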
 
Protein sequence
MSTLRSVGIA VFLFFLSTFA MGQSYRGSIR GVVTDASGAV IPSASVTVKS SATGLERSAV 
TDGEGLYVIA ELPAGEYRLS VPVTGFRTFA RNVLVDVGHD STVDITMMVA GGDTVEVNES
TAPLVEDTRD VLGQIVDNKL VVELPLNGRD FGKLVALTPG VTVEGSGVAG TEKGFGQFNI
NGNRDRSNNY MLDGTDNNDP FFNNSALNQV GITGAPASLL PIDAIQEFNL QTQYGAEYGR
NSGGAVNVLT KSGTNAFHGS VFYFLRNSAL DARNYFDPTT NPDGSPNPKG GFKNNQYGAS
IGGPIVKDKT FFFAAYEGQR ERVTSSYTLF VPTEMQKANA RAAALAATTS DGESEVPVIN
AINPGIDALL GYFPAATGCS NGGTPAATGC IGGAGTVAGA VEDRNDLDNG IIKVDHYFTQ
TEQFSARYAI SNSDQVFPLG GLGTYGNGSR LAGFAQTSPT RVNVVSASLL STFSPTFLNE
LRFGYSRYNT SFNTLDGTVD PNSAFGLNMG TGKTGVPEID FFALYDNLGA SAYSIPRGRT
SQTYQVLDNL TKIHGAHTFK FGGEFRRATI ENFNDNLERG LLALDPYQLT NGPWPGDDQT
AMLTNFYLGI FDWGTAANTG NTQRNTFNNG FSFFAQDDWR ATKKLTLNLG VRWEYFGPLG
ESNGLISNLG TDGLLHMTDQ PYNKDWNNVA PRVGLAWNVF SGTVVRMGYG VYFDYVPQNN
MIANYTNTAG LVTNPIGPKA VTSMDYNQSA FNGSDAGAAV FTPSTGAQSI FAVPQNFATP
YTQSWNVNVE QELGKAASMQ IGYVGSKGTR LTRLYDANQD YTNSNYNAID VLATISDSTY
NALQATLTAR SWKGISGFAN YTWAKSLDDA SDGIDFNFAS AAFPQNSDCP VACEHGPSTF
DTRHRFTGSM NYAVPQWKAL PPVLGKGWEL NTIATFQSGR PIPILTSNDT SGTYNYHQRP
DRVPGVNPVL DHWNPVTGYL NPLAFQQPAD GTFGNLQRNS IYGPHYTNVD FSITKNMPIT
EKVNVQFRAE FFNIFNHPNF ALPGGTLNPA YLADGTLDPS VVDPASHAIL TPAGQVTQTP
DVAQGNPGLG GGGPRVIQFG LRFSF
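
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own procedure. It builds a codon table from the amino acid string used by NCBI translation table 11 (the table listed for this gene), which assigns the same amino acids as the standard code, and translates the first and last lines of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order (TTT, TTC, TTA, ...).
# "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGAGCACCT TACGATCAGT TGGGATCGCT GTATTCCTGT TTTTCTTGTC TACGTTTGCG"
last_line = "CTGCGGTTCT CGTTCTAA"

print(translate(first_line))  # prints: MSTLRSVGIAVFLFFLSTFA
print(translate(last_line))   # prints: LRFSF
```

Both results match the corresponding ends of the protein sequence above: it begins with MSTLRSVGIA VFLFFLSTFA and ends with LRFSF, followed by the TAA stop codon in the gene.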