Gene Information |
Locus tag | Acid345_1545 |
Symbol | |
ID | 4072936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1888252 |
End bp | 1891569 |
Gene Length | 3318 bp |
Protein Length | 1105 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637983554 |
Product | TonB-dependent receptor |
Protein accession | YP_590621 |
Protein GI | 94968573 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
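The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the source record. It assumes the start and end coordinates are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 1888252
END_BP = 1891569
REPORTED_GENE_LENGTH_BP = 3318
REPORTED_PROTEIN_LENGTH_AA = 1105


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of the interval, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_gene_length == REPORTED_GENE_LENGTH_BP)
print(computed_protein_length == REPORTED_PROTEIN_LENGTH_AA)
```

Under these assumptions the computed values agree with the reported Gene Length (3318 bp) and Protein Length (1105 aa). The strand field does not affect either length.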
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.222059 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACCT TACGATCAGT TGGGATCGCT GTATTCCTGT TTTTCTTGTC TACGTTTGCG ATGGGACAGA GCTATCGCGG ATCGATACGC GGGGTGGTGA CAGACGCTAG CGGGGCGGTG ATACCCAGTG CATCGGTGAC GGTAAAGAGC TCGGCCACTG GACTGGAGCG TAGTGCAGTC ACCGACGGTG AAGGACTTTA TGTGATCGCT GAGCTGCCTG CCGGCGAATA TCGGCTCTCC GTCCCCGTGA CGGGCTTCCG AACCTTCGCA CGCAATGTGT TGGTTGACGT CGGTCACGAC AGTACCGTGG ATATCACAAT GATGGTCGCC GGTGGAGATA CGGTAGAGGT CAACGAGTCC ACGGCTCCCC TTGTGGAAGA CACTCGCGAT GTTCTTGGCC AGATCGTGGA CAACAAGCTC GTCGTCGAAC TGCCGCTGAA TGGCCGCGAC TTCGGCAAAC TCGTCGCGCT CACACCGGGC GTGACGGTCG AAGGCTCCGG CGTGGCGGGA ACCGAGAAGG GCTTTGGCCA GTTCAACATC AATGGCAACC GCGACCGCTC GAACAACTAC ATGCTTGACG GCACGGACAA CAACGATCCG TTCTTCAACA ACTCCGCGTT GAACCAGGTG GGTATCACTG GCGCGCCGGC TTCCCTGCTA CCGATTGACG CCATCCAGGA ATTCAACCTG CAAACGCAGT ACGGCGCGGA GTATGGACGC AACTCCGGCG GTGCGGTGAA CGTGCTGACG AAGTCTGGTA CCAACGCGTT CCACGGCAGC GTGTTTTATT TCCTGCGCAA CTCGGCACTC GACGCGCGCA ACTACTTCGA TCCCACGACG AATCCTGACG GCAGTCCGAA CCCGAAGGGC GGCTTTAAGA ACAACCAGTA CGGCGCTTCG ATCGGCGGCC CGATTGTGAA GGACAAAACG TTCTTCTTCG CCGCCTACGA AGGCCAGCGC GAGCGCGTGA CATCGAGCTA CACGCTGTTT GTCCCGACGG AGATGCAGAA GGCCAACGCG CGCGCGGCGG CACTGGCAGC GACGACTTCG GATGGCGAGT CAGAAGTGCC GGTGATCAAC GCGATCAATC CGGGGATTGA CGCACTGCTC GGCTACTTCC CCGCCGCAAC GGGCTGCAGT AATGGCGGCA CGCCGGCGGC CACCGGTTGC ATTGGAGGCG CCGGAACCGT GGCGGGCGCA GTGGAAGACC GCAACGACCT CGACAACGGC ATTATTAAGG TAGATCACTA CTTCACGCAG ACGGAGCAGT TCTCGGCACG CTACGCCATC AGCAATAGCG ACCAGGTCTT TCCGCTCGGC GGGCTCGGCA CCTATGGCAA TGGATCGCGA CTGGCGGGAT TCGCACAGAC TTCGCCTACG CGGGTGAATG TCGTCTCCGC AAGTTTGCTT TCAACCTTCA GTCCGACGTT CCTGAACGAA CTGCGCTTCG GCTACTCGCG CTATAACACT TCGTTCAACA CGCTCGACGG CACGGTCGAT CCGAACAGCG CCTTCGGACT GAACATGGGC ACGGGCAAGA CGGGCGTCCC GGAAATTGAC TTCTTCGCGC TGTACGACAA TTTGGGCGCG TCGGCTTACA GCATTCCGCG CGGACGCACG AGCCAGACCT ACCAGGTGCT CGACAACCTT ACGAAGATCC ACGGCGCGCA TACCTTCAAA TTCGGTGGCG AGTTCCGCCG CGCGACGATC GAGAACTTCA ACGATAACCT CGAGCGCGGA TTGCTCGCGC TGGATCCGTA CCAACTCACC AACGGCCCAT GGCCGGGCGA CGACCAAACG 
GCGATGTTGA CGAATTTCTA CCTTGGCATT TTCGACTGGG GCACCGCGGC CAACACCGGC AACACGCAGC GCAATACCTT TAACAACGGC TTCAGCTTCT TCGCGCAGGA TGATTGGCGC GCGACTAAGA AGCTCACTTT GAATCTCGGC GTTCGCTGGG AATACTTTGG ACCGCTTGGC GAGAGCAATG GGTTGATCTC GAACCTCGGC ACCGATGGTC TGCTGCACAT GACCGACCAG CCATACAACA AAGACTGGAA CAACGTGGCG CCTCGCGTTG GGCTGGCGTG GAACGTGTTC AGTGGCACCG TAGTTCGCAT GGGATATGGC GTGTACTTCG ACTACGTTCC GCAGAACAAC ATGATCGCCA ACTACACCAA TACCGCCGGA CTGGTGACGA ACCCGATCGG GCCGAAGGCG GTCACGTCGA TGGACTATAA CCAGTCGGCG TTCAACGGCA GCGATGCGGG CGCGGCGGTC TTCACGCCCA GTACCGGCGC GCAGAGCATC TTCGCGGTAC CGCAGAACTT TGCTACGCCT TACACGCAGA GCTGGAACGT GAATGTGGAG CAGGAACTCG GCAAAGCTGC CTCCATGCAA ATTGGCTACG TGGGCAGCAA GGGTACGCGG CTGACGCGGC TGTACGACGC GAACCAGGAC TACACCAATT CGAACTACAA CGCGATTGAT GTGCTGGCAA CGATCTCCGA TTCCACCTAC AACGCGCTGC AGGCGACACT GACGGCACGC TCGTGGAAGG GGATTTCGGG ATTCGCAAAT TACACTTGGG CGAAGTCGCT GGATGATGCG TCGGACGGCA TCGACTTCAA CTTCGCGTCG GCGGCGTTCC CACAGAACTC GGATTGCCCT GTGGCGTGCG AGCATGGGCC CTCGACCTTT GATACGCGGC ATCGCTTTAC TGGCTCGATG AATTATGCGG TGCCGCAGTG GAAGGCACTG CCTCCGGTGC TCGGCAAAGG ATGGGAGTTG AATACCATTG CGACTTTCCA GTCCGGGCGA CCGATTCCGA TTCTGACTTC GAACGACACC AGCGGAACCT ACAACTATCA CCAGCGGCCG GATCGTGTGC CCGGCGTGAA CCCGGTACTC GACCACTGGA ATCCGGTGAC CGGCTACCTC AACCCGCTCG CGTTCCAGCA ACCTGCGGAC GGAACTTTCG GCAATTTGCA GCGTAACTCG ATCTACGGTC CGCACTATAC GAATGTGGAT TTCTCCATCA CGAAGAACAT GCCGATCACC GAGAAGGTGA ACGTGCAGTT CCGCGCGGAG TTCTTCAACA TCTTTAACCA CCCGAACTTC GCATTGCCGG GTGGCACTTT GAACCCTGCG TATTTGGCGG ATGGCACGCT TGATCCGTCG GTCGTGGATC CGGCGAGCCA TGCGATCCTG ACGCCTGCGG GACAGGTAAC ACAGACGCCG GATGTGGCGC AAGGTAACCC TGGCTTGGGC GGCGGCGGAC CGCGCGTGAT TCAGTTTGGG CTGCGGTTCT CGTTCTAA
|
Protein sequence | MSTLRSVGIA VFLFFLSTFA MGQSYRGSIR GVVTDASGAV IPSASVTVKS SATGLERSAV TDGEGLYVIA ELPAGEYRLS VPVTGFRTFA RNVLVDVGHD STVDITMMVA GGDTVEVNES TAPLVEDTRD VLGQIVDNKL VVELPLNGRD FGKLVALTPG VTVEGSGVAG TEKGFGQFNI NGNRDRSNNY MLDGTDNNDP FFNNSALNQV GITGAPASLL PIDAIQEFNL QTQYGAEYGR NSGGAVNVLT KSGTNAFHGS VFYFLRNSAL DARNYFDPTT NPDGSPNPKG GFKNNQYGAS IGGPIVKDKT FFFAAYEGQR ERVTSSYTLF VPTEMQKANA RAAALAATTS DGESEVPVIN AINPGIDALL GYFPAATGCS NGGTPAATGC IGGAGTVAGA VEDRNDLDNG IIKVDHYFTQ TEQFSARYAI SNSDQVFPLG GLGTYGNGSR LAGFAQTSPT RVNVVSASLL STFSPTFLNE LRFGYSRYNT SFNTLDGTVD PNSAFGLNMG TGKTGVPEID FFALYDNLGA SAYSIPRGRT SQTYQVLDNL TKIHGAHTFK FGGEFRRATI ENFNDNLERG LLALDPYQLT NGPWPGDDQT AMLTNFYLGI FDWGTAANTG NTQRNTFNNG FSFFAQDDWR ATKKLTLNLG VRWEYFGPLG ESNGLISNLG TDGLLHMTDQ PYNKDWNNVA PRVGLAWNVF SGTVVRMGYG VYFDYVPQNN MIANYTNTAG LVTNPIGPKA VTSMDYNQSA FNGSDAGAAV FTPSTGAQSI FAVPQNFATP YTQSWNVNVE QELGKAASMQ IGYVGSKGTR LTRLYDANQD YTNSNYNAID VLATISDSTY NALQATLTAR SWKGISGFAN YTWAKSLDDA SDGIDFNFAS AAFPQNSDCP VACEHGPSTF DTRHRFTGSM NYAVPQWKAL PPVLGKGWEL NTIATFQSGR PIPILTSNDT SGTYNYHQRP DRVPGVNPVL DHWNPVTGYL NPLAFQQPAD GTFGNLQRNS IYGPHYTNVD FSITKNMPIT EKVNVQFRAE FFNIFNHPNF ALPGGTLNPA YLADGTLDPS VVDPASHAIL TPAGQVTQTP DVAQGNPGLG GGGPRVIQFG LRFSF