Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4555 |
Symbol | |
ID | 4071500 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5400219 |
End bp | 5401328 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637986595 |
Product | TonB-like protein |
Protein accession | YP_593629 |
Protein GI | 94971581 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.248985 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0574394 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCATAC TCCAGATCAC GCCTTCCCGA GAAGAAGAAC AGGACAAGCA CTTCGAGGAT GTTGCACGCG CTAACGAAGG TAAAGCCAAC TTCGTGGAAG AGGCGTTCAC TCCCGTTGTG CTCATGGACT TGCGCGACGA GCTCACTCGC TCGCGCCTCC GGGAAGCCGC CTGGATCTCG ATCATCGCCC ACCTCGTCGC GATCATCTTT CTTAGTCTTA GCCCCAAGTG GATGCCCAAT CTCTGGGGAC ATCCCGTGAA GGTCGTGGAA GACCGGTTGC GCGACAAGGA CACCACGTTC CTCGCGCTTC CTCCCGACGC GCAAAAGCTG GTACAGAAGC CACACACCAA CGTTCTCTCC GATAAAGACC GCGTCGCGAC TTCGCACAAT CCTGATCCGA AAGAGTTGAA GAAACTCCTC GATCAGCGGC AGCCAGGCCC ACCCGCGCAT CCAGCAGCGC AGCCCAGCGT TCCCGCGCCG CCGCAAATGG CCCAACAACA GCAGCAGTCG CCGCAGCAGC AAAACCCTGC TACGCAGCAG GGACAGCAAA CGGCGATGAA CAATCCGCCG CAGTTCGAGA GCCCAAACAT GCAGCCCAAG ATGACGCTGC CCAAGGCGCA GCCCAGCTTC GGCGCCGTCG CGATGTCGGC CGGCTCAGCG ATCCAGCAGG CGGCGCGCGC ATCCTCCGGA TCAGCCGGTA AGCTCGCCGT TGGCGGTGGT ATGGGACTCG GCCGCGGCCC CACCGGCGGG CAAGTTCGCG ACGCGATGGA GATCACGACC GATACCCAGG GCGTGGACTT CGGCCCCTAT CTCGCACGCA TCAAGCAGAC CATCGAAGCC AACTGGTACA CCGCAATGCC GGAATCGGTT TATCCGCCAC TGCGCAAGAG CGGCAAGGTC GCCGTCGAAT TCGTAATTCT CCCCGACGGC AAAGTACAGG GCATGCGCAT CTTCTTCCCG TCAGGCGACG TCGCACTCGA TCGCGCGGCG TGGGGCGGCA TCTCAGCCTC GAATCCATTC CCGCCACTGC CCAAAGAATT CCACGGACCG TACCTCGGCC TCCGCTGCTA CTTCCTCTAC AACCCGACGA CAAAAGACCT CGAGCAATAG
|
Protein sequence | MAILQITPSR EEEQDKHFED VARANEGKAN FVEEAFTPVV LMDLRDELTR SRLREAAWIS IIAHLVAIIF LSLSPKWMPN LWGHPVKVVE DRLRDKDTTF LALPPDAQKL VQKPHTNVLS DKDRVATSHN PDPKELKKLL DQRQPGPPAH PAAQPSVPAP PQMAQQQQQS PQQQNPATQQ GQQTAMNNPP QFESPNMQPK MTLPKAQPSF GAVAMSAGSA IQQAARASSG SAGKLAVGGG MGLGRGPTGG QVRDAMEITT DTQGVDFGPY LARIKQTIEA NWYTAMPESV YPPLRKSGKV AVEFVILPDG KVQGMRIFFP SGDVALDRAA WGGISASNPF PPLPKEFHGP YLGLRCYFLY NPTTKDLEQ
|
| |