Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2216 |
Symbol | |
ID | 4709534 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2430212 |
End bp | 2431162 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639856691 |
Product | TonB domain-containing protein |
Protein accession | YP_001003782 |
Protein GI | 121998995 |
COG category | [S] Function unknown |
COG ID | [COG5373] Predicted membrane protein |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain [TIGR02794] TolA protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTGGC TGAGAGAGCG AGCCCCGAAG CTGCCGAGCC GGCTGGCGAG CAGTCCGTCG CGTTTTGTGA TCTACTCCAC CCTGGGCCAC GTGGCCGTGT TCACCTTGGT GGGCGCCAAT TTCGCGACCT GCACCCGAAC ACCGGAGCCC CCCGAGTTCG AAGGCCCCAT CATCGAGGCC ACGACGGTCG ACAGTGCCGC GGTGGAGGCC GAGATGGAAC GGCGCCAGGA GCCAGAGCCC GAACCGGAGC CAGAGCCCGA GCCGGAACCC GAGCCGGAAC CGGAGCCGGA GCCGGAACCC GAGCCGGAGC CCGAGCCCGA GCCAGAGCCA GAGCCCGAAC CGGATCCGGA GCCGGAACCC GAGCCCGAAC CGGAGCCGGA ACCCGAGCCG GAACCCGAGC CCGAACCGGA GCCGGAGCCA GAGCCAGAGC CAGAGCCGGA ACCGGAGCCG GAGCCAGAGC CCGAGCCCGA GCCCGAGCCC GAGCCCGAGC CCGAGCCCGA GCCGGACCCT GCCGAACAAC GGGCCGAGGA GCAAGACCGA CTGCGCGAGG AGATTCGCCA GCGCATGGAG GAAGAGCGCG AAGCGCAACG CCAGCAAGAA CTGGAGGCGG AGCGCCAACG CCAGGAGGCG GCCCAGCAGG AGGCTGAGGA ACAGCGCCGC CTGCAGGGGC AGCAACAGCG TTACACCGCG GCGATCGCCG AGGCCGTCGA GCGCAACTGG CGACGCCCCA CCGGCACGCC CGAAGGCCTT GAGGCCGTGG TTCGGGTCTC GTTCTCGAGC AGCGGGGACG TGCGGCGGGT CGAGGTCGTC AGCGGCAGCG GCGATTCGGG CTTTGATCGC TCCGTCGAGC GTGCCGTGCA GGCCGCCTCG CCGGTCCCCT TCCCGGACGA GGCCGCACTG CAGGAGCGCA TGCAGACCCT AACCTTCCGC TTCGCACCGG ACCGGAGTTG A
|
Protein sequence | MRWLRERAPK LPSRLASSPS RFVIYSTLGH VAVFTLVGAN FATCTRTPEP PEFEGPIIEA TTVDSAAVEA EMERRQEPEP EPEPEPEPEP EPEPEPEPEP EPEPEPEPEP EPEPDPEPEP EPEPEPEPEP EPEPEPEPEP EPEPEPEPEP EPEPEPEPEP EPEPEPEPDP AEQRAEEQDR LREEIRQRME EEREAQRQQE LEAERQRQEA AQQEAEEQRR LQGQQQRYTA AIAEAVERNW RRPTGTPEGL EAVVRVSFSS SGDVRRVEVV SGSGDSGFDR SVERAVQAAS PVPFPDEAAL QERMQTLTFR FAPDRS
|
| |