Gene Information |
Locus tag | Haur_2932 |
Symbol | |
ID | 5734804 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3707374 |
End bp | 3708408 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641280076 |
Product | NMT1/THI5-like domain-containing protein |
Protein accession | YP_001545698 |
Protein GI | 159899451 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.01782 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTAAGC TTCTGTCTCG CCGAGGAGTT AGGATGCGCC GGTTAAGCTA TCTCAGTGTG CTCATGCTGG TGTTACTGGC TGCTTGTAGT ACGAGCCAAG CAACCCCCAC CCCAGCCCCT AAGGATTCGG TCAAGCTTCA GTTGAACTGG GTTTTTGATT ATTCGTCGTC GGGCTTTTTT GCTGCTGAAA AGAATGGTCG TTTTGGCGAG CAGAATTTGA ATGTCGAGTT GATTGCAGGC GGTTTTGATG CTAACGGCTA TATTGATGGT ACTGAAAAAG TCAGTAGTGG GGCCGCTGAT TTTGGGGTAG CCAGCGCCGA TAGTGTGATT CAAGCTCGTG CTAATGGCAA ACCTGTGGTT GGGATTGCCG TGCTAACCCA AAATAGCCCA TTAGCGATTC TTTCCTTGCC TGGCGCTAAT ATTCGCACGC CCCAAGATTT GGTTGGCAAG AAAGTGCTGG TCTCGGAAGG CGGGGCAACC CAGCTTTACA ATACCTTGCT GACCGCCCAA GGCATCGATT TGGAGAGCGC CAAGCCCTTA CCACGCTTCG ATTCAGGTAT CGATCAGTTA ATTGATGGCG AAATTGATGC GTTGGTGGCT TGGAATATCA ACGAAGCAAT TGAATTAAGC GAGCGGGGCT ATCCACCCTC AATTATGTTG ATGAGCGATT ACGGCATCAA TAGCTATGAG TTGGTGATTA TTACTACCGA AAAAATGGCA ACTGAAAATC CCGATTTGGT CACCCGTTTC CTCAAAGCTA CCTTCAAAGG TTGGAATGAC GTAATTGCTA ATCCAAGCCA AGCGGTTGAT TATGTTGTGA CCTACGATGT TAAGCTCAAT CGCGATGCCC AACTACGGCG CTTAACTGAA ATGCTGAAGT TGATCAAGCC TGCGAACACC AAAATTGGCG ATATGCGACC CGATCTTTGG TCGTTTACCC ACCAAATGTT GCAAACCCAG GGCGCACTCA AAGAGCCAAT TGAGTTGGGT CGGGCCTATT CAACCTTGTT CTTAGATGTT ATTCCTGACC GCTAG
Protein sequence | MVKLLSRRGV RMRRLSYLSV LMLVLLAACS TSQATPTPAP KDSVKLQLNW VFDYSSSGFF AAEKNGRFGE QNLNVELIAG GFDANGYIDG TEKVSSGAAD FGVASADSVI QARANGKPVV GIAVLTQNSP LAILSLPGAN IRTPQDLVGK KVLVSEGGAT QLYNTLLTAQ GIDLESAKPL PRFDSGIDQL IDGEIDALVA WNINEAIELS ERGYPPSIML MSDYGINSYE LVIITTEKMA TENPDLVTRF LKATFKGWND VIANPSQAVD YVVTYDVKLN RDAQLRRLTE MLKLIKPANT KIGDMRPDLW SFTHQMLQTQ GALKEPIELG RAYSTLFLDV IPDR
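
The coordinate and length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (span_length, expected_protein_length, gc_percent) are hypothetical helpers introduced here for illustration.

```python
# Values copied from the record above.
START_BP = 3707374
END_BP = 3708408
GENE_LENGTH_BP = 1035
PROTEIN_LENGTH_AA = 344


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Both checks should hold if the assumptions above match the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Passing the Gene sequence field to gc_percent gives a value that can be compared with the GC content field, bearing in mind that the record reports a rounded percentage.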