Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5201 |
Symbol | |
ID | 5737159 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | + |
Start bp | 290002 |
End bp | 291039 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641282365 |
Product | NMT1/THI5-like domain-containing protein |
Protein accession | YP_001547956 |
Protein GI | 159901710 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTATTC CACGTCTCGC GCTCCGCGCC ATCGGTGTCC GCTGCTTTCG GTCGATGCGA CTGCTCATGC TCGTGTTCCT TACGGCCTGT AGTACCGCCC AACCGAGTCC GACCCCTGCC CCGATGGATA CGGTGACCAT CCAACTCAAT TGGGTTAATG ATTATTCCTC TGCTGGCTTT TTCGCGGCGG AAAAGAACGG ACGCTTCGCC GACCAACGCG TCCAAGTGAC CTTGCGTGAG GGCGGCTTTG ATGCCAATGG CTATATTGAT GGAACGGAAC AAGTCAGCAG TGGCGCTGCC GATTTTGGGG TGGCCAGTGC CGATAGTATC ATTCAGGCGC GGGCACAGGG GAAGCCCATT GTGGGCATTG CGGTGTTGGC GCAGGATAGT CCCCTCGCCA TTCTCTCATT GCCGCAGACC GCCATCCGTG ACCCCCACGA TTTGGTCGGA AAAAAGGTGT TGGTGGCCGA AGGCGGCGCA ACCCAACTGT ATACGACGTT GTTGGCATCC CAACAGATTG CGCTCACCCA AGCACCGCCC ATCCCACGCA CCGATTCCGG GATTGACCAG TTAATTGCGG GAAAGATTGA TGCCTTGGTG GCGTGGAATG TCAACGAAGC GATTGAATTA AGTGAACTCG GCTACCCACC ATCGGTCATG TTGTTCAGTG ATTATGGGAT CAATAGCTAT GAGTTGGTGC TGATCACGAC CGAACGCATG GTCACCGAGA ACCCCGATCT GGTCACGCGG GTGCTGAAGG CGACCCTACA GGGATGGAAG GATGTGATCC TCAGTCCGGC CCAAGCAATT GGCTATGTCA AAGACTATGC GCCCACGGTG GATCGGGACG GACAAATGCG CCGCTTGAGT GCGTTCGTCG AGTTATTACA ACCCACCAAT ACCAAACTCG GCGATATGCT GCCGGATCGC TGGGCGTTTA CCCATCAAAT GTTACAAACC CAAGGGGCGC TGACCCAGCC AATCGAACTT GGACGGGCCT ATTCCACGAT GTTTCTTGAT GTGCTTCCCG ATCGCTAA
|
Protein sequence | MGIPRLALRA IGVRCFRSMR LLMLVFLTAC STAQPSPTPA PMDTVTIQLN WVNDYSSAGF FAAEKNGRFA DQRVQVTLRE GGFDANGYID GTEQVSSGAA DFGVASADSI IQARAQGKPI VGIAVLAQDS PLAILSLPQT AIRDPHDLVG KKVLVAEGGA TQLYTTLLAS QQIALTQAPP IPRTDSGIDQ LIAGKIDALV AWNVNEAIEL SELGYPPSVM LFSDYGINSY ELVLITTERM VTENPDLVTR VLKATLQGWK DVILSPAQAI GYVKDYAPTV DRDGQMRRLS AFVELLQPTN TKLGDMLPDR WAFTHQMLQT QGALTQPIEL GRAYSTMFLD VLPDR
|
| |