Gene Haur_5087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5087 
Symbol 
ID5737045 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp112002 
End bp113030 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content54% 
IMG OID641282252 
ProductNMT1/THI5-like domain-containing protein 
Protein accessionYP_001547843 
Protein GI159901597 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGCTG TGTCACGCCC AACACACCGC GTCCGTCGCA TCGGATCGTT CACGACAATC 
CTGCTGATCC TGCTAGCAGC CTGTAGCACC CAGACAGAAC CGACTCCGGT TCCCATGGAT
GCGGTTACCC TCCAACTCAA CTGGGTCAAT GACTTTTCCT CAGCGGGCTT TTTTGCAGCG
GAAAAGAACG GACGCTTTGC CGACCAACGC CTGCAGGTCA CCTTGCGCGA GGGTGGCTTT
GATGCCAATG GCTATATTGA TGGCACCGAA CAAGTCAGTA GCGGTGCGGC TGATTTTGGG
GTGGCCAGCG CCGATAGTAT CCTTCACGCC CGTGCCCAAG GAAAACCAAT TGTTGGGATT
GCGGTGTTGG CGCAAGATAG TCCGCTGGCG ATTCTCTCGC TTCCTGCGAC CAATATTCGC
ACGCCCCGCG ATTTAATTGG CAAACGGGTG TTGGTTTCCG AGGGCGGAGC AACCCAACTC
TATACCACGT TGCTTGCTTC TCAATCCATC GATATTGCCC AAGCTCCACC CATTCCCCGC
ACCGATTCAG GGATTAATCA GCTGATTGCT GGTGAGATTG ATGCCCTCGT CGCATGGAAT
GTCAACGAAG CGATTGAATT AAGTGAACTT GGCTACCCAC CATCGGTTAT GCGGTTCAGC
GATTATGGCA TCAATAGCTA TGAATTGGTC GTGATCACCT CGGAGCGCCT CGTCACCGAG
AATCCCGATC GCGTCACTCG GTTTCTCAAG GCCGTCCTGC AAGGGTGGAA GGATGTCATC
CTTAGTCCCG CCCAAGCGAT TGGCTATGTG AAGGACTATG CCCCGGACGT TGAGCGGGAT
GGACAGTTGC AGCGGTTAAG TGTGTTTGTT GAGTTATTAC AACCAGCACA AACCAAACTC
GGCGATATGC TGCCTGAACG CTGGGCATTT ACCCAGACGA TGTTGCAAAC CCAAGGGGTG
CTCACGACCC CCATTGACCT TAATCGTGCC TACACGACCA CATTCCTTGA ACAGTTGCCA
GATCGCTAA
 
Protein sequence
MHAVSRPTHR VRRIGSFTTI LLILLAACST QTEPTPVPMD AVTLQLNWVN DFSSAGFFAA 
EKNGRFADQR LQVTLREGGF DANGYIDGTE QVSSGAADFG VASADSILHA RAQGKPIVGI
AVLAQDSPLA ILSLPATNIR TPRDLIGKRV LVSEGGATQL YTTLLASQSI DIAQAPPIPR
TDSGINQLIA GEIDALVAWN VNEAIELSEL GYPPSVMRFS DYGINSYELV VITSERLVTE
NPDRVTRFLK AVLQGWKDVI LSPAQAIGYV KDYAPDVERD GQLQRLSVFV ELLQPAQTKL
GDMLPERWAF TQTMLQTQGV LTTPIDLNRA YTTTFLEQLP DR