Gene Haur_5201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5201 
Symbol 
ID5737159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp290002 
End bp291039 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content56% 
IMG OID641282365 
ProductNMT1/THI5-like domain-containing protein 
Protein accessionYP_001547956 
Protein GI159901710 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTATTC CACGTCTCGC GCTCCGCGCC ATCGGTGTCC GCTGCTTTCG GTCGATGCGA 
CTGCTCATGC TCGTGTTCCT TACGGCCTGT AGTACCGCCC AACCGAGTCC GACCCCTGCC
CCGATGGATA CGGTGACCAT CCAACTCAAT TGGGTTAATG ATTATTCCTC TGCTGGCTTT
TTCGCGGCGG AAAAGAACGG ACGCTTCGCC GACCAACGCG TCCAAGTGAC CTTGCGTGAG
GGCGGCTTTG ATGCCAATGG CTATATTGAT GGAACGGAAC AAGTCAGCAG TGGCGCTGCC
GATTTTGGGG TGGCCAGTGC CGATAGTATC ATTCAGGCGC GGGCACAGGG GAAGCCCATT
GTGGGCATTG CGGTGTTGGC GCAGGATAGT CCCCTCGCCA TTCTCTCATT GCCGCAGACC
GCCATCCGTG ACCCCCACGA TTTGGTCGGA AAAAAGGTGT TGGTGGCCGA AGGCGGCGCA
ACCCAACTGT ATACGACGTT GTTGGCATCC CAACAGATTG CGCTCACCCA AGCACCGCCC
ATCCCACGCA CCGATTCCGG GATTGACCAG TTAATTGCGG GAAAGATTGA TGCCTTGGTG
GCGTGGAATG TCAACGAAGC GATTGAATTA AGTGAACTCG GCTACCCACC ATCGGTCATG
TTGTTCAGTG ATTATGGGAT CAATAGCTAT GAGTTGGTGC TGATCACGAC CGAACGCATG
GTCACCGAGA ACCCCGATCT GGTCACGCGG GTGCTGAAGG CGACCCTACA GGGATGGAAG
GATGTGATCC TCAGTCCGGC CCAAGCAATT GGCTATGTCA AAGACTATGC GCCCACGGTG
GATCGGGACG GACAAATGCG CCGCTTGAGT GCGTTCGTCG AGTTATTACA ACCCACCAAT
ACCAAACTCG GCGATATGCT GCCGGATCGC TGGGCGTTTA CCCATCAAAT GTTACAAACC
CAAGGGGCGC TGACCCAGCC AATCGAACTT GGACGGGCCT ATTCCACGAT GTTTCTTGAT
GTGCTTCCCG ATCGCTAA
 
Protein sequence
MGIPRLALRA IGVRCFRSMR LLMLVFLTAC STAQPSPTPA PMDTVTIQLN WVNDYSSAGF 
FAAEKNGRFA DQRVQVTLRE GGFDANGYID GTEQVSSGAA DFGVASADSI IQARAQGKPI
VGIAVLAQDS PLAILSLPQT AIRDPHDLVG KKVLVAEGGA TQLYTTLLAS QQIALTQAPP
IPRTDSGIDQ LIAGKIDALV AWNVNEAIEL SELGYPPSVM LFSDYGINSY ELVLITTERM
VTENPDLVTR VLKATLQGWK DVILSPAQAI GYVKDYAPTV DRDGQMRRLS AFVELLQPTN
TKLGDMLPDR WAFTHQMLQT QGALTQPIEL GRAYSTMFLD VLPDR