Gene Haur_2353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2353 
Symbol 
ID5734234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3002234 
End bp3003451 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content51% 
IMG OID641279494 
ProductS-adenosylmethionine synthetase 
Protein accessionYP_001545121 
Protein GI159898874 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0192] S-adenosylmethionine synthetase 
TIGRFAM ID[TIGR01034] S-adenosylmethionine synthetase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGCAA CTACGTTTAT GCGTGCTCCA CAGTTCTTCT TCACCTCAGA ATCGGTGACT 
GAAGGACACC CCGATAAAAT TTGTGACCAA GTTTCTGATG CGATCTTAGA TGAATTGTTG
GCCCAAGACC CCATGTCGCG GGTAGCTTGC GAAACCGCTA CTACCACCGG CTTGATCGTC
GTTCTCGGTG AAGTTACCAC CAAGGGCTAT GTTGAAGTTC AAGATATTGT GCGCAAAGTT
GTCAGCGATA TTGGCTACAC CCGTGGTAAG TTCGGCTTCG ATGCCGAAAC CTGTGGGATT
ATCGTCGCCT TGCACGGCCA ATCGCCCGAC ATCGCTCAAG GCGTTGATGT GGCCTTGGAA
GTACGCGACG ATCAAGCCAG TCGCGAAGCT GAAATTGAAA AAGTTGGCGC TGGCGATCAA
GGTATGGTGT TCGGGTTCGC TTGTAACGAA ACCGATGAAT TTATGCCTTT GACGATTTCG
TTGGCTCACC AATTGACCCG CCGCTTGGCC AAAGTGCGCA AAACTGGCGA AGTTGGCTAC
TTACGCCCCG ACGGCAAGAG CCAAGTAACC GTCGAATATA GCCATGGCAA GCCTGTGCGC
GTCGATACCG TGTTGATCTC AACCCAGCAC GATCCCAACG TCGATAACGA GCGGATTCAC
AAGGATGTAA TCGAATTAGT GATTAAGCCG GTAATTCCTG AAGGCATGCT TGATGAAAAC
ACCAAGATTT TCATCAACCC AACTGGCCGC TTCGTCACTG GTGGCCCAAT GGGCGATTCA
GGTTTGACTG GCCGCAAGAT CATCGTTGAC ACCTACGGCG GGGTTGCTCG CCACGGTGGC
GGCGCTTTCT CAGGCAAAGA TTCAACCAAG GTTGATCGCT CAGCTGCCTA TGCTGCTCGC
TACGTTGCCA AAAACATCGT GGCTGCTGGC TTGGCCGAAC GCTTTGAATT GCAAGTCAGC
TATGCAATCG GCGTTTCAAA GCCGCTTTCA ATCTCGTTTG AAACCTTTGG TACCGCCAAA
GTCAGCGACG AAAAATTGTT AGAGTTGATC AACAAACACT TTGATTTGCG TCCAGGTGCG
ATCATCCGTG ATCTTGATCT GCGCCGCCCA ATCTATCGCC AAACCGCTGC CTACGGTCAC
TTTGGCCGTG CTGACATCGA TTTGCCATGG GAACGCACCG ACAAAGCTGA ATTGCTTCGC
GCTGAATCAG GCCTGTAA
 
Protein sequence
MDATTFMRAP QFFFTSESVT EGHPDKICDQ VSDAILDELL AQDPMSRVAC ETATTTGLIV 
VLGEVTTKGY VEVQDIVRKV VSDIGYTRGK FGFDAETCGI IVALHGQSPD IAQGVDVALE
VRDDQASREA EIEKVGAGDQ GMVFGFACNE TDEFMPLTIS LAHQLTRRLA KVRKTGEVGY
LRPDGKSQVT VEYSHGKPVR VDTVLISTQH DPNVDNERIH KDVIELVIKP VIPEGMLDEN
TKIFINPTGR FVTGGPMGDS GLTGRKIIVD TYGGVARHGG GAFSGKDSTK VDRSAAYAAR
YVAKNIVAAG LAERFELQVS YAIGVSKPLS ISFETFGTAK VSDEKLLELI NKHFDLRPGA
IIRDLDLRRP IYRQTAAYGH FGRADIDLPW ERTDKAELLR AESGL