Gene Haur_4304 details

Gene Information

Locus tag: Haur_4304
Symbol:
ID: 5736163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5495457
End bp: 5496521
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 53%
IMG OID: 641281464
Product: THUMP domain-containing protein
Protein accession: YP_001547064
Protein GI: 159900817
COG category: [R] General function prediction only
COG ID: [COG2933] Predicted SAM-dependent methyltransferase
TIGRFAM ID:
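
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the stop codon. The record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(5495457, 5496521)
print(length)                     # 1065
print(protein_length_aa(length))  # 354
```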


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTTGATA CCTATCGTGT TTTAATTAGT GCTGCGCCCG AGATGCTTGA GGCAGCGATA 
GATGAAATTC GTTATGGCAA ATATGGGGTT GCCGTTCAAC GGCTCGCGCC AGAAGTGGCC
TTGGTTGAGT TAGAGCAACC ACTCGCTGAA TGGCACGCCT TCTTGCTCAA ACGGCCAGCC
ATTTTTGTGC GCCATATCGC GCCAGTTCAT CGTGAAGTCG TGCTCGAAGG TACGCCTGAT
GATTTAGAAC GCCTAACCCA AGTGGCCTTG AATCTTGCGC CACATTTGCC AGCCCGTAAA
CCATTTGCGG TGCAAACTCG CTTGCTTGAT GAACATGTCA AATTAACCCG CTTTGAAATT
AACCAAGCCT TGGCCGAAGC AGTTTCGCAA GTCAGCGGGC TACCCTTGGA TGTACGCAAT
CCGCAGGGCG TGCTTTCGGT TGCTCTGAGC ATGGATCGGG CTTATTTGGG TGTTTCGACC
GTGGAATTTA ATCGCTCGGC TTGGGCGGGT GGCGCACACC GCTTTGCCCG CGAACAAGGC
CAATTAAGCC GCGCTGAATT TAAATTGCTC GAAGCCCAAG CCTTATTCAA CATCGAATGG
CCCGACCATG GCCAAGCCTT GGATTTAGGG GCTGCGCCTG GTGGTTGGAC GCGCATTTTG
CGGCGAGCAG GCCTGCCAGT AGTCGCGATC GACCCAGCCG ATTTGCACCC CAGCTTGCGC
TACGATCAAG GCGTGCGCCA TTTGCAAATC TTGGCTCAGC GCTTTTTGCC TACCCAAGAG
CGTTTTACCG TGATCGTTAA TGATATGCGG ATCGATGCGC GTGATTCGGC GTGGCTGATG
GTTGATTCAG CCAATGCCTT GACCAACGAT GGTTTAGCGG TGGTGACACT CAAATTGCCG
CATGAACATA GTGCCCAAAT CGCCTATCAC TCGCTGAGTA TTTTGCGCAC GGCCTATTGG
GTGATTGGGG CACGTCAACT GTTCCATAAT CGCTCGGAAG TGACTGCTGT GCTGCGTAAA
AAACATGAAA AACGCCCCAA ACCTGAGCAA TTGCCAAAAG ATTAA
 
Protein sequence
MLDTYRVLIS AAPEMLEAAI DEIRYGKYGV AVQRLAPEVA LVELEQPLAE WHAFLLKRPA 
IFVRHIAPVH REVVLEGTPD DLERLTQVAL NLAPHLPARK PFAVQTRLLD EHVKLTRFEI
NQALAEAVSQ VSGLPLDVRN PQGVLSVALS MDRAYLGVST VEFNRSAWAG GAHRFAREQG
QLSRAEFKLL EAQALFNIEW PDHGQALDLG AAPGGWTRIL RRAGLPVVAI DPADLHPSLR
YDQGVRHLQI LAQRFLPTQE RFTVIVNDMR IDARDSAWLM VDSANALTND GLAVVTLKLP
HEHSAQIAYH SLSILRTAYW VIGARQLFHN RSEVTAVLRK KHEKRPKPEQ LPKD
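
The gene sequence begins with GTG while the protein sequence begins with M. This fits translation table 11, where GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation step, not the database's own pipeline. The function name `translate_cds` is hypothetical, and the `ALT_STARTS` set lists only three common initiators as an illustrative assumption (table 11 defines more).

```python
from itertools import product

# Standard amino-acid assignments, which translation table 11 shares,
# listed in T, C, A, G order with the third codon base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons allowed by table 11.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; an initiator in first position is read as M."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":                   # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCTTGATA CCTATCGTGT TTTAATTAGT"))  # MLDTYRVLIS
```

The printed residues match the first ten residues of the protein sequence above.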