Gene Haur_2998 details

Gene Information

Locus tag: Haur_2998
Symbol:
ID: 5734870
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3786122
End bp: 3787315
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 53%
IMG OID: 641280142
Product: hypothetical protein
Protein accession: YP_001545764
Protein GI: 159899517
COG category: [R] General function prediction only
COG ID: [COG1092] Predicted SAM-dependent methyltransferases
TIGRFAM ID:
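
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of the record.

```python
# Values copied from the gene information fields above.
start_bp = 3786122
end_bp = 3787315
gene_length_bp = 1194
protein_length_aa = 397


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len):
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


print(span_length(start_bp, end_bp))            # 1194
print(expected_protein_length(gene_length_bp))  # 397
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.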


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGCCGCCC GTTCAATGCC AACAATCAAC TTGCCAACCC AGCTAAAGGC TCATCTTCAG 
GCCGGCCATC CATGGGTCTA CCGCGACCAT GTGCCGCCGA GTACTCGTTT AGCCAGCGGC
ACATGGGTGC GGCTGCATTG TGGCAATTGG CAAGGGTTTG GCTTGTGGGA TGCTCGCTCG
CCGATCGCTC TACGCATCTT TTCGAGCCGC ATGCAGCCCG ATGCCAACTG GATCAAAACG
GTCGTTCAGC AAGCATGGCA GGCACGTGAA CCGTTGCGCC AAACTGCCAC CACCGCCTAT
CGTTTGCTGT TTGGCGAGGG CGATGGCCTA CCAGGGATCA CCATCGATCT CTACAATCAA
TATGCCGTGA TTGCGACCTA CGCCGATTGT GTTGAGGTAT TGATTGCCGA TGTGGTCAAG
GCCTTGCAAG CCAGCGTGCC ACAACTGCGG GGCGTGGTGC GCCGCCGCCG CGACGATAGC
GAAAACGACG ATGAAACTGG CAAAATTGAG TTGTTGTGGG GCGAATTGCC ACCAGCCCAA
CTGATAGTCG AAGAACATGG CCTTAAATTG ATCGCTAATT TGTTTGAGGG CCAGAAAACT
GGCTTATTCC TTGATCATCG TGAGAACCGC CATACCATTG AGCAATGGAG CCATGGTAAA
ACGGTGCTGA ATTGCTTCTC GTATACTGGG GCATTTTCGT TATACGCTGC TCGTGGCGGC
GCAACTGCCA CCACCAGCGT CGATATTGCG CCAGCCGCTG CCCACGATGC TGAACAAAAT
TTTATGCTCA ATGGCTTGAT GAATGAACAC CAGCGCTTTT TGGCCCGCGA TTGCTTTGAT
TTTCTGAGTC GCACGATTCA GCGTGGCGAA ACCTATGATT TGGTGATTCT TGACCCACCT
TCGTTTGCCC GCTCGAAGAA AAATATTCAT GCAGCAACTC GAGCTTATGT CAAACTCAAT
GCCTTAGCGA TTCAATGTGT GGCGAAGGGT GGGCTACTGG CCTCAGCCAG TTGTACTAGC
CAACTTTCGC CCGCCAATTT TCGCTTGATG CTGGGCGAAG CTGCTGCCCA AACCGATCAG
CAATTGCGCA TTATTCATGA GGCAGGGCAA GCGCTCGATC ACCCAGTGCC AGCGCATTTT
ACCGAAGGCC GCTATCTCAA ATTTGTGTTA GCCCGCGTTG ATGAGCGTAT GTAA
 
Protein sequence
MAARSMPTIN LPTQLKAHLQ AGHPWVYRDH VPPSTRLASG TWVRLHCGNW QGFGLWDARS 
PIALRIFSSR MQPDANWIKT VVQQAWQARE PLRQTATTAY RLLFGEGDGL PGITIDLYNQ
YAVIATYADC VEVLIADVVK ALQASVPQLR GVVRRRRDDS ENDDETGKIE LLWGELPPAQ
LIVEEHGLKL IANLFEGQKT GLFLDHRENR HTIEQWSHGK TVLNCFSYTG AFSLYAARGG
ATATTSVDIA PAAAHDAEQN FMLNGLMNEH QRFLARDCFD FLSRTIQRGE TYDLVILDPP
SFARSKKNIH AATRAYVKLN ALAIQCVAKG GLLASASCTS QLSPANFRLM LGEAAAQTDQ
QLRIIHEAGQ ALDHPVPAHF TEGRYLKFVL ARVDERM
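
The relationship between the gene sequence and the protein sequence above can be reproduced with a short translation routine. This is a sketch only, not the pipeline that produced the record. It assumes that translation table 11 uses the standard amino-acid assignments and that an initiator codon at the first position is read as methionine, which is why the leading GTG yields M. The names `translate`, `gc_fraction`, and `START_CODONS` are hypothetical, and only three common bacterial start codons are handled.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: only the three most common bacterial initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(raw):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(raw.split()).upper()


def translate(raw):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(raw)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(raw):
    """Fraction of G and C bases in the sequence."""
    seq = clean(raw)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("GTGGCCGCCC GTTCAATGCC AACAATCAAC"))  # MAARSMPTIN
```

Passing the full gene sequence to `translate` gives a string to compare against the listed protein sequence, and `gc_fraction` gives a value to compare against the GC content field.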