Gene Haur_4901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4901 
Symbol 
ID5736737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6237872 
End bp6238864 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content51% 
IMG OID641282068 
Producthypothetical protein 
Protein accessionYP_001547659 
Protein GI159901412 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1240] Mg-chelatase subunit ChlD 
TIGRFAM ID[TIGR02226] N-terminal double-transmembrane domain 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.908989 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACATTTG CTTATCCATT GTTATTTTGG TTGTTGCCGA TTGTGCCGAT TGTCTGGTGG 
TCGATGTATC GCCGCAGTCG CGGCGAAGTG GTGACCATGC GCTTCTCCGA TTTGGGCATG
CTGCGTGGGG TCAAGCCGTC GTGGCGGATT CGCATGCGGC CTGTTTTGAT TTCGTTGCGA
GCGGCGGCAG TTGGCTTATT GGTGGTTGTG CTCACTCGGC CACAATATGC CCAAAGCTCC
GAGCGCGTGG TGCGCGAAGG CATCGATATT CAGCTAGCCT TGGATATTTC GCTGAGTATG
AAGGCTGGCG ATTTCGATCC AAAAGATCGG ATTACCGTAG CCAAAGAAGT TATTGCCGAG
TTTGTCAAAG GCCGCAAAGA TGATCGGATT GGGCTGGTAG TTTTTTCGGG CCATGCCTTT
ACTCAAGTAC CATTGACGCT TGATTACGAC TTTTTGCAGA ATTTATTGGG TCAAGTGCAA
ACCGTTCGTC GGCCTGATGG TACAGCGATT GGCCTCGCCT TAGCCCACTC GGTCAATGGC
TTACGTAATA GCACCACCAA AAGCAAAGTC GTGATTTTGC TCACCGACGG CTCGAACAAT
CGTGGCGATA TCGAGCCAGC CCAAGCTGCC GAAATTGCCC GCGCTTTGGA TGTGCGGGTC
TATACGATTT TGGTTGGTAA ACCAGGCAAT GGTGAATATC CCGTGCATGA TCCTTGGCGC
GATGAAACCT ATTTGATTCC AGCACCAACT GCCGAGGATG AAGTGGCCCT CCGCGATATT
GCTGAACAGA CGGGCGGGAT TTTCTTCCGT GCTGGCGATG AACAAGGTCT GCGCGATGTC
TATGATACGA TCGATAAAAT GGAACGATCA CAAGTCGCTA GTGAAAAATT AGTTCGCTAC
ACCGAGGCTT GGCAACCATG GGCCGCCGGA GCACTTTTGC TCTTGATGAT TGAAATTTTG
CTACGCAATA CCATCCTACG GAGTATTGGC TAA
 
Protein sequence
MTFAYPLLFW LLPIVPIVWW SMYRRSRGEV VTMRFSDLGM LRGVKPSWRI RMRPVLISLR 
AAAVGLLVVV LTRPQYAQSS ERVVREGIDI QLALDISLSM KAGDFDPKDR ITVAKEVIAE
FVKGRKDDRI GLVVFSGHAF TQVPLTLDYD FLQNLLGQVQ TVRRPDGTAI GLALAHSVNG
LRNSTTKSKV VILLTDGSNN RGDIEPAQAA EIARALDVRV YTILVGKPGN GEYPVHDPWR
DETYLIPAPT AEDEVALRDI AEQTGGIFFR AGDEQGLRDV YDTIDKMERS QVASEKLVRY
TEAWQPWAAG ALLLLMIEIL LRNTILRSIG