Gene Haur_4707 details

Gene Information

Locus tag: Haur_4707
Symbol:
ID: 5736554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6012859
End bp: 6014058
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 49%
IMG OID: 641281871
Product: radical SAM domain-containing protein
Protein accession: YP_001547466
Protein GI: 159901219
COG category: [H] Coenzyme transport and metabolism
              [R] General function prediction only
COG ID: [COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes
TIGRFAM ID: [TIGR00423] radical SAM domain protein, CofH subfamily
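
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and assumes that the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 6012859
end_bp = 6014058

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1200
print(protein_length)  # 399
```

Under those assumptions the computed values match the reported Gene Length (1200 bp) and Protein Length (399 aa).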


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGTAC AATCGCTTCT TGAAAAAGCA GTCAATGGTG AGCGTCTTTC GGCTGAGGAA 
GGGCTGTTTT TGTATCGTGA AGCTGACCTG CTGGATTTAG GCATGGCTGC TGATGCGATT
CGCCAACGGC TTGTGCCCGG AAACTTAGTG ACGTATCTTG TTGATCGCAA CGTCAATTAT
AGTAATGTTT GCACGACCAA TTGCCAATTC TGTGGTTTTT ATCGCCCATT GGGTCACCCA
GAATCCTATG TGCAAACCTA CGAAGAAATT GGTGCTCGTT TAGCCGAGCT TGTCGAATAT
GGTGGAACCC GCGTGCTGAT GCAGGGTGGC CATCATCCTG AATTGCGGCT TTCGTGGTAT
ACCGGCTTAC TGAGCTATTT GCGCGAACAT TATCCTAGCA TCGAGCTTGA TTGCTTCTCG
CCTTCAGAAA TCGATAATCT GGCTAAGGTC GAGAACATGA GCATTCGCGA GGTGTTGATT
GCGCTGAAGG AAGCTGGCTT GGCTGGCGTG CCTGGCGGCG GCGCTGAAAT TCTCGATGAT
GAAGTGCGCC AACGGGTTAG CCCACTCAAG CAAGATGCTG CTGGTTGGCT CGAAGTTCAG
GGTACAGCGC AATCGCTGGG CATGGCCACC ACCGCAACTA TGGTGATTGG CTTCGGCGAG
TCGCTTGAAC AACGTATGAA TCACCTTATG AAATTGCGTG ATTTGCAGGA TGTTTCGTTG
CGCGAGTATA ACAATGGCTT TATTGCCTTT ATTTCGTGGA CCGTCCAGAA ATCGGAGTTG
ACCTCGTTGG GTCGCTCGAA GTTCTCGGAC GACTACGGGG CAACCGCTGC CGAATATTTG
CGCCACACCG CGCTTGCCCG AATTATGCTC GATAACTTTA ATCATATTCA GGCTTCATGG
GTAACTCAAG GACCAAAGAT TGGCCAAGTT TCGCTGAAGT TTGGCATCGA TGATTTTGGC
TCGACCATGC TTGAGGAAAA TGTGGTTTCC AACGCCGCTC ACGACACCTA CCAATGTATG
GGCGAACGCG AAATCCACGG TTTTATTCGC GATGCTGGCT TTATTCCAGC CAAACGCGAT
ACCCACTACA ACTTGATTCA AGCTTTTGAA GACCCCAAAG ATAGCGAAAA CGTGCCATCG
ATGGGCATCA AACAGCCACG TCAAGCCAAG CAGGTCGAAA TTCCATTAAT GGTTAAATAG
 
Protein sequence
MNVQSLLEKA VNGERLSAEE GLFLYREADL LDLGMAADAI RQRLVPGNLV TYLVDRNVNY 
SNVCTTNCQF CGFYRPLGHP ESYVQTYEEI GARLAELVEY GGTRVLMQGG HHPELRLSWY
TGLLSYLREH YPSIELDCFS PSEIDNLAKV ENMSIREVLI ALKEAGLAGV PGGGAEILDD
EVRQRVSPLK QDAAGWLEVQ GTAQSLGMAT TATMVIGFGE SLEQRMNHLM KLRDLQDVSL
REYNNGFIAF ISWTVQKSEL TSLGRSKFSD DYGATAAEYL RHTALARIML DNFNHIQASW
VTQGPKIGQV SLKFGIDDFG STMLEENVVS NAAHDTYQCM GEREIHGFIR DAGFIPAKRD
THYNLIQAFE DPKDSENVPS MGIKQPRQAK QVEIPLMVK
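
Both sequences are laid out in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of one way to do that and to compute a GC fraction for comparison with the GC content field. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to lay out 10-residue blocks."""
    return "".join(text.split())


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence listed in this record.
first_line = "ATGAATGTAC AATCGCTTCT TGAAAAAGCA GTCAATGGTG AGCGTCTTTC GGCTGAGGAA"
dna = join_blocks(first_line)

print(len(dna))  # 60
print(f"GC fraction of the first line: {gc_fraction(dna):.2f}")
```

Passing the full gene sequence through the same two helpers should give a value that can be compared with the reported GC content.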