Gene Haur_2163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2163 
Symbol 
ID5736874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2728446 
End bp2729549 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content47% 
IMG OID641279304 
Productradical SAM domain-containing protein 
Protein accessionYP_001544931 
Protein GI159898684 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTATTC TGTTGGCGAA GTCGCCACTC TCTGATATTG CTGAAAAAGT TGAAGCAGGC 
GAACGCCTTT CGTTTGACGA TGGAATGCGT TTGTATCAAA CTAACGATAT TTTAGCCTTG
GGTAAATTGG CCGATACGGT GAATCGACGC AAAAATGGCG ATGTGGTGTA TTTTGTGCAA
AATCACCGCA TTACACCAAC CAATGTTTGT GCATTTCACT GTAATTTCTG CTCGTTTCGG
CGTAATGGCA ACGAACCCGA TGCCTTTGTG CGCACTCCCG AACAAATTAT CGATCACGTT
GGGCGTTTGT TTAGCGAACG TACCCGTGAA TTTCATATTG TCGGCGGTTT AGTGCCCGAT
CTCGATGTTG AATATTATGC CGATATCATT CGTGAATTGA AGGATCATTA TCCCAATGTT
CACGTCAAAG CCTTTACGGC AGTTGAAATT GATTATATGG CTCAAATTTC GCATCTTGAT
TGGCGCACAA CCCTTGAGAT TTTGCGCAAG GCTGGGCTGG ATGCCTTGCC TGGTGGCGGT
GCTGAAATTT TCCATCCAGC GGTGCGCCGT AAAATCTGCC CCGAAAAGGT TGATGGCGAT
GGTTGGTTGG AAATTCATGG CATTGCTCAC GAATTAGGCA TCAAAACCAA TGCCACTATG
CTCTATGGCC ATATCGAAAC CCTCGAACAA CGGGTTGATC ACTTGTTACG TTTGCGCGAA
CAGCAAGATA AAACTGGCGG TTTTGTAACC TACATTCCGC TGGCTTTCCA CCCCGAAAAC
AACAATTTAG GGCGGGTCAA AAAGCTCGAT TGGACGACAG GCTTCGAGGA TTTGAAGAAT
TTGGCGATTG GCCGTTTGTT GCTCGACAAC TTTGCCCATG TCAAAGCCTA TTGGATCTCG
CTCACGCCAC GGTTGGCCCA AGTCGCTTTG TCGTTTGGGG TTTCCGATGT TGACGGCACG
GTGATCGAAG AAGAAATCTA TCACGCTGCT GGGGCTAAAA CCGAACAAGG TATCTCACGG
GCAGAATTAG TTCATCTGGT GACGACTGCT GGCAAAACCG CAGTTGAGCG GGATGCACTT
TATAATCACA TCGCTGTGAA CTAA
 
Protein sequence
MAILLAKSPL SDIAEKVEAG ERLSFDDGMR LYQTNDILAL GKLADTVNRR KNGDVVYFVQ 
NHRITPTNVC AFHCNFCSFR RNGNEPDAFV RTPEQIIDHV GRLFSERTRE FHIVGGLVPD
LDVEYYADII RELKDHYPNV HVKAFTAVEI DYMAQISHLD WRTTLEILRK AGLDALPGGG
AEIFHPAVRR KICPEKVDGD GWLEIHGIAH ELGIKTNATM LYGHIETLEQ RVDHLLRLRE
QQDKTGGFVT YIPLAFHPEN NNLGRVKKLD WTTGFEDLKN LAIGRLLLDN FAHVKAYWIS
LTPRLAQVAL SFGVSDVDGT VIEEEIYHAA GAKTEQGISR AELVHLVTTA GKTAVERDAL
YNHIAVN