Gene Haur_4097 details

Gene Information

Locus tag: Haur_4097
Symbol:
ID: 5735958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5229423
End bp: 5230601
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 57%
IMG OID: 641281251
Product: molybdopterin binding domain-containing protein
Protein accession: YP_001546857
Protein GI: 159900610
COG category: [R] General function prediction only
COG ID: [COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA
TIGRFAM ID: [TIGR00177] molybdenum cofactor synthesis domain
            [TIGR00200] competence/damage-inducible protein CinA N-terminal domain
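
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the Start bp and End bp values are 1-based and inclusive and that the last codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 5229423
end_bp = 5230601

# Both endpoints are counted, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1179 bp
print(protein_length, "aa")  # 392 aa
```

Both values agree with the Gene Length and Protein Length fields listed in the record.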


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCAAGCTG AATTAATTGC CATCGGCACT GAACTGACCC TTGGCACAAC CGTCGATACC 
AATAGCGCTT GGCTGGCTCG GACTTTGGCC ACGGTTGGGG TCGAGGTGCA ACGCGTAACC
TTGGTCGCCG ACGATATTGG CGCAATCACC GAGGTGATCG CGGCGGCCTG GCAACGCAGC
CCATTGGTGC TATGCACAGG CGGGCTTGGC CCAACTGCCG ATGATCTGAC GCGCGAAGCC
GTGGCCCAGG CAACCCAGCG CCCCTTGGAA TTCCACCAAG ATCTGTTTGA TGGGATCGCC
GCTCGCTTTC GTTCGTTTGG TCGCACGATG AGCGAGAGCA ACCGCCAGCA AGCCTTTGTG
CCGATGGGCG CTCGACCAGT GCCCAATGCG CGTGGCACAG CACCCTCATT CATCATCGAC
GAAGGCTCAC GAGCCTTGAT GGTGTTTCCA GGTGTGCCCA GCGAGATGAA ATTTTTGGTC
GAAACTGAGT TACTGCCATT TTTGCGCAAC GAGCGCGGGC TAAAATCAGT CTTGTTGGTG
CGTTCGATTT GGCTCAGCGG CACAAGTGAG GCCGAGGCTG GCGAAATCAT TGCTGATCTG
ATGCAAGCTT CTTATCCAAC GGTGGGAATT TCGGCCAAAG CCGCCCAATA CGAAGTGCGA
ATCTCGGCCC AAGGTGAAGA TCCAGCCCAA GTTGAGGCTG ATATCGAGGC TTGCGCGGCG
GAAGTTGAAC GGCGGCTTGA GCGCTTTTTG ATGGATCGTG ATGGCTTGGC AGCGCATGTA
CTGCGGCGTT TGACCAATCG CTCCGCCAGC GTGGCAATTT ATGAAGGCTT ACGCGGCGCA
CCAATCTTCA ATGCATTGCG TTCAGCCAAA CCAGATTTAG TGCAAGCCTT GCGTGGCGTG
ACGATTCATC CCCTCGATCA AGCGGTTGAT CGCGAGGCTG CTGAATCGTT GGCAATTGCT
GGGGCCAACA CTGTGCGCAA CAATTGGCAG GCCAGCTATG GAATTGCCGC CGTGCCAGCC
CAAGTTGGAG CCGATGGCTT TACCGATGTC TGCATTGTCT TGGTTGGCAA AGATTTGCAA
CGCAGCTTCA CCCGTCGCGT CGAGTTGAAA AGTGACGATG CCTTGGGCTT TATCGCCACT
GCCGCGCTTG ACTTGATTCG GCGCAGTCTG GAAGCATAG
 
Protein sequence
MQAELIAIGT ELTLGTTVDT NSAWLARTLA TVGVEVQRVT LVADDIGAIT EVIAAAWQRS 
PLVLCTGGLG PTADDLTREA VAQATQRPLE FHQDLFDGIA ARFRSFGRTM SESNRQQAFV
PMGARPVPNA RGTAPSFIID EGSRALMVFP GVPSEMKFLV ETELLPFLRN ERGLKSVLLV
RSIWLSGTSE AEAGEIIADL MQASYPTVGI SAKAAQYEVR ISAQGEDPAQ VEADIEACAA
EVERRLERFL MDRDGLAAHV LRRLTNRSAS VAIYEGLRGA PIFNALRSAK PDLVQALRGV
TIHPLDQAVD REAAESLAIA GANTVRNNWQ ASYGIAAVPA QVGADGFTDV CIVLVGKDLQ
RSFTRRVELK SDDALGFIAT AALDLIRRSL EA
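
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is an illustration only, not the pipeline that produced this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it translates just the first and last blocks of the gene sequence as printed. The names `CODON_TABLE`, `translate`, `first_block`, and `last_block` are hypothetical.

```python
# Standard codon-to-amino-acid assignments, enumerated in T, C, A, G order.
# Assumption: table 11 shares these assignments and differs only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; whitespace is ignored, '*' marks a stop."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record above.
# Every full line holds 60 nt, so each line starts on a codon boundary.
first_block = "ATGCAAGCTG AATTAATTGC CATCGGCACT GAACTGACCC TTGGCACAAC CGTCGATACC"
last_block = "GCCGCGCTTG ACTTGATTCG GCGCAGTCTG GAAGCATAG"

print(translate(first_block))  # MQAELIAIGTELTLGTTVDT
print(translate(last_block))   # AALDLIRRSLEA*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final 12 residues followed by the TAG stop codon.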