Gene HS_1031 details


Gene Information

Locus tag: HS_1031
Symbol: moaA
ID: 4240529
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1137555
End bp: 1138571
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 36%
IMG OID: 638104592
Product: molybdenum cofactor biosynthesis protein A
Protein accession: YP_719243
Protein GI: 113461174
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG2896] Molybdenum cofactor biosynthesis enzyme
TIGRFAM ID: [TIGR02666] molybdenum cofactor biosynthesis protein A, bacterial
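
The listed lengths are consistent with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and the final codon is an untranslated stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with illustrative variable names:

```python
# Coordinates copied from the record above.
start_bp = 1137555
end_bp = 1138571

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```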


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAGAT CTATTTCTAT TCATAATGTT ATTGAAAATC ATCTTATTGA TGCGTTTCAA 
CGTAAATACT ATTATTTACG CTTGTCTGTC ACGGATATGT GTAATTTTCG TTGTAACTAT
TGTTTACCTC ATGGCTATCG ATCGGAAAGC ACAAAACCCA GTTTTCTGAA TCTGGTTGAA
ATTAAACGAT TGGTGAATGC CTTTGCACAA TTAGGTACAG AAAAAGTTCG TATTACCGGT
GGTGAGCCGA CTCTTCGTAA AGATTTCCTG TCCATTGTTG AAAATATTCG TGCTATTGAA
ACAATAAAAA ATATTGCATT GACGACGAAT GGCTACAAAA TGGCTAGACA AGTTGAGGAT
TGGAAAAAAG CGGGAATCAG TGCAATCAAT GTGAGCGTGG ATAGCCTTGA TCCTAAAATG
TTTCATGCGA TTACCGGCGT TGATAAATTT CACGATATTA TGCGAGGAAT TGACCGTGCT
TTTGAAATTG GTTATGAAAA AATTAAAGTG AATTCAGTGT TAATGAAATC TCTCAATGAT
AAGGAGTTTG ATCAATTTTT AATGTGGGTC AAAGAGCGTC CTATTCAAAT GCGATTTATT
GAATTGATGC AAACCGGCGA AATGGCTCAA TTTTTTCATC AATACCATTT ATCAGGTCAA
ATTTTAGCGG AAAAACTTAT AAGAGAAGGG TGGAGCATAC AGAAAAAAGA ACGTGCAGAT
GGACCAGCAA AGGTATTTGC ACATCCTGAT TATAAGGGCG AAATTGGATT GATCATGCCT
TATGAAAAAA ATTTTTGCAC AAGTTGTAAC CGTCTAAGAG TTTCGGCAAA GGGAAAATTA
CACTTGTGTT TATTTGGTGA AGAAGGAATT GATCTACGAG ATTTATTGCA ATTTGATGAG
CAGCAGAACC AATTAAAAAG TCGAATTTTT ACGGCTTTAC AAAGTAAAAG AGAACATCAT
TTTTTACATA TCGGCGATAG CGGTGTACGA AATCATCTCG CAAGTATTGG GGGCTAA
 
Protein sequence
MQRSISIHNV IENHLIDAFQ RKYYYLRLSV TDMCNFRCNY CLPHGYRSES TKPSFLNLVE 
IKRLVNAFAQ LGTEKVRITG GEPTLRKDFL SIVENIRAIE TIKNIALTTN GYKMARQVED
WKKAGISAIN VSVDSLDPKM FHAITGVDKF HDIMRGIDRA FEIGYEKIKV NSVLMKSLND
KEFDQFLMWV KERPIQMRFI ELMQTGEMAQ FFHQYHLSGQ ILAEKLIREG WSIQKKERAD
GPAKVFAHPD YKGEIGLIMP YEKNFCTSCN RLRVSAKGKL HLCLFGEEGI DLRDLLQFDE
QQNQLKSRIF TALQSKREHH FLHIGDSGVR NHLASIGG
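
The sequences above are printed in blocks of 10 characters, so they need whitespace removed before use. The following Python sketch shows one way to clean a block-formatted sequence, compute its GC percentage, and translate it. The function names are hypothetical and not part of any database tool. The codon table is the standard one, which translation table 11 shares for codon-to-amino-acid assignments (table 11 differs only in the start codons it permits).

```python
# Build the standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Usage with the first two blocks of the gene sequence listed above.
fragment = clean_sequence("ATGCAAAGAT CTATTTCTAT")
print(translate(fragment))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `translate` and `gc_percent` would allow a comparison against the protein sequence and GC content listed in this record.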