Gene HS_1229 details


Gene Information

Locus tag: HS_1229
Symbol: hemL
ID: 4240740
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haemophilus somnus 129PT
Kingdom: Bacteria
Replicon accession: NC_008309
Strand:
Start bp: 1395605
End bp: 1396903
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 40%
IMG OID: 638104802
Product: glutamate-1-semialdehyde aminotransferase
Protein accession: YP_719441
Protein GI: 113461372
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0001] Glutamate-1-semialdehyde aminotransferase
TIGRFAM ID: [TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase
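
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. Both assumptions are consistent with the values listed, but the record itself does not state them. The function names are hypothetical.

```python
# Values copied from the Gene Information record.
START_BP = 1395605
END_BP = 1396903


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1299 432
```

Under these assumptions the results agree with the reported Gene Length (1299 bp) and Protein Length (432 aa).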


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00808664
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACTT CAGCGACATT ATTCTCTCGT GCACAACAAG TTATTCCAGG CGGAGTAAAC 
TCTCCAGTTA GAGCATTTAA AGGTGTGGGC GGAACGCCAG TGTTCATAGA AAAAGCCAAC
GGTGCGTATA TTTTCGATAC AGAAGGAAAA CAATATATTG ACTACGTAGG TTCTTGGGGA
CCAATGATTT TAGGTCATAA CCACCCATCA ATCTTAAGTG CGGTACTAAA AACAGCAGAA
AATGGGCTAA GTTTTGGAAC ACCTACACCG CTTGAAATTG AACTTGCGGA ACTGATTTGT
CAATTAGTCC CATCAATTGA AATGGTGAGA ATGGTCAATT CGGGAACAGA GGCAACTATG
TCAGCTATTC GTTTGGCTAG AGGCTATACT AAAAGAGATA AAATTTTAAA ATTTGAAGGC
TGTTATCATG GTCACTCGGA TAGTTTGCTT GTCAAAGCCG GCTCCGGATC TTTGACTTTG
GGACAACCAA GCTCTCCTGG TGTTCCGGAA GACTTTGCTA AACATACCAT CACTTGCGAA
TATAATAATC TTCAATCTGT CAAAAATGCT TTTGAACAAT ATCCTGATCA GATCGCCTGC
GTTATCGTTG AGCCTGTTGC AGGTAACATG AACTGCATCC TTCCGAAACA GGATTTTTTA
CAAGGCTTGC GTCAACTTTG CAATGAATAT GGTTCTCTAT TTATTATTGA TGAGGTCATG
ACAGGATTTC GTGTAGCCTT AGGCGGTGCA CAATCTTACT ATGAAGTGAC ACCTGATCTA
ACAACATTAG GAAAAGTCAT TGGAGGAGGT ATGCCCGTTG GTGCTTTCGG AGGCAAAAAA
GAAATTATGC AATATATTGC ACCTACAGGT CCCGTATATC AAGCAGGAAC ATTATCAGGA
AATCCAATTG CTATGTCTGC CGGAATCGCA TGCTTAAATG AATTGAAAAA AGAAGGTAAC
GAACAACGTT TAGCAATGCT CACAAAAAAA TTGGCATTAG GTTTAAAAAA CTTAGCAAAT
CAACACAATA TCCCGCTTGT AGTCAATTAT GTAGGCGGAA TGTTTGGCAT CTTCTTTACC
ACACAAAATG AAGTTACCTC TTACCAACAA GCAATTCAAT GTGATGTTGA AAAGTTTAAT
CTATTTTTCC ACAAAATGTT AGAACAAGGT GTTTATCTTG CACCATCTGC ATTTGAAGCA
GGTTTCATGT CATTAGCACA CACTGACGCA GATATTGACC GCACTTTACA AGCGGCGGAT
ATTGCTTTTG CCAGTTTATG CTCATCATCA TTTTCCTAA
 
Protein sequence
MTTSATLFSR AQQVIPGGVN SPVRAFKGVG GTPVFIEKAN GAYIFDTEGK QYIDYVGSWG 
PMILGHNHPS ILSAVLKTAE NGLSFGTPTP LEIELAELIC QLVPSIEMVR MVNSGTEATM
SAIRLARGYT KRDKILKFEG CYHGHSDSLL VKAGSGSLTL GQPSSPGVPE DFAKHTITCE
YNNLQSVKNA FEQYPDQIAC VIVEPVAGNM NCILPKQDFL QGLRQLCNEY GSLFIIDEVM
TGFRVALGGA QSYYEVTPDL TTLGKVIGGG MPVGAFGGKK EIMQYIAPTG PVYQAGTLSG
NPIAMSAGIA CLNELKKEGN EQRLAMLTKK LALGLKNLAN QHNIPLVVNY VGGMFGIFFT
TQNEVTSYQQ AIQCDVEKFN LFFHKMLEQG VYLAPSAFEA GFMSLAHTDA DIDRTLQAAD
IAFASLCSSS FS
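
The sequences in this record are printed in space-separated blocks of 10, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction. The helper names are hypothetical, and only two short fragments of the gene sequence are used, so the GC value printed is for that fragment, not the whole gene. To compare against the reported GC content, pass the full gene sequence instead.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks of 10."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks and the last line of the gene sequence, copied from the record.
first_blocks = clean_sequence("ATGACAACTT CAGCGACATT")
last_line = clean_sequence("ATTGCTTTTG CCAGTTTATG CTCATCATCA TTTTCCTAA")

print(first_blocks.startswith("ATG"))  # True: the CDS begins with an ATG start codon
print(last_line[-3:])                  # TAA: the CDS ends with a stop codon
print(gc_fraction(first_blocks))       # GC fraction of the 20-base fragment only
```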