Gene GBAA_1968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_1968 
Symbolhom-1 
ID2817031 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp1850762 
End bp1852057 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content36% 
IMG OID637788846 
Producthomoserine dehydrogenase 
Protein accessionYP_018612 
Protein GI47527263 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000153616 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAACG TTATTCATGT AGGGGTGTTA GGATTAGGTA CGGTCGGAAG TGGTGTTGTC 
CATATTTTGA AAGAACATTA TAAAAAAATT GCACTTGATA CAGGGTATGA GGTGAAGGTG
AAGACAGTCG TTGTACGTGA TTTAGAAAAA GAACGTGATG TTTGTATTGA TGGAATCGTA
GTAACATGTC ATGTCGATGA AGTTCTAAAT GATCCAAATA TTGATATTGT AGTAGAGGTA
ATGGGCGGAA TTGAAGAAGC GAAGCAGCAT ATTGTTAAGG CTTTACGAAA TAAGAAACAT
GTCGTGACAG CAAATAAAGA TTTAATGGCT GTATACGGTG CAGAGCTTTT GCAACTGGCG
AACGATAATG ATTGTGATCT ATGTTATGAG GCAAGTGTAG CTGGTGGTAT TCCAGTGTTA
AGAGGACTAA CAGACGGATT AGCTTCAGAT CAAATTGAAA AAATAATGGG AATCGTAAAT
GGAACAACAA ATTATATGTT AACAAAGATG AGTCAAAAGG GATGGTCGTA TGAAGAGGCT
TTACAAGAAG CGCAAAAATT AGGTTTCGCA GAATCAGATC CGACAGCGGA TGTAGATGGA
TTAGATGCAG CGAGAAAAGT AGCAATCCTT GCAAATTTAG GTTTTTCGAT GAATGTTTCT
TTGGATGATG TGCAAGTAAG AGGGATTCGA AAGGTAGAAA AAGAAGATTT ACAAATGGCT
GAAAAGTTAG GGTTTACTAT GAAGTTAATT GGTAAAGCAG AGAAACAGGG ATCAGCTATT
CATTTAAGTG TAGAACCGAC ACTTTTACCA AGTCATCATC CATTGTCAAA TGTAAATAAT
GAATTTAATG CAGTGTATGT TCACGGGCAA GCGGTAGGAG AAGTGATGTT TTACGGACCT
GGAGCAGGTA AATTGCCGAC TGGTTCTGCA GTAGTAAGTG ATATTATTTC AATCGTTAAA
AATATGAATC AAGTTCCGAA AAATAAAAGT GTGTTAAAAG AACCAGAGCC ATACGAATTA
CAAGGGGATG AAGAAGTCGT TTCGAAATAT TTCTTACGTA TTTCATTACG AGATGAGCCA
GGGATGTTAC AAAAAATAAC AGAATGTTTC GTTAATTATT CTGTAAGTTT AAAAGAAGTA
ATTCAATTAC CTTTAAATCG TGAACTTGCA GAAGTCGTTG TTGTGACACA TCAAACTTCA
AAGTATCAAT TCGAACGAGT TTTAGGGGCA ATAGAAGATG TCGCAAGTGA AATAAACAGT
TACTACATTA TCGAGGAGGA AAAACAATAT GTATAA
 
Protein sequence
MNNVIHVGVL GLGTVGSGVV HILKEHYKKI ALDTGYEVKV KTVVVRDLEK ERDVCIDGIV 
VTCHVDEVLN DPNIDIVVEV MGGIEEAKQH IVKALRNKKH VVTANKDLMA VYGAELLQLA
NDNDCDLCYE ASVAGGIPVL RGLTDGLASD QIEKIMGIVN GTTNYMLTKM SQKGWSYEEA
LQEAQKLGFA ESDPTADVDG LDAARKVAIL ANLGFSMNVS LDDVQVRGIR KVEKEDLQMA
EKLGFTMKLI GKAEKQGSAI HLSVEPTLLP SHHPLSNVNN EFNAVYVHGQ AVGEVMFYGP
GAGKLPTGSA VVSDIISIVK NMNQVPKNKS VLKEPEPYEL QGDEEVVSKY FLRISLRDEP
GMLQKITECF VNYSVSLKEV IQLPLNRELA EVVVVTHQTS KYQFERVLGA IEDVASEINS
YYIIEEEKQY V