Gene SAG1120 details


Gene Information

Locus tag: SAG1120
Symbol: hom
ID: 1013924
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1126866
End bp: 1128149
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 36%
IMG OID: 637316302
Product: homoserine dehydrogenase
Protein accession: NP_688129
Protein GI: 22537278
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0460] Homoserine dehydrogenase
TIGRFAM ID:
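
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with exactly one stop codon. The function names are hypothetical and not part of the source record.

```python
# Values copied from the Gene Information table.
START_BP = 1126866
END_BP = 1128149
GENE_LENGTH_BP = 1284
PROTEIN_LENGTH_AA = 427


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end_bp - start_bp + 1


def residues_from_cds(gene_length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # 1128149 - 1126866 + 1 = 1284 bp
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 1284 / 3 = 428 codons, minus one stop codon = 427 aa
    print(residues_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks print True for this record.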


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGACTATTA AAATAGCTTT ATTAGGTTTT GGAACGGTTG CTAAGGGTAT TCCATATTTG 
CTAAAAGAAA ATCAACATAA GCTACTTTCT TTAGAAGGCG AAGATATTGT GATTGATAAA
GTATTAGTAA GAGATAATGA AAGCCGCCAG CGTTTCATCA ATCAGGGATT TACTTATAAC
TTTGTGACAG AGATAAATAC TATTCTTCAA GATTCACAAA TTGATATTGT AGTGGAATTA
ATGGGGGGGA TTGAGCCAGC TAAAACTTAT TTGAGTCAAG CATTAGGATT TGGTAAACAT
ATTGTGACAG CCAATAAAGA TCTCATTGCT TTACACGGAA AAGAGTTGAT GGATTTAGCA
GACGCTAGAG GTCTAGCTTT ATTCTATGAG GGAGCAGTTG CTGGAGGCAT TCCTATTTTA
AGGACCCTAT CGCATTCGTT TGCCTCAGAT AAAATGACAC GTTTATTAGG AATTCTCAAC
GGTACCTCCA ACTTCATGTT AACAAAAATG TTTGAAGAGG GATGGTCTTA TGAACAAGCT
CTAAAAAAGG CACAAGAGTT AGGTTATGCT GAAAGTGATC CCACAAATGA TGTTGAAGGT
ATTGATACTG CCTACAAAGC CACTATCTTA AGTCAATTTG GATTTGGTAT GCCTATTGAT
TTTGATGATG TTAATTATAA GGGGATTTCT AGTATTCGCT CAGAGGATGT TGAAGTAGCT
CAGGAGATGG GCTTTGCCAT TAAGTTGGTA GCTGATCTTC GTGAAACTCC AACTGGTATA
AGTGTAGACG TTTCTCCGAC ACTAATTTCT CAAAAGCATC CCTTAGCTGC AGTTAATCAT
GTGATGAATG CAGTATTCAT TGAATCAATA GGGATTGGTC AGTCTCTTTT TTATGGACCA
GGTGCGGGAC AAAATCCAAC AGCAACCTCT GTTTTAGCGG ATATCATCGA TATTAGTCGT
AGTATTCGAT CACAGATAAA AATTAAGCCT ATGAATACTT ATCATTGTCC GTGTAGGTTG
TCAATGCAGT CTGATATTTT CAATGAGTAC TATCTAGCTA TTTCTTTGAG AAATGCTGAA
GATAGTGATA CACTTGGAAG GTACTTTGAG CAAGAAAATA TAGGTTTGAA AAATGTTATC
GAAAAAGCAT TGGGTGATAA ACAACAAGAA ATCTATGTAT TAACAGATGA AGTTAGCCAA
GAGAAAATAA CTCAATTTAT TGAGGAGTTT CCTGAGAGTG GTGTCATTCA GTTAATCAAT
GTTTTCAAAG TAATAGGAGG GTGA
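
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation. The following minimal sketch shows one way to strip the grouping and compute the GC fraction, which can then be compared with the GC content reported in the gene information. The helper names are hypothetical, and the rounding used by the source database is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence, pasted as it appears in the listing.
    first_line = "ATGACTATTA AAATAGCTTT ATTAGGTTTT GGAACGGTTG CTAAGGGTAT TCCATATTTG"
    seq = clean_sequence(first_line)
    print(len(seq), f"{gc_fraction(seq):.2%}")
```

Pasting the full listing into `clean_sequence` instead of the first line gives the value for the whole gene.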
 
Protein sequence
MTIKIALLGF GTVAKGIPYL LKENQHKLLS LEGEDIVIDK VLVRDNESRQ RFINQGFTYN 
FVTEINTILQ DSQIDIVVEL MGGIEPAKTY LSQALGFGKH IVTANKDLIA LHGKELMDLA
DARGLALFYE GAVAGGIPIL RTLSHSFASD KMTRLLGILN GTSNFMLTKM FEEGWSYEQA
LKKAQELGYA ESDPTNDVEG IDTAYKATIL SQFGFGMPID FDDVNYKGIS SIRSEDVEVA
QEMGFAIKLV ADLRETPTGI SVDVSPTLIS QKHPLAAVNH VMNAVFIESI GIGQSLFYGP
GAGQNPTATS VLADIIDISR SIRSQIKIKP MNTYHCPCRL SMQSDIFNEY YLAISLRNAE
DSDTLGRYFE QENIGLKNVI EKALGDKQQE IYVLTDEVSQ EKITQFIEEF PESGVIQLIN
VFKVIGG
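
The protein sequence can be checked against the gene sequence by translating the CDS codon by codon. The sketch below is an assumption-laden illustration, not the database's own method: it uses the standard codon assignments (translation table 11 shares these with the standard code and differs only in which start codons are permitted), and the function names are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def translate(cds: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First and last lines of the gene sequence as listed in the record.
    first = clean_sequence(
        "ATGACTATTA AAATAGCTTT ATTAGGTTTT GGAACGGTTG CTAAGGGTAT TCCATATTTG"
    )
    last = clean_sequence("GTTTTCAAAG TAATAGGAGG GTGA")
    print(translate(first))  # MTIKIALLGFGTVAKGIPYL
    print(translate(last))   # VFKVIGG
```

The two outputs match the first twenty and last seven residues of the protein sequence; the final TGA codon is the stop and is not part of the 427 aa product.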