Gene SAG1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1124 
Symbol 
ID1013928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1130926 
End bp1132302 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content38% 
IMG OID637316306 
Productaldehyde dehydrogenase family protein 
Protein accessionNP_688133 
Protein GI22537282 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.115238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATATA AAACAATTTA TCCGTACACT AACGAAGTAT TGCATGAATT CGATAATATC 
AGTGACAGTG ATCTCGAACA GTCATTAGAC ATAGCGCATG CCCTCTACAA GACATGGCGC
AAAGAAGATA ATGTTGAAGA ACGCCAAAAT CAATTGCATA AAGTAGCAGA TTTATTACGT
AAGGATCGTG ATAAGTACGC TGAAGTTATG ACTAAAGATA TGGGAAAACT TTTCACTGAA
GCACAAGGAG AAGTGGACTT GTGTGCAGAC ATAGCTGACT ATTATGCTGA TAATGGGCAA
AAGTTTTTAA AACCTGTTCC GTTAGAAAGT CCAAATGGAG AGGCTTACTA TTTGAAGCAA
GCTGTCGGTG TTTTACTTGC TGTCGAGCCA TGGAATTTCC CATTTTATCA GATTATGCGT
GTATTTGCCC CTAACTTTAT AGTGGGTAAT ACTATGCTTT TAAAACATGC TTCTATTTGT
CCTGCCTCAG CTCAAGCGTT TGAGGATTTA GTCCGAGAAG CAGGAGCTCC AGAAGGAGCG
TTTAAAAATA TTTTTGCTTC GTATGATCAA GTTTCAAACC TTATATCAGA TCCTCGTGTT
GCAGGTGTGT GTTTAACTGG TTCTGAGCGT GGTGGGGCAT CAATAGCAGC TGAAGCTGGT
AAAAATTTGA AAAAATCGTC TATGGAATTA GGTGGGAACG ATGCTTTTCT TATTTTAGAT
GATGCTGATT TTGATTTACT TAGTAAAACA ATATTCTTTG CGCGTTTATA TAATGCAGGT
CAAGTATGTA CATCTTCAAA ACGTTTCATT GTCATGGCTG ATAAGTATGA CGAATTTGTA
AATATGGTAG TTGAGACCTT TAAGTCTGCT AAATGGGGCG ACCCAATGGA TTCGGAAACA
ACCTTGGCTC CACTGTCATC TGCAGGTGCA AAGGATGATG TTTTAAAACA AATTAAATTG
GCGGTAGACC ACGGTGCTGA GGTAGTTTTT GGAAATGACA CTATAGATCA TCCAGGAAAC
TTCGTCATGC CAACTGTTTT AACGAATATC ACTAAAGCAA ATCCAATCTA TAATCAAGAA
ATTTTCGGCC CCGTAGCCTC TATCTATAAA GTGGATACTG AAGAAGAAGC TATTGCTTTA
GCTAATGATT CTAGTTATGG TTTAGGAAGC ACTGTTTTTT CTTCTGATCC AGAACATGCT
AAAAAAGTGG CGGCTCAGAT TGAAACAGGG ATGACATTTA TTAATTCAGG GTGGACATCA
TTACCGGAAT TACCGTTTGG AGGTATTAAA AATTCAGGAT ATGGTCGTGA GTTGAGCCAA
CTTGGATTTG ATGCCTTTGT CAACGAACAT TTGGTATTTA CACCAAATAG TGATTAA
 
Protein sequence
MAYKTIYPYT NEVLHEFDNI SDSDLEQSLD IAHALYKTWR KEDNVEERQN QLHKVADLLR 
KDRDKYAEVM TKDMGKLFTE AQGEVDLCAD IADYYADNGQ KFLKPVPLES PNGEAYYLKQ
AVGVLLAVEP WNFPFYQIMR VFAPNFIVGN TMLLKHASIC PASAQAFEDL VREAGAPEGA
FKNIFASYDQ VSNLISDPRV AGVCLTGSER GGASIAAEAG KNLKKSSMEL GGNDAFLILD
DADFDLLSKT IFFARLYNAG QVCTSSKRFI VMADKYDEFV NMVVETFKSA KWGDPMDSET
TLAPLSSAGA KDDVLKQIKL AVDHGAEVVF GNDTIDHPGN FVMPTVLTNI TKANPIYNQE
IFGPVASIYK VDTEEEAIAL ANDSSYGLGS TVFSSDPEHA KKVAAQIETG MTFINSGWTS
LPELPFGGIK NSGYGRELSQ LGFDAFVNEH LVFTPNSD