Gene SAG0174 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0174 
SymbolpepA 
ID1012948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp194950 
End bp196017 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content40% 
IMG OID637315352 
Productglutamyl-aminopeptidase 
Protein accessionNP_687209 
Protein GI22536358 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1363] Cellulase M and related proteins 
TIGRFAM ID[TIGR03107] glutamyl aminopeptidase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGATT TATTTAACAA AATTAAAACC GTAACTGAGC TTGATGGGAT TGCTGGCTAT 
GAACACAATA TCCGCAACTT CCTTCGTCAA GAAATAACTC CTTTAGTTGA TCAAGTTGAG
ACAGACGGAC TTGGTGGAAT TTTTGGAGTT AAAAATACTC ATGAGACTAA TGCTCCTAAA
GTCATGGTTG CTGCCCATAT GGATGAAGTC GGCTTTATGG TTAGTCATAT TCAGCCAGAT
GGAACATTTC GTGTACTTGA GGTTGGAGGA TGGAATCCCC TAGTAGTCAG CTCACAACGC
TTTACCCTCT ACACACGTTC TGGTGATGCT ATTCCTGTTA TATCAGGCTC AGTTCCTCCT
CACTTTCTTC GTGGACAAAG CGGTGGAACA ACATTACCCA AAATTAGTGA CATTGTTTTT
GATGGAGGAT TCACAGATAA AAATGAAGCT GAAAGCTTTG GCATTGCTCC TGGCGATATC
ATTGTTCCTA AATCTGAAAC CATTTTAACT GCAAATCAAA AACATATTAT GTCAAAAGCT
TGGGATAATC GCTATGGTGT GCTTATGGTG ACCGAATTGC TAAAAAGCTT AAAAGATCAA
AGTCTTAGCA ACACACTTAT TGCTGGGGCA AATGTTCAAG AAGAAGTCGG ACTTCGTGGC
GCACATGTTT CAACAACTAA ATTCAACCCA GATATCTTCT TAGCTGTCGA TTGTTCCCCA
GCTGGAGATA TTTATGGGGA ACAAGGCAAA ATAGGAGAGG GAACCTTAAT CCGTTTTTAT
GATCCCGGAC ATATCATGCT TAAAGATATG AGAGATTTCT TACTTACAAC AGCTGAAGAA
GCAGGTATAA AATACCAATA TTATGCTGCA AATGGTGGTA CCGATGCTGG GGCTGCTCAC
CTAAAAAATA GTGGTATTCC TTCTACAACT ATCGGTGTCT GTGCACGCTA CATTCATTCT
CATCAAACAC TCTACGCTAT GGATGATTTT CTACAAGCAC AAGCTTACCT TCAGGCCATC
GTTAACAAAT TAGACCGCTC GACGGTGGAT ATTATTAAAG GTTATTAA
 
Protein sequence
MSDLFNKIKT VTELDGIAGY EHNIRNFLRQ EITPLVDQVE TDGLGGIFGV KNTHETNAPK 
VMVAAHMDEV GFMVSHIQPD GTFRVLEVGG WNPLVVSSQR FTLYTRSGDA IPVISGSVPP
HFLRGQSGGT TLPKISDIVF DGGFTDKNEA ESFGIAPGDI IVPKSETILT ANQKHIMSKA
WDNRYGVLMV TELLKSLKDQ SLSNTLIAGA NVQEEVGLRG AHVSTTKFNP DIFLAVDCSP
AGDIYGEQGK IGEGTLIRFY DPGHIMLKDM RDFLLTTAEE AGIKYQYYAA NGGTDAGAAH
LKNSGIPSTT IGVCARYIHS HQTLYAMDDF LQAQAYLQAI VNKLDRSTVD IIKGY