Gene SAG1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1472 
SymbolpepS 
ID1014281 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1491811 
End bp1493052 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content39% 
IMG OID637316644 
Productaminopeptidase PepS 
Protein accessionNP_688466 
Protein GI22537615 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTTAC AAGATTTCGA CAACCTTTTA AAAAAATATG CCCAATTAAT TATTTCTAAA 
GGTTTAAATG TCCAAAAAGG GCACACTCTC GCTTTAACAA TCGATGTGGA ACAAGTCCAC
TTAGCAAGGC TTTTAACTGA AGCCGCTTAT GAAAAGGGAG CAAGTGAAGT TATTGTTGAT
TATACAGATG ATTTTATCAC GCGCCAGCGA CTACTTCATG CTTCAGACGA AGTTCTCACG
AATGTTCCAC AGTATACCGT TGATAAATCT TTAGCACTAT TAAATAAGAA GGCTAGTCGA
TTAGTTGTGA AATCTTCTAA CCCTAACGCT TTCGCTACTG TTGATCCTAA ACGTTTATCT
GAAACAACTA GAGCAACCGC TATTGCCTTA GAGGAACAAA GTAGAGCAAT ACAAGCTAAT
AAAGTATCTT GGAACGTGGC TGCAGCTGCT GGTAGAGAAT GGGCTGCACT TGTCTTCCCA
GAATTAAAAA CAAGCGACCA ACAAGTTGAT GCTCTTTGGG ATACCATTTT CAAATTAAAT
CGTATTTATG AAGATGATCC TATTGCTGCT TGGGACGCAC ATGAAGCTAA ATTATTAGAA
AAAGCTACTA GACTAAATCA AGAACAATTT GATGCTCTTC ATTATACCGC ACCAGGTACA
GATTTAACGC TTGGTATGCC TAAAAATCAT ATTTGGGAGG CAGCCGGTAG TCTCAACGCT
CAGGGAGAGA CTTTTATCGC TAATATGCCT ACTGAAGAAA TCTTTTCAGC ACCTGATTAC
CGTCGTGCAG ATGGGTATGT GACAAGTACA AAACCTCTCA GTTATGCTGG CGTTATTATC
GAAAATATGA CATTTACCTT TAAAGACGGT AAAATTATCA ATGTCACTGC AGAAAAAGGG
CAAGAAACAG TCCAACGCTT AATCGAGGAA AATGATGGGG CAAGATCGCT TGGGGAAGTT
GCACTTGTCC CACATAAAAC ACCAATTTCA CTATCTGGAC TGATTTTCTT TAATACTTTA
TTCGATGAAA ATGCCTCTAA TCACCTCGCT ATTGGAACTG CATATGCCTT CAATGTAGAA
GGAGGAACAG AAATGACAAG TCAAGAATTG GATGAAGCTG GTTTAAATCG TTCTTCAACA
CATGTTGATT TTATGATTGG TTCAGAACAA ATGGATATTG ATGGTATTCG TGCAGATGGA
ACTGCTGTCC CAATCTTTAG AAATGGCGAA TGGGCTATTT AA
 
Protein sequence
MVLQDFDNLL KKYAQLIISK GLNVQKGHTL ALTIDVEQVH LARLLTEAAY EKGASEVIVD 
YTDDFITRQR LLHASDEVLT NVPQYTVDKS LALLNKKASR LVVKSSNPNA FATVDPKRLS
ETTRATAIAL EEQSRAIQAN KVSWNVAAAA GREWAALVFP ELKTSDQQVD ALWDTIFKLN
RIYEDDPIAA WDAHEAKLLE KATRLNQEQF DALHYTAPGT DLTLGMPKNH IWEAAGSLNA
QGETFIANMP TEEIFSAPDY RRADGYVTST KPLSYAGVII ENMTFTFKDG KIINVTAEKG
QETVQRLIEE NDGARSLGEV ALVPHKTPIS LSGLIFFNTL FDENASNHLA IGTAYAFNVE
GGTEMTSQEL DEAGLNRSST HVDFMIGSEQ MDIDGIRADG TAVPIFRNGE WAI