Gene SAG1389 details


Gene Information

Locus tag: SAG1389
Symbol: pepT
ID: 1014198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1396195
End bp: 1397415
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 35%
IMG OID: 637316565
Product: peptidase T
Protein accession: NP_688387
Protein GI: 22537536
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2195] Di- and tripeptidases
TIGRFAM ID: [TIGR01882] peptidase T
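
The start and end coordinates, the gene length, and the protein length listed above are mutually consistent. The sketch below checks this under two assumptions that the record does not state explicitly: coordinates are 1-based and inclusive, and the CDS length includes the stop codon. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS includes one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1396195, 1397415)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```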


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0424518
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTACG AAAAGCTTTT AGAACGATTT TTAACATACG TCAAAATAAA TACAAGAAGT 
AATCCTAATA GTACACAAAC GCCAACAACT CAAAGTCAAG TTGACTTTGC TTTAACAGTT
TTAAAACCAG AAATGGAAGC AATTGGTTTA AAAGATGTTC ATTATTTACC TTCTAATGGG
TATTTGGTTG GAACCTTACC TGCTACAAGC GACCGCTTAC GCCATAAAAT AGGTTTTATA
TCCCATATGG ATACAGCTGA TTTCAATGCT GAAAATATTA CTCCACAAAT TGTTGACTAT
AAAGGTGGAG ATATTGAACT TGGAGACTCA GGTTACATTT TAAGTCCAAA AGATTTTCCA
AATTTAAATA ATTACCATGG GCAAACACTG ATTACAACAG ATGGTAAAAC CTTACTGGGA
GCAGACGATA AGTCTGGTAT AGCAGAAATC ATGACAGCTA TGGAATATTT GGCTTCGCAT
CCAGAAATTG AGCATTGTGA AATTAGAGTT GGCTTTGGAC CAGACGAAGA AATTGGTATA
GGTGCAGATA AATTTGATGT TAAAGATTTT GATGTTGATT TTGCCTATAC AGTGGATGGT
GGACCACTAG GAGAATTACA GTATGAAACC TTTAGTGCAG CTGGTTTGGA GCTTACATTT
GAAGGACGAA ACGTTCACCC TGGAACTGCA AAAAATCAAA TGATTAATGC TTTACAGCTT
GCTATGGATT TTCATAGTCA ATTACCAGAA AATGAACGTC CTGAACAAAC AGATGGCTAT
CAAGGATTTT ATCACTTATA TGATTTAAGT GGAACAGTTG ATCAAGCTAA AAGTTCATAT
ATCATTCGAG ATTTTGAGGA AGTTGATTTC TTAAAGCGTA AGCACTTGGC TCAAGATATC
GCTGATAATA TGAATGAAGC ATTACAATCT GAACGTGTAA AGGTTAAACT ATACGATCAA
TATTACAACA TGAAGAAAGT TATTGAAAAA GACATGACAC CTATCAACAT TGCTAAAGAA
GTAATGGAAG AGTTAGACAT CAAGCCAATC ATAGAACCGA TTCGTGGTGG TACAGATGGC
TCTAAAATTT CCTTTATGGG AATCCCTACT CCTAATCTTT TTGCAGGTGG TGAAAACATG
CATGGACGCT TTGAATTCGT TAGTCTACAA ACAATGGAAA AAGCAGTTGA TGTTATTTTA
GGCATCGTTG CTAAGGATTA G
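
The gene sequence is displayed in blocks of 10 bases, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. It uses only the first displayed line as sample input, and the helper names are hypothetical. According to the gene information, the full sequence should come out at 1221 bases with a GC content of about 35%.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, used here as sample input.
first_line = "ATGTCTTACG AAAAGCTTTT AGAACGATTT TTAACATACG TCAAAATAAA TACAAGAAGT"

seq = clean_sequence(first_line)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.1%}")   # GC fraction of this 60-base sample only
```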
 
Protein sequence
MSYEKLLERF LTYVKINTRS NPNSTQTPTT QSQVDFALTV LKPEMEAIGL KDVHYLPSNG 
YLVGTLPATS DRLRHKIGFI SHMDTADFNA ENITPQIVDY KGGDIELGDS GYILSPKDFP
NLNNYHGQTL ITTDGKTLLG ADDKSGIAEI MTAMEYLASH PEIEHCEIRV GFGPDEEIGI
GADKFDVKDF DVDFAYTVDG GPLGELQYET FSAAGLELTF EGRNVHPGTA KNQMINALQL
AMDFHSQLPE NERPEQTDGY QGFYHLYDLS GTVDQAKSSY IIRDFEEVDF LKRKHLAQDI
ADNMNEALQS ERVKVKLYDQ YYNMKKVIEK DMTPINIAKE VMEELDIKPI IEPIRGGTDG
SKISFMGIPT PNLFAGGENM HGRFEFVSLQ TMEKAVDVIL GIVAKD
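
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual TCAG ordering and translates the first 60 bases. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the set of permitted start codons. As a simplification, this sketch does not handle alternative start codons, which is not an issue here because the gene starts with ATG. All names in the code are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First displayed line of the gene sequence (20 codons).
first_line = "ATGTCTTACG AAAAGCTTTT AGAACGATTT TTAACATACG TCAAAATAAA TACAAGAAGT"
print(translate(first_line))  # MSYEKLLERFLTYVKINTRS
```

The output matches the first 20 residues of the protein sequence listed above.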