Gene SAG1736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1736 
SymbolpepX 
ID1014545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1730956 
End bp1733241 
Gene Length2286 bp 
Protein Length761 aa 
Translation table11 
GC content36% 
IMG OID637316904 
Productx-prolyl-dipeptidyl aminopeptidase 
Protein accessionNP_688726 
Protein GI22537875 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00688929 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTATA ATCAATTCAG CTACATTCCA ACTAAACCTA ACGAAGCTTT TGAAGAGCTC 
AAAGGACTAG GTTTTCCATT AAACAAAAAG AATTCTGATA AAGCTAATTT GGAAGCTTTT
CTCAGACATT CTTTTTTAAA TCAAACTGAT ACTGATTACG CTCTTAGTCT CCTTATCGTT
GATGCAAAAA CCGATGCTCT AACCTTTTTT AAATCAAATA GTGACTTAAC ACTAGAAAAT
TTACAATGGA TTTATTTACA GTTATTAGGC TTTATCCCTT TTGTAGACTT TAAAGACCCT
AAAGCATTCT TACAAGATAT TAACTTCCCA GTCTCATATG ACAATATTTT TCAAAGTCTA
CATCACTTAC TCGCCTGTCG TGGAAAATCT GGCAATACAT TAATCGACCA ATTAGTTGCT
GATGGTTTAC TTCATGCAGA TAATCACTAC CATTTTTTCA ATGGGAAGTC TCTGGCCACT
TTCAATACTA ACCAATTGAT TCGCGAAGTT GTCTATGTTG AAACATCCTT AGATACTATG
TCTAGTGGTG AACATGATTT AGTAAAAGTT AACATTATCA GACCCACTAC CGAGCATACT
ATCCCTACGA TGATGACAGC TAGCCCCTAT CATCAAGGTA TCAATGATCC TGCCGCAGAC
CAAAAAACAT ACCAAATGGA GGGTGCGCTA GCAGTTAAAC AACCTAAACA CATACAAGTT
GACACAAAAC CATTTAAAGA AGAAGTAAAA CATCCTTCAA AATTACCCAT CAGCCCTGCA
ACTGAAAGCT TCACACACAT TGACAGTTAT AGTCTCAATG ACTATTTTCT TTCTCGGGGT
TTTGCTAATA TATACGTTTC AGGTGTGGGT ACTGCTGGCT CTACGGGTTT CATGACCAGT
GGGGATTACC AACAAATACA AAGCTTTAAA GCAGTCATTG ATTGGTTAAA TGGTAAGGTT
ACTGCATTCA CAAGTCATAA ACGAGATAAA CAAGTCAAGG CTAATTGGTC AAATGGCCTT
GTAGCAACCA CAGGTAAATC TTATCTTGGT ACCATGTCAA CTGGTTTAGC AACAACTGGC
GTTGAGGGGC TGAAAGTCAT TATCGCTGAA GCCGCAATCT CCACATGGTA TGATTATTAT
CGAGAAAATG GGCTTGTGTG TAGTCCAGGC GGCTACCCCG GTGAAGATTT AGACGTTTTA
ACAGAATTAA CATACTCACG AAACCTCTTA GCTGGTGATT ACATCAAAAA CAACGATTGC
TATCAAGCAT TGTTAAATGA ACAATCAAAA GCAATTGACC GTCAAAGTGG GGATTACAAC
CAATACTGGC ATGACCGTAA TTACCTAACT CACGTCAATA ATGTCAAAAG TCGAGTAGTT
TACACTCATG GACTACAGGA TTGGAATGTT AAGCCAAGAC ATGTCTATAA AGTTTTCAAT
GCATTGCCTC AAACCATCAA AAAACACCTT TTTTTACATC AAGGTCAACA TGTGTATATG
CATAATTGGC AGTCGATTGA TTTTCGTGAA AGCATGAATG CCTTACTAAG CCAAGAACTA
CTTGGCATTG ACAATCATTT CCAATTAGAA GAGGTCATTT GGCAAGATAA TACTACTGAG
CAAACTTGGC AAGTTTTAGA TGCTTTCGGA GGAAACCATC AAGAGCAAAT TGGTTTAGGT
GATAGTAAAA AACTTATTGA TAACCATTAT GACAAAGAAG CCTTTGATAC TTATTGTAAA
GACTTCAATG TGTTCAAAAA TGATCTTTTC AAGGGAAATA ATAAAACCAA TCAAATCACT
ATTAATCTTC CTCTAAAGAA AAATTATCTC CTGAATGGAC AGTGCAAACT CCATCTACGT
GTTAAAACTA GTGACAAAAA GGCCATTTTA TCAGCCCAAA TCTTAGACTA TGGTCCTAAA
AAACGATTCA AAGATACACC AACCATCAAA TTCTTAAACA GCCTTGATAA TGGTAAAAAT
TTTGCCAGAG AAGCTTTACG TGAACTCCCG TTTACTAAAG ATCATTATCG TGTCATCAGT
AAAGGTGTCT TGAACCTTCA AAATCGTACA GACTTACTTA CAATTGAGGC TATCGAGCCA
GAACAATGGT TTGATATCGA GTTTAGCCTC CAACCAAGTA TATATCAATT GAGTAAAGGT
GATAATCTAA GGATTATCCT TTATACAACT GATTTTGAAC ATACCATTCG AGATAATGCT
AGTTACTCTA TAACAGTAGA TTTAAGTCAA TCTTATTTAA CTATCCCAAC TAATCAAGGA
AATTAA
 
Protein sequence
MRYNQFSYIP TKPNEAFEEL KGLGFPLNKK NSDKANLEAF LRHSFLNQTD TDYALSLLIV 
DAKTDALTFF KSNSDLTLEN LQWIYLQLLG FIPFVDFKDP KAFLQDINFP VSYDNIFQSL
HHLLACRGKS GNTLIDQLVA DGLLHADNHY HFFNGKSLAT FNTNQLIREV VYVETSLDTM
SSGEHDLVKV NIIRPTTEHT IPTMMTASPY HQGINDPAAD QKTYQMEGAL AVKQPKHIQV
DTKPFKEEVK HPSKLPISPA TESFTHIDSY SLNDYFLSRG FANIYVSGVG TAGSTGFMTS
GDYQQIQSFK AVIDWLNGKV TAFTSHKRDK QVKANWSNGL VATTGKSYLG TMSTGLATTG
VEGLKVIIAE AAISTWYDYY RENGLVCSPG GYPGEDLDVL TELTYSRNLL AGDYIKNNDC
YQALLNEQSK AIDRQSGDYN QYWHDRNYLT HVNNVKSRVV YTHGLQDWNV KPRHVYKVFN
ALPQTIKKHL FLHQGQHVYM HNWQSIDFRE SMNALLSQEL LGIDNHFQLE EVIWQDNTTE
QTWQVLDAFG GNHQEQIGLG DSKKLIDNHY DKEAFDTYCK DFNVFKNDLF KGNNKTNQIT
INLPLKKNYL LNGQCKLHLR VKTSDKKAIL SAQILDYGPK KRFKDTPTIK FLNSLDNGKN
FAREALRELP FTKDHYRVIS KGVLNLQNRT DLLTIEAIEP EQWFDIEFSL QPSIYQLSKG
DNLRIILYTT DFEHTIRDNA SYSITVDLSQ SYLTIPTNQG N