Gene SAG0534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0534 
Symbol 
ID1013337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp549006 
End bp550403 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content37% 
IMG OID637315735 
Productdipeptidase 
Protein accessionNP_687563 
Protein GI22536712 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4690] Dipeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTGTA CAACAATATT GGTTGGTAAA AAGGCTTCTT ATGATGGTTC GACTATGATC 
GCTAGAACGG AAGACTCTGT TAATGGCGAT TTCACACCCA AAAAATTAAA GGTAATGACA
TCTAAAGATC AACCGCGTCA TTACAAATCA GTTTTATCAA ATTTTGAAGT AGATTTACCA
GATAACCCAC TTCCTTATAC TTCAGTACCG GACGCATTGG GAAAAGATGG TATATGGGGT
GAAGCCGGTA TTAACAGTAA AAATGTAGCG ATGAGTGCTA CAGAAACTAT TACAACGAAT
TCCCGCGTTT TGGGTGCAGA TCCTTTGGTT TCAGATGGTA TAGGGGAAGA GGATATACTC
ACTTTAGTGC TTCCCTATAT TCAGTCAGCG CGAGAAGGTG TGGAGCGTTT AGGTGCTATT
TTGGAAAAAT ATGGAACCTA TGAATCAAAT GGTATTGCTT TTTCAGATAC CGAAGAAATA
TGGTGGTTAG AAACAATTGG TGGGCATCAT TGGATTGCTC GTCGCGTACC TGATGATGTT
TATGTTACTA ATCCTAACCA ACTAGGAATT GATCATTTTG AATTTAATAA CTGTGATGAC
TACATGTGCT CTAGTGATTT GAAAGAGTTT ATCGAACAAT ACCATTTAGA TTTGACCTAT
TCTAATGAGC ATTTCAATCC TCGATATGCT TTTGGTAGCC AACGTGATAA AGATCGTCAT
TACAACACAC CAAGAAGTTG GGCAATGCAG CGTTTTTTAA ATCCTGAAAT TGAACAGGAT
CCACGTAGCT TGTTTATTCC CTGGTGTCAA AAGCCTTACC GAAAAATTAC TGTTGAGGAT
ATTAAATATG TGTTGAGTGA TCATTATCAA GACAGTGTGT ATGACCCATA TGGACCAGAA
GGGGATGCGG TAAGTAGGAG AGCTTTTCGT TCAGTTGGTA TCAACCGAAC TAGTCAAACG
TCTATTCTAC AATTACGACC AAATAAATCA CTTGAAACGA CAGGTGTTCA ATGGTTATCT
TATGGCTCTA TGCCATTTGC AACCATGGTG CCGTTGTTTA CACAAGTTGA GACTGTACCA
AACTATTTTT CGAATACAAC CAAGGATGCT TCAACAGATA ATTTTTATTG GACCAATCGT
TTAATTGCAG CTCTAGCAGA TCCACACTTT TATCAACATG AAGCTGATAT TGAAAGCTAT
ATCGAGAGAA CGATGGCTCA AGGACATGCA CATATTAACG GTGTTGATAG AGAAGTTGCT
GAGAATAAAG AGATTGATTT TCAACAGAAA AATCAAGAAA TGAGTGACTA TATCCAAAAA
GAAAGCCAAG AATTGTTAAA TCGTATTCTA TTTGATGCAA GTAATTTAAT GACAAATCGC
TTTTCAATGG GAGATTAA
 
Protein sequence
MACTTILVGK KASYDGSTMI ARTEDSVNGD FTPKKLKVMT SKDQPRHYKS VLSNFEVDLP 
DNPLPYTSVP DALGKDGIWG EAGINSKNVA MSATETITTN SRVLGADPLV SDGIGEEDIL
TLVLPYIQSA REGVERLGAI LEKYGTYESN GIAFSDTEEI WWLETIGGHH WIARRVPDDV
YVTNPNQLGI DHFEFNNCDD YMCSSDLKEF IEQYHLDLTY SNEHFNPRYA FGSQRDKDRH
YNTPRSWAMQ RFLNPEIEQD PRSLFIPWCQ KPYRKITVED IKYVLSDHYQ DSVYDPYGPE
GDAVSRRAFR SVGINRTSQT SILQLRPNKS LETTGVQWLS YGSMPFATMV PLFTQVETVP
NYFSNTTKDA STDNFYWTNR LIAALADPHF YQHEADIESY IERTMAQGHA HINGVDREVA
ENKEIDFQQK NQEMSDYIQK ESQELLNRIL FDASNLMTNR FSMGD