Gene SAG1444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1444 
Symbol 
ID1014253 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1457503 
End bp1458975 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content37% 
IMG OID637316619 
Productproton/peptide symporter family protein 
Protein accessionNP_688441 
Protein GI22537590 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID[TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000293034 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAA CAAAATCATT CTTTGGACAT CCTCGTGGTT TGTCCACTCT TTTCTTTACT 
GAAATGTGGG AAAGATTCTC ATACTATGGT ATGCGTGCTA TTTTGCTGTA CTATATGTAC
TATAGTGTTT CTCAAGGTGG CCTTGGTATG GACAAGACTG TCGCTGCATC AATCATGGCT
GTCTATGGTT CACTGGTTTA TCTCTCATCA GTAATTGGTG GTTTTGTCAG CGACCGTATT
CTAGGTAGTC GTAAAACTGT TCTGTATGGT GGTATTCTAA TCATGCTAGG TCATATTGCT
CTAGCTACAC CATTTGGTCA AACAGCTCTA TTCATTTCGA TTGCTCTAAT TATCCTTGGT
ACTGGACTAT TAAAACCAAA TGTATCAGAG ATGGTTGGTA ATTTATATGG AGAAAATGAT
TCTCGTCGTG ATGCTGGTTT TAGTATCTTT GTTTTTGGTA TTAACCTTGG TGCTTTTATT
TCACCCATTG TAGTGGGGTA CCTCGGACAA GAAGTAAATT TCCATCTTGG TTTCTCACTT
GCTGCTATTG GTATGTTCTT TGGTCTCCTC CAATATACCT TAGATGGAAA AAAATATTTG
ACTGAAGAGA GTCTCAGACC AAACGATCCT TTAAGTCCTG AAGAAAAGTC CTCTCTATAT
AAAAAAGTTG GGCTTATCCT TATTGGTATT GTTATTGTAC TTATTCTACT TCACTTGATG
CATATGCTAA CAATTGAAGT AATTATCGAT ATTTTTAGTA TTATTGCAAT CGCCATCCCA
ATTATTTATT TTATCAAGAT TTTAAGTAGT AAAAAGATTT CTTCTGTTGA GCGTTCTCGA
GTGTGGGCAT ATATCCCTCT CTTTATCGCC TCAATTCTAT TTTGGTCAAT TGAAGAACAA
GGTTCAGTTG TCTTAGCCTT ATTTGCAGAT GAACAAACAA AACTTTACCT TAACTTCTTT
GGGCATCATA TTAATTTCCC ATCAAGTTAT TTCCAAAGTA TGAACCCTCT CTTCATTATG
CTTTATGTAC CATTCTTTGC TTGGTTATGG GCTAAATGGG GAAGTAAGCA ACCTTCATCA
CCTAAAAAAT TTGCGTATGG ACTTTTCTTT GCTGGAGCTT CATTCTTATG GATGATGCTA
CCAGGTTTAC TCTTTGGAGT TAACGCTAAA GTAAGCCCTC TTTGGTTAAC AATGAGTTGG
GCTATTGTCA TCGTTGGGGA AATGCTAATC TCACCAGTTG GATTATCAGC AACTAGTAAG
CTCGCACCTA AAGCATTCCA AGCTCAAATG ATGAGCATCT GGTTCTTAAG TAATGCTGCA
GCACAAGCTA TTAACGCTCA AATCGTTAAA TTGTACACAC CTGATACTCA AACTCTTTAT
TATGGTGTTG TTGGTGGTAT AACAGTTGTA TTTGGATTTA TCCTCTTATT TTATGTTCCA
CGCATTGAAA AACTAATGTC TGGAGTTAAA TAA
 
Protein sequence
MEKTKSFFGH PRGLSTLFFT EMWERFSYYG MRAILLYYMY YSVSQGGLGM DKTVAASIMA 
VYGSLVYLSS VIGGFVSDRI LGSRKTVLYG GILIMLGHIA LATPFGQTAL FISIALIILG
TGLLKPNVSE MVGNLYGEND SRRDAGFSIF VFGINLGAFI SPIVVGYLGQ EVNFHLGFSL
AAIGMFFGLL QYTLDGKKYL TEESLRPNDP LSPEEKSSLY KKVGLILIGI VIVLILLHLM
HMLTIEVIID IFSIIAIAIP IIYFIKILSS KKISSVERSR VWAYIPLFIA SILFWSIEEQ
GSVVLALFAD EQTKLYLNFF GHHINFPSSY FQSMNPLFIM LYVPFFAWLW AKWGSKQPSS
PKKFAYGLFF AGASFLWMML PGLLFGVNAK VSPLWLTMSW AIVIVGEMLI SPVGLSATSK
LAPKAFQAQM MSIWFLSNAA AQAINAQIVK LYTPDTQTLY YGVVGGITVV FGFILLFYVP
RIEKLMSGVK