Gene SAG0535 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0535 
Symbol 
ID1013338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp550545 
End bp552065 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content35% 
IMG OID637315736 
Productzinc ABC transporter, zinc-binding adhesion liprotein 
Protein accessionNP_687564 
Protein GI22536713 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin
[COG3443] Predicted periplasmic or secreted protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0968782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAGA AATTTCTTTT ATTGATGAGC TTTGTAGCTA TGTTTGCAGC TTGGCAACTT 
GTTCAAGTTA AACAAGTTTG GGCTGATAGT AAACTTAAAG TGGTAACAAC TTTTTACCCA
GTTTATGAGT TTACAAAAAA TGTCGTTGGT GATAAAGCTG ATGTATCTAT GTTAATTAAA
GCAGGTACAG AACCGCATGA TTTTGAACCA TCAACTAAAA ACATCGCTGC CATCCAAGAT
TCAAATGCTT TTGTTTACAT GGATGATAAC ATGGAAACTT GGGCTCCAAA AGTAGCTAAG
TCAGTTAAAT CCAAAAAAGT AACAACTATT AAAGGTACTG GCGATATGTT ACTTACTAAA
GGCGTCGAAG AAGAAGGTGA AGAACATGAA GGACATGGTC ATGAAGGGCA TCATCATGAA
CTTGACCCAC ACGTATGGTT GTCTCCAGAA CGTGCGATTT CTGTTGTAGA AAACATCCGT
AATAAATTTG TCAAAGCTTA TCCAAAAGAT GCAGCTTCAT TTAACAAAAA TGCAGATGCT
TACATTGCAA AATTAAAAGA GCTTGACAAA GAATACAAAA ATGGTTTGTC AAATGCTAAA
CAAAAGAGTT TTGTGACTCA ACACGCAGCG TTTGGTTACA TGGCGCTTGA TTACGGTTTA
AATCAAGTTC CAATTGCTGG TCTTACTCCA GATGCAGAAC CTTCATCAAA ACGTTTAGGC
GAATTAGCTA AATACATCAA GAAATATAAC ATCAACTACA TTTATTTTGA AGAAAATGCT
TCAAATAAAG TTGCTAAAAC TTTAGCAGAT GAAGTTGGCG TGAAAACAGC TGTGCTTAGT
CCACTTGAAG GACTTTCTAA AAAAGAAATG GCAGCTGGCG AAGATTACTT CTCAGTTATG
AGACGTAATT TGAAAGTTCT TAAAAAGACA ACAGATGTTG CAGGTAAAGA AGTAGCTCCT
GAAGAAGATA AAACTAAAAC AGTTGAAACA GGTTACTTTA AAACTAAAGA TGTTAAAGAC
CGTAAATTGA CAGATTACTC TGGTAATTGG CAATCAGTAT ATCCTCTTCT TCAAGATGGG
ACACTTGATC CAGTTTGGGA TTACAAAGCT AAATCTAAAA AAGATATGAC TGCTGCAGAG
TACAAAAAAT ATTATACAGC AGGTTACAAG ACTGACGTAG AATCAATCAA GATTGATGGT
AAAAAACATC AAATGACCTT TGTACGTAAT GGTAAATCAC AAACATTTAC ATACAAATAT
GCAGGTTACA AAATCTTAAC TTATAAAAAA GGTAATCGTG GAGTACGTTA TCTCTTTGAA
GCTAAAGAAA AAGATGCTGG TCAATTCAAA TATATCCAAT TTAGTGACCA TGGTATTAAA
CCGAATAAAG CTGAACACTT CCATATCTTC TGGGGTTCAG AAAGCCAAGA AAAATTATTT
GAGGAAATGG AAAACTGGCC AACATACTTC CCAGCTAAAA TGTCTGGACG TGAAGTTGCC
CAAGACCTTA TGTCTCATTA A
 
Protein sequence
MRKKFLLLMS FVAMFAAWQL VQVKQVWADS KLKVVTTFYP VYEFTKNVVG DKADVSMLIK 
AGTEPHDFEP STKNIAAIQD SNAFVYMDDN METWAPKVAK SVKSKKVTTI KGTGDMLLTK
GVEEEGEEHE GHGHEGHHHE LDPHVWLSPE RAISVVENIR NKFVKAYPKD AASFNKNADA
YIAKLKELDK EYKNGLSNAK QKSFVTQHAA FGYMALDYGL NQVPIAGLTP DAEPSSKRLG
ELAKYIKKYN INYIYFEENA SNKVAKTLAD EVGVKTAVLS PLEGLSKKEM AAGEDYFSVM
RRNLKVLKKT TDVAGKEVAP EEDKTKTVET GYFKTKDVKD RKLTDYSGNW QSVYPLLQDG
TLDPVWDYKA KSKKDMTAAE YKKYYTAGYK TDVESIKIDG KKHQMTFVRN GKSQTFTYKY
AGYKILTYKK GNRGVRYLFE AKEKDAGQFK YIQFSDHGIK PNKAEHFHIF WGSESQEKLF
EEMENWPTYF PAKMSGREVA QDLMSH