Gene SAG1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1003 
Symbol 
ID1013807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1010521 
End bp1013151 
Gene Length2631 bp 
Protein Length876 aa 
Translation table11 
GC content33% 
IMG OID637316187 
Productpermease, putative 
Protein accessionNP_688014 
Protein GI22537163 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0149153 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAGA CATTTTGGAA AGATATCTAT CGATCAATTA CGACTTCAAA AGGTAGATTT 
TCATCAATCC TCTTACTTAT GATGTTAGGC TCATTTGCTT TTATAGGACT GAAGGTCTCA
GCACCTAATA TGCAAAGAAC AGCACAAAAT TACCTTGCTC ATCATCATGT TATGGATATT
ACCGTTTTCA ATTCTTGGGG ACTTGACAAA CATGATCAAA CTGTTTTAGA AAGCTTAAAA
GGATCACAAG TTGAATTTTC TTACTTTGTA GATACAACCC CTCAACAGAA TAGTAAATCT
TATCGATTGT ATTCCAATAC CAAAACGATA TCAACCTTTG ATCTTGTTAA GGGACGTCTC
CCTTTAAATA AATCAGAAAT TGCTCTCTCA TTTCAGGAAC GAAAAAAATA TGCAATAGGT
GATAAAATTA ATTTTAAACA AGATAAAAAC AAACTGTTTT CAAATACAGG TCCTCTAACT
ATTGTAGGTT TTGTTAATTC TACCGAGATT TGGTCTAAGA CAAACCTTGG TAGTTCTCAA
ACAGGGGACG GCGACTTAGA TAGTTATGGC GTATTAGATA AGACAGCATT TCACTCTCCA
GTATATACAA TGGCAAGGGT AACTTTTAAA GATCTTAGGT TGATTAACCC TTTCTCAATA
AGTTACAAAG AAAAAGTAGC GAAATACCAA GAAAAGGTAT CGCGAAAATT AAATATTCAC
AATAAAATAA GATATACCAA AACAAAAAAA GAGAGCCTAC GTAAAATCGA TGAGGAGGAA
AAAAGCCTAC TAAAAGCTCA AAAACAAATC AATCGGTTAG ATAATGATAG CCTAGCGATG
CCACTATCTC AAAGACAAGC TATTCAAATG AAAATTAAAC AAGACCGACT ATCTCTTTTG
AAACGAACAA AAGAGCTTCT AAAATTGAGA CATAACACCC AAATAATGGA ATCGCCTCAA
ATTATTGTCT ACAACCGTAC TACCTTTCCT GGAGGACAAG GATATAACAC TTTTGACTCC
AGTACAAATA GCACTTCTAA AATCAGTAAT CTTTTCCCCA TTATTTTATA TTTAGTCGCA
GCATTAGTAA CCTTAACAAC AATGACTAGG TTCGTCGAGG AAGAAAGAAC CAACGCAGGC
ATATTAAAAG CACTAGGCTA TAGTGACCGC CAAGTTATCT TTAAATTTAT TATTTATGGT
TTTATAGCAG GAACATTAGG GACTACTTTG GGTATCATTG GAGGGCATTA TCTTTTACCT
CGTATTATTT CTGATATCAT TTCCAAAGAC CTGACTATTC CAAACACACA GTATCATCTT
TTCTTAAATT ACAGTTTACT AGCCTTTGTT TTTTCATTAT TAAGCATTGT TCTTCCAGTT
TTTGTTATCA CTCGACGTGA GTTGAAAGAA AAAGCAGCAT TTTTATTATT ACCAAAACCA
CCAGCTAAAG GATCTAAAAT AGCTTTAGAG TATATCAATT GGATTTGGAA AAAACTGTCC
TTCACACAAA AAGTAACTGC TAGAAATATT TTTCGTTACA AACAAAGGAT GATTATGACC
ATATTTGGAG TTGCTGGATC AGTAGCGCTC TTATTTTCAG GTCTAGGAAT ACAATCCTCC
TTAAAGCAAA CCGTCAATGA ACATTTCGGT CGTATTATGC CCTATGATAT ATTGCTAACA
TATAACACAA ATGCTTCTCC CCCAAAAATA CTTGAACTAC TTTCAAAAGA TTCAAAAATA
GACAAATACC AACCAATACA CCTTGAAAAT CTTGACGAAT CCATTCCTGG ACAAATTAAT
AAACAGTCAA TTTCCCTATT CATAACAGAC AAAAAACAAT TACTACCTTT TATTTATTTA
CAAGAAGCAA CAACTAATAA GTCATTGCAC TTGAATAATA AAGGGATTAT TATTTCTAAA
AAATTAGCTC AATTTTATCA TGTTAATACC GGTGATTTTA TCCATTTATC TCATTCACAA
ACACTTCCTT CTAGAAAATT AAAAATAACA GGAGTTGTCA ATGCGAATGT TGGTCACTAT
ATTTTTATGA CAAAACAATA CTATCGAACT ATTTTTAAGA AAGAAGCTAA AGATAACGCT
TTTTTGGTTA AGTTAACCAA ACATAAAATC GCAAATAACT TAGCAGAAAA ACTTTTAGAA
ATTAATGGAG TTGAGTCTCT TACGCAAAAT GCTCTTCAGC TGGCTAGTGT AGAAGCCGTT
GTGCGTTCTT TAGATGGCTC TATGACTATT TTAGTTGTCG TATCTTTATT GTTAGCCATT
GTAATTCTCT ATAATCTCAC TAATATTAAT TTAGCAGAAC GTAAGCGGGA GCTATCAACT
ATAAAGGTAT TAGGTTTTTA TAATGAAGAA GTAACTTTAT ACATTTACCG CGAAACCATT
ATTTTATCAA CCATCGGTGT GATTCTAGGC ACCATTAGCG GTACTTATTT ACACCGTCAA
ATGATGTTGC TAATTGGTTC AGATCAAATA CTTTTTGGTG AAAAAGTATC ACCAACTACA
TTTATAATAC CAATAAGCGT AGTAGTCATC ATTCTAATAA GTCTAGGTTT TATAGTTAAC
CATCAATTAA AAAAACTCAA TATGTTAGAT GCCTTGAAAT CAGTAGATTA G
 
Protein sequence
MGKTFWKDIY RSITTSKGRF SSILLLMMLG SFAFIGLKVS APNMQRTAQN YLAHHHVMDI 
TVFNSWGLDK HDQTVLESLK GSQVEFSYFV DTTPQQNSKS YRLYSNTKTI STFDLVKGRL
PLNKSEIALS FQERKKYAIG DKINFKQDKN KLFSNTGPLT IVGFVNSTEI WSKTNLGSSQ
TGDGDLDSYG VLDKTAFHSP VYTMARVTFK DLRLINPFSI SYKEKVAKYQ EKVSRKLNIH
NKIRYTKTKK ESLRKIDEEE KSLLKAQKQI NRLDNDSLAM PLSQRQAIQM KIKQDRLSLL
KRTKELLKLR HNTQIMESPQ IIVYNRTTFP GGQGYNTFDS STNSTSKISN LFPIILYLVA
ALVTLTTMTR FVEEERTNAG ILKALGYSDR QVIFKFIIYG FIAGTLGTTL GIIGGHYLLP
RIISDIISKD LTIPNTQYHL FLNYSLLAFV FSLLSIVLPV FVITRRELKE KAAFLLLPKP
PAKGSKIALE YINWIWKKLS FTQKVTARNI FRYKQRMIMT IFGVAGSVAL LFSGLGIQSS
LKQTVNEHFG RIMPYDILLT YNTNASPPKI LELLSKDSKI DKYQPIHLEN LDESIPGQIN
KQSISLFITD KKQLLPFIYL QEATTNKSLH LNNKGIIISK KLAQFYHVNT GDFIHLSHSQ
TLPSRKLKIT GVVNANVGHY IFMTKQYYRT IFKKEAKDNA FLVKLTKHKI ANNLAEKLLE
INGVESLTQN ALQLASVEAV VRSLDGSMTI LVVVSLLLAI VILYNLTNIN LAERKRELST
IKVLGFYNEE VTLYIYRETI ILSTIGVILG TISGTYLHRQ MMLLIGSDQI LFGEKVSPTT
FIIPISVVVI ILISLGFIVN HQLKKLNMLD ALKSVD