Gene SAG1122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1122 
Symbol 
ID1013926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1129236 
End bp1130783 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content37% 
IMG OID637316304 
Producttransporter, BCCT family protein 
Protein accessionNP_688131 
Protein GI22537280 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTAAAA AACATATTAC GCCTGTATTT ACAGGTTCAC TAATTGTATC GCTTATTTTA 
GTCTTATTAG GAATTATTGT TCCTCGTGGT TTTCAATCTT GGACACAAAT CTTGCGTGAA
CAGGTATCTA CCAATTTCGG TTGGTTGTAC TTGTTGTTAG TTACTTCAAT TCTTGCTTTG
TGTGTCTTTT TTATTATGAG TCCTCTTGGA CAAATACGTT TAGGGCAACC TCATTCACGT
CCTGAATATT CAACTGTATC ATGGATAGCA ATGATGTTTT CAGCAGGTAT GGGGATTGGT
TTGGTCTTCT ACGGAGCAGC TGAACCCTTA TCGCATTTTG CTATTTCGAC ACCTGGTGCA
CCTAAGGAAT CGCAAACAGC ATTAGCTGAT GCATTTCGTT TTACATTTTT TCACTGGGGG
ATACATGCTT GGGCAGTATA TGCATTGGTT GCTTTAGCTC TAGCTTATTT TGGATTTCGA
AAGCAAGAGA AATACCTCTT GTCTGTCACT TTAAAGCCTC TTTTCGGTGA TAAGACAGAT
GGTTGGCTAG GAAAAATTGT TGATATCACC ACAGTTGTTG CTACAGTTAT TGGAGTTGCT
ACGACACTTG GATTTGGAGC TGCTCAAATC AACGGAGGGT TAAGTTTTTT ATTGGGTGTT
CCCAATAATG CATTTGTTCA AATTGTTATT ATCCTGATTA CAACAGCTTT ATTTGTTATG
TCAGCTTTAT CAGGTTTAGG AAAAGGTGTT AAAATTTTAT CGAACTTAAA TTTGATTTTA
GCGGTAGCCC TCTTAGCTTT AGTTATTGTA TTGGGACCAA CGGTTCGTAT TTTTGATACC
CTAACAGAGT CTTTAGGCTC TTATTTACAA AATTTCTTTG GAATGAGCTT TCGTGCAGCT
GCTTTTGACA ATACTAAACG TTCTTGGATT GATAATTGGA CGATTTTTTA TTGGGCGTGG
TGGATTTCCT GGTCTCCTTT TGTTGGAGTT TTCATCGCTC GTATTTCTAA AGGGCGTAGC
ATTCGGGAGT TTTTAACGGT AGTTCTTTTA ATACCGACAT TATTGAGTTT TGTATGGTTT
GCAGCATTTG GCACATTATC AACTCAGGTA CAACAACTGG GTACTAATTT GACAAAGTTT
GCAACAGAGG AAGTATTGTT TGCTACTTTT AATCACTACA CTTTAGGTTG GCTTTTATCC
ATTATTGCTA TCATTTTAAT TTTTTCATTT TTTATTACAT CAGCAGATTC TGCAACGTAC
GTTTTGGCTA TGTTGACAGA AGATGGTAAT TTAAACCCAA AAAATCGAAC TAAAGTAATT
TGGGGGCTGG TGTTGGCAGT GATTGCTATT GTCTTACTCT TGTCTGGTGG TCTGTTAGCG
CTGCAGAACG TTTTAATTAT TGTCGCTCTG CCATTTTCAT TCGTAATGAT TTTGATGATG
CTAGCGTTAT TAGTGGAGCT TTTCCATGAG AAAAAAGAAA TGGGCTTATC GATTTCTCCA
GATCGTTATC CACGTAAAAA TGAACCATTT AAATCTTATG AAGAATAA
 
Protein sequence
MSKKHITPVF TGSLIVSLIL VLLGIIVPRG FQSWTQILRE QVSTNFGWLY LLLVTSILAL 
CVFFIMSPLG QIRLGQPHSR PEYSTVSWIA MMFSAGMGIG LVFYGAAEPL SHFAISTPGA
PKESQTALAD AFRFTFFHWG IHAWAVYALV ALALAYFGFR KQEKYLLSVT LKPLFGDKTD
GWLGKIVDIT TVVATVIGVA TTLGFGAAQI NGGLSFLLGV PNNAFVQIVI ILITTALFVM
SALSGLGKGV KILSNLNLIL AVALLALVIV LGPTVRIFDT LTESLGSYLQ NFFGMSFRAA
AFDNTKRSWI DNWTIFYWAW WISWSPFVGV FIARISKGRS IREFLTVVLL IPTLLSFVWF
AAFGTLSTQV QQLGTNLTKF ATEEVLFATF NHYTLGWLLS IIAIILIFSF FITSADSATY
VLAMLTEDGN LNPKNRTKVI WGLVLAVIAI VLLLSGGLLA LQNVLIIVAL PFSFVMILMM
LALLVELFHE KKEMGLSISP DRYPRKNEPF KSYEE