Gene SAG1920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1920 
Symbol 
ID1014730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1903230 
End bp1904567 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content36% 
IMG OID637317088 
ProductCCS family citrate carrier protein 
Protein accessionNP_688909 
Protein GI22538058 
COG category[C] Energy production and conversion 
COG ID[COG3493] Na+/citrate symporter 
TIGRFAM ID[TIGR00783] citrate carrier protein, CCS family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.296247 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGACG TGAAGGTAGT GAATAATGAG GATTCTAGAG GTCAAAAGCA AGACTTAAAG 
GCTAAACTAT TTCATATTAA GATAGGGTCA GTTCCCCTAC CAGTATATGT TTGTTTAGCA
TTATTGATTC TTCTAGCAGG CTTTTTACAA AAATTGCCAG TCAATATGCT AGGAGGATTT
GCAGTTATCT TAACAATGGG GTGGTTCTTA GGGACTATCG GAGCTAGCAT TCCTGGTTTT
AAAAACTTCG GTGGCCCAGC TATTTTATCT TTATTAGTAC CATCTATTTT GGTGTTTTTC
AACCTCATTA ATAAAAATGT TTTAGAATCA ACAAATATGT TGATGAAGCA AGCTAACTTT
CTTTATTTTT ATATTGCTTG TTTAGTGTCC GGTAGTATTT TAGGGATGAA TCGGAAAATG
TTGATTCAGG GATTGCTAAG AATGATTTTC CCCATGTTAT TAGGAATGGT TTGTGCGATG
ATGGTAGGGA CATTTGTCGG TGTTATTTTA GGCTTAGAGT GGCGACACAC TTTGTTTTAT
ATCGTAACAC CCGTTTTAGC TGGTGGTATT GGTGAAGGTA TTTTACCATT ATCGTTAGGC
TATAGTTCAA TTACCGGTGT AGCTAGTGAA CAACTAGTTG CTCAACTCAT CCCAGCCACT
ATTATTGGTA ATTTCTTTGC CATTTTATGT ACTGCACTAT TGAATCGTTT GGGAGAAAAG
AAACCACACT TGTCTGGTCA AGGGCAATTA GTAAGGTTAA ATAAAGGAGA GGACATGTCA
GATATTATTG CTGATCATTC TGGCCCAATT GACGTTAAGA AAATGGGTGG AGGTGTTTTA
ACAGCATGTA GTCTCTTTAT TTTTGGACAT TTGTTGCAGC AATTAACTGG ATTTCCTGGT
CCCGTATTAA TGATTGTTGC AGCAGCTATT TTGAAATATA TTAATGTTAT TCCTAGAGAA
ACACAAAATG GAGCTAAGCA ACTTTATAAA TTTATTTCTG GTAATTTTAC ATTTCCTCTA
ATGGCAGGTC TAGGATTGCT TTATATCCCG TTAAAAGATG TTGTGGCAAC GCTTAGCATA
CAATATTTCA TAGTTGTTAT TAGTGTTGTA TTTACAGTTA TTTCTGTTGG ATTCTTTGTA
TCGCGATTCC TTAATATGAA TCCTGTTGAA GCAGGTATTA TTTCAGCTTG TCAAAGTGGT
ATGGGAGGAA CAGGAGATGT TGCCATTTTA AGTACAGCAG ACCGAATGAA CTTGATGCCA
TTTGCTCAAG TTGCTACGCG TTTAGGAGGA GCTATTACTG TTATCACAAT GACAGCCATT
TTACGCATGT TATTCTAA
 
Protein sequence
MADVKVVNNE DSRGQKQDLK AKLFHIKIGS VPLPVYVCLA LLILLAGFLQ KLPVNMLGGF 
AVILTMGWFL GTIGASIPGF KNFGGPAILS LLVPSILVFF NLINKNVLES TNMLMKQANF
LYFYIACLVS GSILGMNRKM LIQGLLRMIF PMLLGMVCAM MVGTFVGVIL GLEWRHTLFY
IVTPVLAGGI GEGILPLSLG YSSITGVASE QLVAQLIPAT IIGNFFAILC TALLNRLGEK
KPHLSGQGQL VRLNKGEDMS DIIADHSGPI DVKKMGGGVL TACSLFIFGH LLQQLTGFPG
PVLMIVAAAI LKYINVIPRE TQNGAKQLYK FISGNFTFPL MAGLGLLYIP LKDVVATLSI
QYFIVVISVV FTVISVGFFV SRFLNMNPVE AGIISACQSG MGGTGDVAIL STADRMNLMP
FAQVATRLGG AITVITMTAI LRMLF