Gene SAG2000 details

Gene Information

Locus tag: SAG2000
Symbol:
ID: 1014811
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1978897
End bp: 1980897
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 33%
IMG OID: 637317167
Product: hypothetical protein
Protein accession: NP_688987
Protein GI: 22538136
COG category:
COG ID:
TIGRFAM ID:
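
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names `cds_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_len_bp // 3 - 1


# Values copied from the record above.
gene_len = cds_length(1978897, 1980897)
print(gene_len)                  # 2001
print(protein_length(gene_len))  # 666
```

Both results match the Gene Length and Protein Length fields of the record.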


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGGACAAAA TAGATGATGG TTTGATTGCT TCGACCTTTG CGAAGGTAAA AGATGTGGAT 
ATATTTGCGT TAAAGGCTTA TATGGAGATA ACTCACGGAG CTGAGACTGG AGCTCAAAGC
ATTTTGTTAG ATGTTTTTGT TAATTTTCCC TTTTTCCTAC TTAACTTGAT TGTTGGTTTA
TTTTCTGTTA TTCTACGCTT TTTTGAAAAT TTCAGTTTGT ATGATACCTA TAAACAAACA
GTTTATCATT CGTCTCAAAA ATTATGGGAA AATCTATCAG GTAATGGGTC TTATACAAGC
TCCTTGCTTT ATTTATTGGT TGCGATTTCA GCTTTTTCAA TATTTATTTC ATATCTTTTT
TCAAAAGGAG ATTTTTCTAA ACGTTTGATC CACTTGTTCG TGGTTATCAT TTTAGGTATG
GGTTACTTTG GTACGATTCA ATCAACATCA GGTGGTATTT ATATTTTAGA TACCGTTCAT
CAACTAGCTG GTTCGTTTTC AGATGCGGTA ACTAATTTAT CACTGGATAA TCCATCAGGT
GGTAAAACAA AGATCACACA AAAATCATCA GTAGCTGATA ATTATGTGAT GAAAACTTCT
TATACGGCTT ATTTGTTTGT CAATACTGGT CAGTTAAATG GTAAGTTTCA CAATAATCAA
ACAGGTAAAG AAGAAAAGTT TGATAACGAA CAAGTTTTAG GTAAGTACGA TAAATCAGGT
AAATTTATCA CTCCAAAACA AAAGGATATA TTGAATTATA CTGATAATTT GGGAGATAAG
GCAACCGAAG GGGAGGAAAA AAATAGATGG CTTTCTGCAG TAAATGATTA TCTTTGGATA
AAATCAGGCT ATGTTATCTT AAAGATTTTT GAAGCGGTTA TACTCGCAGT TCCATTAATA
CTCATTCAGT TAATTGCTTT TATGGCAGAT GTTTTAGTGA TTATATTAAT GTTTATTTTT
CCATTAGCTT TATTAGTTTC ATTTTTGCCT AGAATGCAAG ATATCATTTT CAATGTTTTG
AAAGTCATGT TTGGGGCTGT TTCCTTTCCT GCCTTAGCTG GTTTTTTAAC CTTGATAGTC
TTCTATACTC AAACTTTAAT AGCCACCTTT GTTAAGAAAA AATTTACAGA TGGTTCCCTG
CTTAGTGGTA GTAATTTTAA AGGTCAGGCA ATACTGTTTA TGTTGCTAAT AACAGTTTTC
GTCCAAGGTT GTGTGTTTTG GGGAATTTGG AAATATAAAG AAACTTTCTT AAGACTGATT
ATAGGGTCTA GAGCTTCTCA AGTCATTAAT CAGTCTGTAG ATAAAATTAA TGAAAAAGCA
GAAAATCTAG GAATTACGCC AAAATCCATT TATGAGAGGG CACACGATAT GTCAAGCCTT
GCCATGATGG GAGCTGGCTA TGGTGTTGGT ACAATGATGA ATGCACAAGA TAATTGGAAT
GCTTTCAAAG AAAGACAACA AGCTAATTTA GATGATGGTC AGTCTAAAAC TAATGATGCT
GATAAGTATG ATGAAGCTAA TGCGGATGAT ACTGTTATTT CTAAAGAGGC TGAGCTGACT
AATGAAGGTG AATATCAATC AGAGTTACCT AAAGAAGCAA GTAAAAGGAT TGAACAATTG
GGAAAGGAAT CTTCTTATGA GCTATCGTTT ATATCAGAAG GTAATTCAAC AGAGGAAATT
TTAAAAAATG TTAAGTCGGA TAATCACACG TTTCAAGAAG GAGATGGAGA TACTTCATTA
ACAAATCAGG ATATGATAAC AAATGATATA GAAAATCATT CAAATAACTA TACTTCACCA
TTAAAACAGC GAAAGCTTAA TAAGCTAGAG GGTGAATTAA GTCAATTTAA TAGTGATGTA
TCCATGACAA AAAATCATGG TAAAAATGCT TTTGAAAAGG GATTTAATGC CAGTAAAACT
AAGGAAGTTA GAAAGCAACA TAATCTTGAA CGACAGTCAA AAGTTTTGGA AGAACTAGAA
AAGCTGAGAG AAGGTAAGTA G
 
Protein sequence
MDKIDDGLIA STFAKVKDVD IFALKAYMEI THGAETGAQS ILLDVFVNFP FFLLNLIVGL 
FSVILRFFEN FSLYDTYKQT VYHSSQKLWE NLSGNGSYTS SLLYLLVAIS AFSIFISYLF
SKGDFSKRLI HLFVVIILGM GYFGTIQSTS GGIYILDTVH QLAGSFSDAV TNLSLDNPSG
GKTKITQKSS VADNYVMKTS YTAYLFVNTG QLNGKFHNNQ TGKEEKFDNE QVLGKYDKSG
KFITPKQKDI LNYTDNLGDK ATEGEEKNRW LSAVNDYLWI KSGYVILKIF EAVILAVPLI
LIQLIAFMAD VLVIILMFIF PLALLVSFLP RMQDIIFNVL KVMFGAVSFP ALAGFLTLIV
FYTQTLIATF VKKKFTDGSL LSGSNFKGQA ILFMLLITVF VQGCVFWGIW KYKETFLRLI
IGSRASQVIN QSVDKINEKA ENLGITPKSI YERAHDMSSL AMMGAGYGVG TMMNAQDNWN
AFKERQQANL DDGQSKTNDA DKYDEANADD TVISKEAELT NEGEYQSELP KEASKRIEQL
GKESSYELSF ISEGNSTEEI LKNVKSDNHT FQEGDGDTSL TNQDMITNDI ENHSNNYTSP
LKQRKLNKLE GELSQFNSDV SMTKNHGKNA FEKGFNASKT KEVRKQHNLE RQSKVLEELE
KLREGK
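
The sequences above are printed in blocks of ten residues separated by spaces and line breaks. A minimal sketch of how such text might be normalized before computing a GC percentage like the one reported in the record is given here; `clean_sequence` and `gc_percent` are hypothetical helper names, and the record's own rounding rule is not stated, so none is assumed.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGACAAAA TAGATGATGG TTTGATTGCT TCGACCTTTG CGAAGGTAAA AGATGTGGAT"
print(len(clean_sequence(first_line)))  # 60
```

Passing the complete gene sequence text to `gc_percent` would give a value to compare against the GC content field.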