Gene SAG1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1801 
Symbol 
ID1014610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1796304 
End bp1797983 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content35% 
IMG OID637316969 
ProductBglG family transcriptional antiterminator 
Protein accessionNP_688791 
Protein GI22537940 
COG category[K] Transcription 
COG ID[COG3711] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGATTATTC TCGATAAAAA AAGTTATGAC CTCCTCTTTT ACCTATTGAA ATTAGAGGAA 
CCTGAAACAG TTATGGCAAT TGCCAACGCA CTTAATCAGT CTAGACGTAA AGTGTATTAT
CACTTAGAGA AGATAAACGA TGCACTGCCT AGCGATGTGC CTCAGATTGT TAGTTATCCA
CGTGTAGGAA TCTTGCTAAC AGAAAAACAA AAAGCAGCCT GTCGTCTTTT ATTAGATGAA
GTGACTGATT ATAGTTACGT CATGAAAAGT AGTGAGAGGT TGCAGCTGTC TTTAGTATCT
ATCGTAGTAG CTAAGGACCG CGTAACGATT GATAGGTTGA TGCAACTAAA CGATGTTTCT
CGAAATACTA TCTTAAACGA TTTAAACGAA TTAAGAAGTG AGTTAGCAGA GAAAGAATAT
AATTTACAGT TACAATCAAC AAAATGTCGT GGTTATTTTT TAGATGGTCA CCCATTGTCC
ATTATCCAGT ACTTATATAA GCTCTTAGAT GATATCTACC ATAATGGAAG TAGTAGTTTT
ATAGACCTTT TTAATCATAA ACTGTCTCAA GCTTTTGGTG CCAGCACTTA TTTTTCTAAA
GAGGTTCTTG ATTATTTTCA TCATTATCTC TTCATTTCTC AACGAAGTCT AGGTAAGAAA
ATCAACAGTC AAGATGGTCA GTTTATGATT CAGATTTTGC CTTTTATACT AATGGCTTAT
CGTAAGATGC GATTAAGTCC TGAAGTACAG ACCTCTCTTA ATAGTGATTT TAGCTTGGTT
TGGCAACGTA AGGAATATGA GATTGCTAAA GAGTTGGCTG ATGAGCTGGA AGAAAATTTT
CAGTTATCAC TGGATGAGAT TGAAGTGGGA CTAGTAGCCA TGCTTATGCT TAGTTTCCGC
AAGGACCGTG ACAATCATTT AGAGAGCCAG GATTATGATG ATATGCGAGC TACTCTAACC
AGTTTTTTGA AAGAATTGGA AGAACGATAT CACCTTCACT TTGTTCATAA AAAGGACTTA
CTAAGACAAC TTCTTACTCA CTGCAAGGCA CTCTTATATC GTAAACGTTA TGGTATTTTT
TCTGTTAATC CTTTAACAGA GCATATTAAA GACAAATATG AAGAACTTTT TGCCATAACC
TCGTCTTCTG TAAAGCTTTT AGAGAAAGCT TGGCAAATCA AATTGACCGA TGATGATGTA
GCATATCTAA CGATTCATTT AGGAGGGGAA CTTCGTAATA GTCAACAATC TCCTAATAAA
CTTAAGTTAG TTATTGTATC TGATGAAGGA ATAGCGATTC AGAAACTTCT TTTAAAGCAA
TGTCAACGCT ACTTAACAAA TAGTGATATA GAAGCTGTTT TTACAACCGA ACAGTATCAA
AGTGTGAGTG ATCTTATGCA TGTAGATATG GTTGTCTCTA CTAGTGATGC TTTAGAATCT
CGTTTTCCGA TGTTAGTAGT TCACCCTGTT TTGACAGATG ATGATATTAT TCGCTTGATT
CGCTTTTCTA AAAAAGGTAA CTGTGCAAAT AGTAATCAAT TTACCAATGA ACTTGAAAAA
ACAATTGCTC AATATGTCAA GGAAGATAGT GAACGCTACG TGCTGAAATC TAAGATTGAG
AAACTTATTC ATCAAGAATT GCTCCAAGAC GTCCTTCCCC TTCAAAGTAC AGTTTGTTAA
 
Protein sequence
MIILDKKSYD LLFYLLKLEE PETVMAIANA LNQSRRKVYY HLEKINDALP SDVPQIVSYP 
RVGILLTEKQ KAACRLLLDE VTDYSYVMKS SERLQLSLVS IVVAKDRVTI DRLMQLNDVS
RNTILNDLNE LRSELAEKEY NLQLQSTKCR GYFLDGHPLS IIQYLYKLLD DIYHNGSSSF
IDLFNHKLSQ AFGASTYFSK EVLDYFHHYL FISQRSLGKK INSQDGQFMI QILPFILMAY
RKMRLSPEVQ TSLNSDFSLV WQRKEYEIAK ELADELEENF QLSLDEIEVG LVAMLMLSFR
KDRDNHLESQ DYDDMRATLT SFLKELEERY HLHFVHKKDL LRQLLTHCKA LLYRKRYGIF
SVNPLTEHIK DKYEELFAIT SSSVKLLEKA WQIKLTDDDV AYLTIHLGGE LRNSQQSPNK
LKLVIVSDEG IAIQKLLLKQ CQRYLTNSDI EAVFTTEQYQ SVSDLMHVDM VVSTSDALES
RFPMLVVHPV LTDDDIIRLI RFSKKGNCAN SNQFTNELEK TIAQYVKEDS ERYVLKSKIE
KLIHQELLQD VLPLQSTVC