Gene SAG1947 details

Gene Information

Locus tag: SAG1947
Symbol:
ID: 1014757
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1935525
End bp: 1937174
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 33%
IMG OID: 637317114
Product: hypothetical protein
Protein accession: NP_688935
Protein GI: 22538084
COG category: [T] Signal transduction mechanisms
COG ID: [COG2972] Predicted signal transduction protein with a C-terminal ATPase domain
TIGRFAM ID:
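
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that the coordinates are 1-based and inclusive and that the CDS ends in a stop codon not counted as a residue; the variable names are illustrative and not part of the record.

```python
# Values copied from the gene record.
start_bp = 1935525
end_bp = 1937174

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1650 549
```

Both values match the Gene Length and Protein Length fields of the record.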


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAGGAT ATAGAATGGA AGAACGCTTT AAGAAACGAC TACAAGATGA TATCTCTAAA 
CATTTTTCCC GCCAGTCTTT AATTTTATCC TTGTTATTGA TTGCACTTTT CGTCCTTTTT
TCTTTAGCGC CTCAACAGAT AGGTCTTTAT AAAGATGTTA ATTCAGTTTC CTATAGTTAT
AAGCAATTGA TTCAAAAACA TGATACCTTA TTAGATGACT TAGGTAAAAA TAGTTTGAAA
CCATTTGTCT CAGGACACTT AGGAAGTGCT GATTTGAGTA AGCAATATTA TCATTTAAGG
AATCATTTAC AAAGTCAAAC AGAACTTTTA GTATTCTCAC CTAATCAAGA ACTTTTGTTT
GCTAGTAATT CACATTTAGG TAATTTTTTT AGTAAGTCTA TTTACATTAG TGAAGTTTTA
GATAAGGCTA AAATAAATCA AAGGCTACTA AAAATTATTG TGGATAGTGA GGGAGGGCAT
TACCTTGCAC TCATTAAGCC AATTATAGTT AATAAAAAAG TTTCAGGTTA TGCCTTTTTA
TTGATGAATG GGAAAGACTT TTTACTTCCA ACGAAGGCTA TTAACTCAGA CTTGATTATT
GCCGATCAGC TAAATAATAG CTTTACTTTT ACTAATCGTG ATTTTATCTC CTCTAGTTTA
GATAAGGTAG ATAGTCAATT CTTAACTAGA TATTTTAGTT TTCACGATCA TCGAGCTTTT
GTTGTTAGGA AAGTAGCTCT GCAAGACAAT ATCCTACTTT ACATGTATCG TCCCCTGATA
CCAGTAACTC TGGTCGTCCT TTTTTCTCTA GTATCATCAG TTATTATTTT TGTGATTTTA
CGACAAAAAT CTAGAGTTCT AGCCGATCGC ATCGCAGTAA AAAATTCTAG CGCCATAAAT
CAAATGGTGC TAGATATGGA TGCCATTTCC CGCCAAGAAA AATCTAGTAT TGAGCTTGAT
AGTCAAGATG AATTTCAATA TCTATCTGTG CAAATCAATC AGATGGTTTC ACGGTTAAAA
GATTTACATG AGAAAACTCT TGATTTGGAA ACACAAAAGT TGTTGTTTGA AAAGAGGATG
TTAGAAGCTC AGTTTAATCC TCACTTCCTT TATAATACCC TTGAGACAAT TTTAATAACC
AGTCACTATG ATTCACAATT AACAGAAAGA ATCGTTATAC AACTAACAAA ATTATTGCGC
TATAGCCTTA GTGGTAGCAC AGAAGCAGCT GTGCTTAAGG ATGATTTAGC AATCATTGAA
TCCTACCTTT TGATTAATCA AGTGAGATTT GAAGAATTAA CTTACACTAT TTCAGTATCT
CCTGAACTTG AACACATGCG TGTACCAAAA CTTTTCTTGT TACCGCTAAT TGAGAATGCT
ATTAAATATG GCTTAAAAGA ACGTCATGAT GTAGCAATAA ACATAGATAT TTGGCAAGAT
AGTGATGGAA TTTGGTTTAC TGTTTCAAAT AATGGATCAG GTATTAGTTT GGCTAGGCAA
CAAGCCATAC GTACAATGTT AAGATCAACT CACTCGCATC ATGGGCTTAT TAATTCTTAT
AGACGATTAC AATATCAATT TTCAACAGTA TTGTTAGAGT TTACGAAGAC AGATGATGCT
TTTCGAGTTA GCTATATAGT AAAGGAGTGA
 
Protein sequence
MRGYRMEERF KKRLQDDISK HFSRQSLILS LLLIALFVLF SLAPQQIGLY KDVNSVSYSY 
KQLIQKHDTL LDDLGKNSLK PFVSGHLGSA DLSKQYYHLR NHLQSQTELL VFSPNQELLF
ASNSHLGNFF SKSIYISEVL DKAKINQRLL KIIVDSEGGH YLALIKPIIV NKKVSGYAFL
LMNGKDFLLP TKAINSDLII ADQLNNSFTF TNRDFISSSL DKVDSQFLTR YFSFHDHRAF
VVRKVALQDN ILLYMYRPLI PVTLVVLFSL VSSVIIFVIL RQKSRVLADR IAVKNSSAIN
QMVLDMDAIS RQEKSSIELD SQDEFQYLSV QINQMVSRLK DLHEKTLDLE TQKLLFEKRM
LEAQFNPHFL YNTLETILIT SHYDSQLTER IVIQLTKLLR YSLSGSTEAA VLKDDLAIIE
SYLLINQVRF EELTYTISVS PELEHMRVPK LFLLPLIENA IKYGLKERHD VAINIDIWQD
SDGIWFTVSN NGSGISLARQ QAIRTMLRST HSHHGLINSY RRLQYQFSTV LLEFTKTDDA
FRVSYIVKE
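
The protein sequence follows from the gene sequence by codon-by-codon translation. As a minimal sketch, the code below builds the standard codon assignments (the amino-acid assignments of NCBI translation table 11 are the same as those of table 1; only the set of permitted start codons differs) and translates the first 30 and last 9 bases of the gene. The helper `translate` and the constant names are hypothetical, written only for this illustration, and the sketch does not handle alternative start codons or ambiguous bases.

```python
# Amino-acid assignments shared by NCBI translation tables 1 and 11,
# listed by codon with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGAGGAT ATAGAATGGA AGAACGCTTT"))  # prints: MRGYRMEERF
# Last 9 bases of the gene sequence, ending in the TGA stop codon.
print(translate("AAGGAGTGA"))  # prints: KE
```

The two outputs agree with the first ten and last two residues of the protein sequence listed above.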