Gene SAG1190 details

Gene Information

Locus tag: SAG1190
Symbol: pavA
ID: 1013997
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 1193614
End bp: 1195269
Gene Length: 1656 bp
Protein Length: 551 aa
Translation table: 11
GC content: 36%
IMG OID: 637316375
Product: adherence and virulence protein A
Protein accession: NP_688199
Protein GI: 22537348
COG category: [K] Transcription
COG ID: [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP
TIGRFAM ID:
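
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and not part of the record.

```python
# Values copied from the record above.
start_bp = 1193614
end_bp = 1195269

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # matches the "Gene Length" field
print(protein_length, "aa")   # matches the "Protein Length" field
```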


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.252284
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTTTTG ATGGATTTTT TTTACATCAC CTCACAAATG AATTACAAGA ACAGATTGAA 
AAAGGACGTA TCCAAAAAGT TAACCAACCT TTTGACCATG AATTGGTATT AACAATACGC
AACAATCGTC GCAACTATAA ACTACTCCTA TCTGCTCATC CTGTATTTGG ACGTATCCAA
ACTACCGAAG CAAATTTCCA AAATCCTCAA AATCCAAATA CTTTTACAAT GATCATGAGA
AAGTATTTGC AAGGTGCAGT GATTGAAACC ATTCAACAAA TTGAAAATGA CCGTATATTA
GAAATCGTCG TGTCTAATAA AAACGAAATC GGTGATCACA TTAAAGCAAC ACTTGTTGTT
GAAATCATGG GAAAACACAG TAACATTATC TTGATTGATA AAAATGAACA TAAAATTATT
GAATCAATTA AGCATGTTGG TTTCTCACAA AACTCCTATC GTACTATCCT TCCTGGTTCG
ACTTACATTG CCCCACCCAA AACAAAAGCA ATCAATCCTT TTGATATTTC TGATCAAACC
CTATTTGAGC TGCTTCAAAC CAATGATCTA AGCCCTAAAA ACCTCCAACA GCTCCTCCAA
GGTTTAGGAC GCGATACTGC GCTAGAATTA TCTCACTGTT TGAAAGACAA TAAACTTAAC
GATTTTCGTC AATTCTTTTC GAGAGAATAT TATCCCAGCC TAACAGAAAA ATCCTTTTCC
GCTGTCCAAT TCTCAAGCAG TCACGAAACA TTTCAGTCTC TTGGACAATT GTTAGATTAT
TACTACCAAG AGAAGGCTGA AAAAGATCGC ATAGCACAGC AAGCTAGTGA CCTCATCCAC
CGTGTTCAAA GTGAGTTAGA GAAAAATATC AAAAAACTAG CTAAGCAACA AGACGAACTT
CTAGCTACTG AAAATGCAGA GGAGTTTCGC CAAAAAGGAG AGCTTCTAAC TACCTATCTC
TCTATGGTAC CAAATAATCA AGATGTCGTT GTGCTTGATA ATTATTACAC CAACCAAACC
ATTGAGATTT CACTTGATCG AGCTTTAACA CCCAACCAAA ATGCCCAACG CTACTTTAAA
AAATATCAAA AATTAAAAGA AGCTGTAAAA CATTTAAAAG GAATTATTTC AGATACGGAA
AATACAATCA CCTACCTTGA ATCTGTTGAA ACATCGCTAA ATCATGCTTC TATGGAAGAT
ATCAATGATA TTCGTGAAGA ACTTGTTGAA ACTGGATTTA TTAAGCGACG CGCACATGAT
AAACAACATA AGCGTAAAAA ACCTGAACAA TATTTAGCAT CCGATGGCAA GACTATTATT
ATGGTGGGAC GTAATAATCT CCAAAATGAC GAGTTAACAT TTAAAATGGC TCGTAAAGGG
GAGTTGTGGT TTCATGCCAA GGATATCCCC GGAAGCCATG TGTTAATTCG GGATAACCTC
AATCCAAGTG ACGAGGTTAA GACAGATGCC GCCGAGTTAG CTGCTTATTA CTCTAAAGCT
CGTCTCTCCA ATCTTGTTCA AGTAGACATG ATTGAAGCAA AAAAACTCAA TAAACCAAGT
GGTACTAAGC CAGGTTTTGT AACTTACACT GGTCAAAAAA CCTTACGAGT CACTCCCACA
CAAGAAAAAA TAGATAGCCT TAAACTAAAA AAATAG
 
Protein sequence
MSFDGFFLHH LTNELQEQIE KGRIQKVNQP FDHELVLTIR NNRRNYKLLL SAHPVFGRIQ 
TTEANFQNPQ NPNTFTMIMR KYLQGAVIET IQQIENDRIL EIVVSNKNEI GDHIKATLVV
EIMGKHSNII LIDKNEHKII ESIKHVGFSQ NSYRTILPGS TYIAPPKTKA INPFDISDQT
LFELLQTNDL SPKNLQQLLQ GLGRDTALEL SHCLKDNKLN DFRQFFSREY YPSLTEKSFS
AVQFSSSHET FQSLGQLLDY YYQEKAEKDR IAQQASDLIH RVQSELEKNI KKLAKQQDEL
LATENAEEFR QKGELLTTYL SMVPNNQDVV VLDNYYTNQT IEISLDRALT PNQNAQRYFK
KYQKLKEAVK HLKGIISDTE NTITYLESVE TSLNHASMED INDIREELVE TGFIKRRAHD
KQHKRKKPEQ YLASDGKTII MVGRNNLQND ELTFKMARKG ELWFHAKDIP GSHVLIRDNL
NPSDEVKTDA AELAAYYSKA RLSNLVQVDM IEAKKLNKPS GTKPGFVTYT GQKTLRVTPT
QEKIDSLKLK K