Gene SAG1897 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1897 
Symbol 
ID1014707 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1877560 
End bp1879464 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content33% 
IMG OID637317065 
Producthypothetical protein 
Protein accessionNP_688886 
Protein GI22538035 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0598703 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGATC GCTTATATAA TAAATTCAAG GATTTTGATA GGGAATTTTG TCAAAAGTAT 
ATTAAAACAT ACCAATCTAA TGCCTATCAA GAAATGAAAG CCTCTGTTAA TTTGATGATG
AGGAACACTT TTGTATTTAA TGATAATTGG GATATGGAAC CTTGTTCTAA AGCATATTGT
CTTGATCCTT TGGAGTGGGA TAAGCCAGTA ACAGATGATC CAGAATGGTT GTATATGCTA
AATCGTCAAA CTTATCTTTT TAAATTCCTG GTGGTCTACA TTGTTGAAGG CGATAAGTCT
TACCTTAGAC AAATGAAATA TTTTATGTAC CATTGGATTG ATTGTCAATT TACACTAAAG
CCAGAAGGGG CGGTAAGTCG TACTATTGAC ACTGGGATAC GTTGCATGTC ATGGCTCAAA
GTTTTGATTT TTTTAGATTA TTTTGGATTA ATAACAGAAA CTAAAAAAAT TAAACTATTA
ACTAGTTTAC GGGAGCAGAT AACTTATATG AGGGACTATT ATCGTGAAAA AGATAGTCTA
AGTAACTGGG GAATTTTACA AACAACAGCA ATATTGGCGT GTTTATACTA TTATGAAGAT
GAATTAAATC TACCTGAAAT TCAGAGTTTT GCTGAAGAAG AATTATTACT TCAAATCAAG
CTTCAAATTT TAGATGACGG GAGCCAGTAT GAACAATCAA TTATGTATCA TGTAGAAGTC
TTGAAGTCCT TAATGGAACT AGTTATCCTT GCGCCTAAAT ATTATCTACC ATTAGAAGAA
ACTATTGAAA AAATGGTTAC CTATCTAATT GCTATGACTG GTCCGGATTA TTGTCAACTA
GCTATAGGGG ACAGTGATGT GACCGATACT CGTGATATTC TAACTTTGGC AACACTTGTG
TTGAAATCTT CTAAAACAAA ATCATTTTCT TTTGATAATG TTAATTTAGA AACTTTACTT
TTGTTTGGGA AGCCATCAAT TTATCTTTTT GAAGAAATAC CGCGTGCGAC AATAGGAGAG
TCTGCTTATC TTTTTCCAGA TTCTGGTCAT GTGTGTTTAC GTGATGATAG GCGTTATATA
TTTTTTAAGA ACGGTCCATT TGGTAGCGCT CATACTCATA GTGATAATAA TAGTGTTTGT
CTCTATGATA AAAAGAAACC TATTTTCATT GATGCAGGAA GATATACTTA CAAAGAAGAA
CAACTAAGGT ATGATTTTAA ACGTTCGACT AGTCATTCAA CATGTACCCT TGATGGGCAA
CCCTTAGAAA TGATCAAGGA CTCTTGGACA TACAATTCTT ATCCAAAATG TGACTATTGT
CAGTTGACTT CAAAGGATAG GTACCATTTA GTCGAAGGAC AACTACATGT CCAAAGAGCT
TCTGATATCT ATTACCATAA GCGATGGTTG TTAACTTTAC CGCAGGCCAT TACCTTAGTT
ATTGATAAGG TGAGTTGTCC AGGAGAGCAT GTCTTAACAA ATCAATATAT TTTAGATGAT
CAGGTCATTT ATGAAAATGG GTTTGTTAAT GACTTGAAAT TAGTAAGTCC TACGACCTTT
AATCTAGAAG ATTGCCTTAT TTCTAAGCGG TATAATCAAT TGACAGAAAG TCATAAATTA
GTTAAGAAAA TAAAATTTGT TGATGAGGTG ATGGATTATA CCTTGATAGT TGATCGGAAC
TGTCAGGTGA AATATGTTCC TTTAGTGCAA ACGAACAGTC ATAAGGAACT AAGTAATAGC
ATTGCATTTG ATATTAGGTC TCAAGACTTT CATTATTTAA TTGGAGTGCT TATGGATGAT
ATTATCTTTG GGGATAAACT CTATTTGATG CAAGGGATAA AATGCAAAGG AAAAGTCATT
GTATATGATA AAAATAATGG CAAAATGAGT CGTTTAAAAA ATTAA
 
Protein sequence
MKDRLYNKFK DFDREFCQKY IKTYQSNAYQ EMKASVNLMM RNTFVFNDNW DMEPCSKAYC 
LDPLEWDKPV TDDPEWLYML NRQTYLFKFL VVYIVEGDKS YLRQMKYFMY HWIDCQFTLK
PEGAVSRTID TGIRCMSWLK VLIFLDYFGL ITETKKIKLL TSLREQITYM RDYYREKDSL
SNWGILQTTA ILACLYYYED ELNLPEIQSF AEEELLLQIK LQILDDGSQY EQSIMYHVEV
LKSLMELVIL APKYYLPLEE TIEKMVTYLI AMTGPDYCQL AIGDSDVTDT RDILTLATLV
LKSSKTKSFS FDNVNLETLL LFGKPSIYLF EEIPRATIGE SAYLFPDSGH VCLRDDRRYI
FFKNGPFGSA HTHSDNNSVC LYDKKKPIFI DAGRYTYKEE QLRYDFKRST SHSTCTLDGQ
PLEMIKDSWT YNSYPKCDYC QLTSKDRYHL VEGQLHVQRA SDIYYHKRWL LTLPQAITLV
IDKVSCPGEH VLTNQYILDD QVIYENGFVN DLKLVSPTTF NLEDCLISKR YNQLTESHKL
VKKIKFVDEV MDYTLIVDRN CQVKYVPLVQ TNSHKELSNS IAFDIRSQDF HYLIGVLMDD
IIFGDKLYLM QGIKCKGKVI VYDKNNGKMS RLKN