Gene SAG0993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG0993 
Symbol 
ID1013797 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1002308 
End bp1003618 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content37% 
IMG OID637316177 
ProductNOL1/NOP2/sun family protein 
Protein accessionNP_688004 
Protein GI22537153 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00446] NOL1/NOP2/sun family putative RNA methylase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000219997 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAC CGAATGAATT TATTGAAAAA TATCAGACTA TACTTAAAGA TGAAGCAGAA 
GCTTTTTTTG ACAGTTTTGA ACAAAAACCC ATATCGGCAT ATCGCACCAA TCCTCTCAAA
GAAAAGCAAC TTGATTTTCC AAATGCCATC CCAAGTACCC CTTGGGGGCA TTATGGGAAA
ATATCAGGAA AATCAATAGA ACACACAACT GGCCTTGTTT ATTCTCAAGA ACCTGCTGCT
CAGATAGTGG CTCAGATTGC TGAACCACAA GAAGGAATGA AAGTGCTAGA TTTAGCAGCT
GCACCTGGTG GAAAAACAAC ACACCTTCTA TCATATTTAA ACAATACTGG TCTATTAGTT
AGTAATGAAA TTTCAAATAA ACGTAGTAAA ATTTTAGTCG AAAATGTCGA ACGTTTTGGT
GCTAGAAATG TTATCGTGAC TAATGAAAGC TCCCAACGAC TAGCCAAATG TTTTAATTCT
TTTTTTGATC TTATTGTTTT TGACGGTCCG TGTTCTGGTG AGGGAATGTT CCGCAAAGAT
CCTCAAGCAA TACAATATTG GCACAAAGAT TACCCCACTG AATGTGCACA GTTACAAAGA
GATATTTTAA AAGAAGCAAT CAAAATGCTA GCTCATGGTG GTATACTTGT ATACTCTACT
TGTACATGGT CACCAGAAGA GAACGAAGAA GTGGTTAATT GGCTCCTTCA AGAGTATGAT
TACTTAGAAC TTGTTGATAT TCCTAAACTA AATGGGATGG TTGAAGGGAT TAATGTACCA
CAAGTTGCAA GGATGTACCC TCATCATTTT CAAGGTGAAG GACAATTCGT TGCAAAACTG
AGAGATACTA GATCTAAGGA AGCACAGAAA ATTAAGCCAA AAGCACAGAA AATAAATAAA
ATGCAGTTAC AATTGTGGCA ACAATTTGCA CAAGACCATT TAAAGATAGA CTTGAACGGT
GTTTTAGATG TTTTCGGCGA CCAACTCTAT CTTCTACCTA ATGGTCTCCC CGATTTGTCT
AAATTAAAAA TAGCACGCAA TGGTCTCCAT CTAGGCACTT TCAAGAAGAA TCGTTTTGAG
CCATCGTTTG CTTTAGGAAT GGCACTTAGC GAACATGACC TAGTACAGTC TATTGAAATT
GATATAGAAC AGTTTGAGGT GTACGTATCT GGAAATGTAG TCAAACTAGA CAAGACTGTT
CCAAACGGTT GGTATCAAAT TCTTGTAAAA GGCAATGGAT TAGGTTTTGC AAAAGTGACA
AATAATACTC TTAAAAATTA TTACCCTAAA GGACTACGAT TTCAGACATA A
 
Protein sequence
MKLPNEFIEK YQTILKDEAE AFFDSFEQKP ISAYRTNPLK EKQLDFPNAI PSTPWGHYGK 
ISGKSIEHTT GLVYSQEPAA QIVAQIAEPQ EGMKVLDLAA APGGKTTHLL SYLNNTGLLV
SNEISNKRSK ILVENVERFG ARNVIVTNES SQRLAKCFNS FFDLIVFDGP CSGEGMFRKD
PQAIQYWHKD YPTECAQLQR DILKEAIKML AHGGILVYST CTWSPEENEE VVNWLLQEYD
YLELVDIPKL NGMVEGINVP QVARMYPHHF QGEGQFVAKL RDTRSKEAQK IKPKAQKINK
MQLQLWQQFA QDHLKIDLNG VLDVFGDQLY LLPNGLPDLS KLKIARNGLH LGTFKKNRFE
PSFALGMALS EHDLVQSIEI DIEQFEVYVS GNVVKLDKTV PNGWYQILVK GNGLGFAKVT
NNTLKNYYPK GLRFQT