Gene SAG2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG2022 
Symbol 
ID1014833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1997467 
End bp1998720 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content39% 
IMG OID637317188 
ProductISL3 family transposase 
Protein accessionNP_689008 
Protein GI22538157 
COG category[L] Replication, recombination and repair 
COG ID[COG3464] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGACT TATTATCCCT ACCAGACATT AAAACAATAG AACCGCCACA AGAAAATGAA 
ACCGATATGA TGTTTAAAGT TGAAGCAGTC GGACCACCTG AACGTTGTCC TGAATGTGGT
TTTGACAAGT TGTACAAACA CAGTTCAAGA AATCAACTAA TTATGGATTT GCCCATTCGT
TTAAAGCGAG TGGGCTTACA TTTGAACCGT AGACGATACA AGTGTCGTGA ATGCGGATCT
ACCATATCTG TAGATGAAAA GCGTAGTATG ACCAAAAGGC TTTTAAAGTC CATTCAAGAG
CAATCCATGT CTAAGACCTT TGTAGAAGTC GCAGAAAGCG TTGGTGTTGA CGAGAAAACC
ATTAGGAACG TTTTTAAGGA CTATGTGGCA CTCAAAGAAC GTGAATACCA GTTTGAAACT
CCTAAGTGGC TTGGGATAGA CGAGATACAT ATTATCCGTA GACCTCGGCT TGTATTGACT
AATATTGAAC GCAGGACTAT TTATGACATC AAGCCTAACC GTAACAAGGA AACAGTCATC
CAACGTCTTT CAGAAATCAG TGACAGGACT TACATTGAGT ACGTCACAAT GGATATGTGG
AAGCCCTACA AAGACGCAGT GAACACTATC CTTCCACAAG CTAAAGTTGT CGTAGATAAG
TTTCATGTAG TTAGAATGGC TAATCAAGCC TTAGATAACG TCAGAAAGTC TTTGAAAGCC
CATATGAGCC AAAAAGAAAG ACGTACCCTT ATGCGTGAAA GGTTTATCCT TCTAAAGCGT
AAACACGATC TAAATGAACG TGAATCATTC CTCTTAGATA CTTGGTTAGG TAATCTTCCT
GCTTTAAAAG AAGCCTATGA ACTCAAAGAA GAGTTTTACT GGATATGGGA TACTCCTGAT
CCAGATGAAG GTCATCTTCG TTATAGTCAA TGGAGACACC GTTGTATGTC CAGCAACTCT
AAAGACGCAT ATAAAGACCT CGTGAGAGCC GTAGACAACT GGCATGTTGA AATATTCAAC
TACTTTGATA AAAGGCTCAC TAATGCTTAT ACGGAGTCAA TTAACAGCAT TATTAGGCAG
GTAGAGCGAA TGGGTAGAGG TTACTCGTTT GATGCCTTAC GAGCCAAAAT CCTTTTCAAT
GAGAAGCTCC ATAAAAAGCG TAAGCCACGA TTTAATTCAA GTGCTTTCAA TAAAGCTATG
TTATACGATA CTTTCAATTG GTATGAAGTG AATGATCACG ACATTACAGA CTAG
 
Protein sequence
MSDLLSLPDI KTIEPPQENE TDMMFKVEAV GPPERCPECG FDKLYKHSSR NQLIMDLPIR 
LKRVGLHLNR RRYKCRECGS TISVDEKRSM TKRLLKSIQE QSMSKTFVEV AESVGVDEKT
IRNVFKDYVA LKEREYQFET PKWLGIDEIH IIRRPRLVLT NIERRTIYDI KPNRNKETVI
QRLSEISDRT YIEYVTMDMW KPYKDAVNTI LPQAKVVVDK FHVVRMANQA LDNVRKSLKA
HMSQKERRTL MRERFILLKR KHDLNERESF LLDTWLGNLP ALKEAYELKE EFYWIWDTPD
PDEGHLRYSQ WRHRCMSSNS KDAYKDLVRA VDNWHVEIFN YFDKRLTNAY TESINSIIRQ
VERMGRGYSF DALRAKILFN EKLHKKRKPR FNSSAFNKAM LYDTFNWYEV NDHDITD