Gene SAG1855 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG1855 
Symbol 
ID1014665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp1849791 
End bp1851503 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content42% 
IMG OID637317024 
Productprophage LambdaSa2, terminase large subunit, putative 
Protein accessionNP_688845 
Protein GI22537994 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.526141 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTGAGA TGAGATATTT TGATAAGTAT GCTCAGCTCA TCTATACTGG TAAGATTCGT 
ATTTGTAAGC TCACAATGAA ATCAATTAGA CGTGTTGAGC GATACAAAGA GCAATACCTC
TTCAAACAGG AGGAAGCTGA CAAACGGATT GAGTTCATTG AGGAAGAGTG CAGCAATACT
AAAGGCCTTG CTGGTAAGTT ACGCTTAGCA TTACCACAAA AGGTTTGGTT AGAAACAACG
TGGGGCTTTT ATCACACGGT TGAGGTTACT AAGACCAATC CTGATACCTT GGAAGAATAC
ACAGATTATG AAGAAAGGCG TCTCATTCAT GAGGTGCCTA TTATTGTGCC TCGTGGCACA
GGTAAGACTA CTCTTGGTTC TGCTATTGCT GAGGTTGGTC AAATCATTGA CGGTGAGTGG
GGTGCTGATA TTCAGCTTCT TGCTTACAGT CGTGAACAGG CTGGCTATTT GTTCAATGCC
TCAAGGGCGA TGTTGTCGAA TGAAGAAAGC TTGCTGCACT ATATGCGTGA GGCTGACATC
CTACGGTCAA CCAAGCAAGG TATCTTGTAT GAAACAACTA ACAGTCTTAT GTCTATCAAG
ACTTCTGACT ATGAAAGCCT TGACGGTACT AATGCTCACT ACAATATCTT TGATGAGGTG
CACACTTATG ATGATGACTT CATCAAGGTT GTGAATGATG GTTCCAGCCG TAAGCGTAAG
AATTGGATAA CCTGGTACAT TTCCACAAAT GGAACGAAGC GTGACAAGCT CTTTGATAAG
TATTACAACA TCTGGGTAGA TATCCTTGAT GACAAGATTA TCAATGATTC TGTCATGCCT
TGGATTTATC AGTTGGACGA TGTGTCAGAG ATTCATGACC CTGATATGTG GCAGAAAGCT
ATGCCATTAC TTGGTATCAC GACAGAGAAA GAAACCATCG CTCGTGATAT TGAGATGAGC
AAGAATGATC CAGCACAACA AGCTGAGCTG ATGGCTAAGA CTTTCAATCT TCCTGTCAAC
AACTATCTTG CTTACTTCAG CAATGAAGAG TGTAAAGGTT GGTCAGATAA GTTTGATGAG
AGTTTGTTTG TCGGAGATGA TGAACGGAAC GCCCGTTGTG TGATTGGGAT TGACTTGTCA
GATGTCAATG ACATCTGCTC TATCTCTTTT ATGGTTGTGC GTGGGGAAGA ACGGCACTAT
CTAAACAAGA AATTCATGCC ACGGCATACC ATTGAGACAT TGCCAAAGGA ACTGCGTGAT
AAGTACACTG AGTGGGAATT AAGTGGCATG CTGCATGTGC ATGAATTGGA CTACAATGAC
CAAGCCTATA TATTTGAAGA GTTACGGCAG TTTATGAGTG ACAACAGAAT TTTGCCTGTG
GCAGTCGGTT ATGACCGCTA CAATGCAAGG GAACTTATTC GCTTGTTTAA CGACTACTAC
GGGGATATTT GTCACGATAT TCCCCAGACG GTCAAATCGT TATCAAATCC GCTCAAGGTT
TACAAGGAGA AGGCTAAGAT GGGCAAAATC ATCTTTGATG ATCCTGTGGC GACATGGAAT
CATGCCAACG TCCGTGTCAA AATTGATGCC AATAACAATA TTTTTCCAAA CAAGGAAAAG
GCAAAAGAAA AGATTGATGT CTTTGCTAGT CAGCTAGATG CCTTTATCTG TTATGAAAAT
TTCAAGGAAG ACTTGAGCTA CTACTTTGAT TGA
 
Protein sequence
MVEMRYFDKY AQLIYTGKIR ICKLTMKSIR RVERYKEQYL FKQEEADKRI EFIEEECSNT 
KGLAGKLRLA LPQKVWLETT WGFYHTVEVT KTNPDTLEEY TDYEERRLIH EVPIIVPRGT
GKTTLGSAIA EVGQIIDGEW GADIQLLAYS REQAGYLFNA SRAMLSNEES LLHYMREADI
LRSTKQGILY ETTNSLMSIK TSDYESLDGT NAHYNIFDEV HTYDDDFIKV VNDGSSRKRK
NWITWYISTN GTKRDKLFDK YYNIWVDILD DKIINDSVMP WIYQLDDVSE IHDPDMWQKA
MPLLGITTEK ETIARDIEMS KNDPAQQAEL MAKTFNLPVN NYLAYFSNEE CKGWSDKFDE
SLFVGDDERN ARCVIGIDLS DVNDICSISF MVVRGEERHY LNKKFMPRHT IETLPKELRD
KYTEWELSGM LHVHELDYND QAYIFEELRQ FMSDNRILPV AVGYDRYNAR ELIRLFNDYY
GDICHDIPQT VKSLSNPLKV YKEKAKMGKI IFDDPVATWN HANVRVKIDA NNNIFPNKEK
AKEKIDVFAS QLDAFICYEN FKEDLSYYFD