Gene EcSMS35_1186 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1186 
Symbol 
ID6146211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1192092 
End bp1193849 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content52% 
IMG OID641616064 
Productputative phage terminase, large subunit 
Protein accessionYP_001743247 
Protein GI170684273 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.619537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.0000000060413 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAAAG TGGCTGACGG GATCCGCTAC GCCGAACGTG TTGTTGCAGG AGAAATTGTT 
GCTGGCGAAT TTGTCCGCCT GGCCTGCCAG CGTTTTCTTG ATGATCTGAA GTACGGCGAA
GAGCGGGGGA TTTATTTCAG TGAACCCCGT GCGCAGCACA TCCTGAATTT CTACAAATTT
GTGCCTCATG TAAAAGGGGC GCTGGCAGGC CAGCCCATTG AGTTGATGGA CTGGCATGTA
TTTATCCTCA TTAATATTTT TGGTTTTGTC ATTCCGCTGG TCAATGAAGA GACCGGGGAA
GTTGTCATGC GCAGCGATGG CAGCGGACGT CCGGTGATGG TGCGCCGGTT CCGGACGGCG
TACAACGAAG TCGCCCGTAA AAACGCAAAA TCAACTCTGT CATCGGGTAT CGGCCTGTAT
ATGACGGGGG CAGATGGTGA AGGCGGAGCT GAGGTGTATT CAGCCGCAAC CACGCGTGAC
CAGGCCAGAA TCGTGTTTGA AGACGCCAAA AATATGGTCA GAAAAGCCCG GTCGACACTC
GGGCGGTTGT TTGATTTCAA CAAGCTGGCG ATTTACCAGG AGCAGAGCGC ATCAAAATTT
GAACCGCTTT CTTCGGATGC AAACAACCTG GATGGTCTGA ACATCCACTG CGCCATTATT
GATGAGCTGC ATGCACATAA AACTCGTGAC GTGTGGGACG TTCTGGAAAC GGCAACCGGT
GCCCGTCTGC AGTCCCTTTT ATTTGGTATC ACCACGGCAG GGTTTAACAA GGAAGGGATT
TGTTACGAGC AGCGTGATTA CGCCATCAAG GTATTGCGTG GCTATAACAG CGACGTGGAG
GGCGCGGTAA AAGACGACTC CTACTTTGCG ATTATTTACA CCCTCGATGA GGGAGATGAT
CCGTTTGATG AAACGGTCTG GCAGAAAGCG AATCCCGGCC TGGGCATCTG TAAACGCTGG
GATGATCTGC GTCGCCTGGC GAAAAAAGCG AAAGAACAGG TCTCTGCGCG GGTGAATTTT
TTTACCAAAC ACATGAATGT GTGGGTAACA GCAGAGTCTG CCTGGATGGA CATGATTAAG
TGGGATAAGT GCGAATACAT TGCCCCACGA CATGAGCTGA AAACGTATCC CATGTGGGTC
GGCGTTGACC TTGCTCATAA GATTGATATC TGTGCGGCGG CAAAACTCTG GCGAACGGAT
AACGGGCATG TTCATGCCGA TTTTAAATTC TGGCTTCCGG AAGGACGGCT GGAACGATGC
TCGCGGCAGC AGGCAGAACT TTACCGGAAG TGGGCGGAGA TGGATAAGCT GATTCTGACG
GATGGTGATG TTATCGATCA TGCTCAGATA AAAAGTGACT TACTGGAATG GATTGGTGGT
GAAAACCTCA GGGAACTGGG ATTTGACCCG TGGAGCGCGA TGCAGTTCAG CCTGGCACTG
GCTGAAGAAG GGATACCGCT GGTGGAGGTT CCGCAGACGG TTCGCAATCT GTCAGAGGCC
ATGAAGGAAA CGGAATCACT GGTCTATGCC GGGCGTTTCC ATCACAGCAA TCATCCGGTC
ATGAACTGGA TGATGTCTAA CGTTACGGTA AAACCGGACA AAAACGACAA TATCTTCCCG
AATAAATCCA CGCTGGAAGC CAAAATCGAC GGCCCTGTTG CGATGTTTAC AGCAATGAGC
CGGATGCTGG TCAATGGTGG TGAACCGGAG CTGGATCTGT CTGAACATCT GGTCAGCGTG
GGCATCCGCT CGCTTTAA
 
Protein sequence
MAKVADGIRY AERVVAGEIV AGEFVRLACQ RFLDDLKYGE ERGIYFSEPR AQHILNFYKF 
VPHVKGALAG QPIELMDWHV FILINIFGFV IPLVNEETGE VVMRSDGSGR PVMVRRFRTA
YNEVARKNAK STLSSGIGLY MTGADGEGGA EVYSAATTRD QARIVFEDAK NMVRKARSTL
GRLFDFNKLA IYQEQSASKF EPLSSDANNL DGLNIHCAII DELHAHKTRD VWDVLETATG
ARLQSLLFGI TTAGFNKEGI CYEQRDYAIK VLRGYNSDVE GAVKDDSYFA IIYTLDEGDD
PFDETVWQKA NPGLGICKRW DDLRRLAKKA KEQVSARVNF FTKHMNVWVT AESAWMDMIK
WDKCEYIAPR HELKTYPMWV GVDLAHKIDI CAAAKLWRTD NGHVHADFKF WLPEGRLERC
SRQQAELYRK WAEMDKLILT DGDVIDHAQI KSDLLEWIGG ENLRELGFDP WSAMQFSLAL
AEEGIPLVEV PQTVRNLSEA MKETESLVYA GRFHHSNHPV MNWMMSNVTV KPDKNDNIFP
NKSTLEAKID GPVAMFTAMS RMLVNGGEPE LDLSEHLVSV GIRSL