Gene EcSMS35_4323 details

Gene Information

Locus tag: EcSMS35_4323
Symbol:
ID: 6142866
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4420121
End bp: 4421893
Gene Length: 1773 bp
Protein Length: 590 aa
Translation table: 11
GC content: 58%
IMG OID: 641619144
Product: phage large terminase subunit GpP
Protein accession: YP_001746268
Protein GI: 170683976
COG category: [S] Function unknown
COG ID: [COG5484] Uncharacterized conserved protein
TIGRFAM ID:
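
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue to the protein. The variable names are invented for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 4420121
end_bp = 4421893
gene_length_bp = 1773
protein_length_aa = 590

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codons = span // 3
residues = codons - 1

print(span, span % 3, residues)  # 1773 0 590
```

Under these assumptions the listed gene length, the reading frame (a multiple of three), and the protein length agree with one another.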


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.00512482
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCATCA CCACAGACAC CACTCTTTTA CACGACCCGC GTCGTCAGGC GGCGCTGCTG 
TACTGGCAGG GGTTTTCCGT GCCGCAGATT GCCGCCATGT TGCAGATGAA ACGCCCGACG
GTGCAGAGCT GGAAACAGCG CGACGGCTGG GACAGCGTTG CCCCCATCAG CCGTGTCGAA
ATGAGTCTGG AAGCGCGGCT GACCCAGCTC ATCATCAAAC CGCAGAAAAC CGGCGGTGAC
TTCAAGGAAA TTGACCTGCT GGGACGCCAG ATTGAACGAC TGGCACGGGT CAACCGCTAC
AGTCAGACCG GCAACGAGGC AGACCTTAAT CCGAACATCG CTAACCGCAA CAAAGGCGGG
CGGCGCAAAC CGAAAAAGAA TTTTTTCAGC GACGAAGCCA TCGAAAAGCT GGAGCAGATT
TTCTTTGAGC AGTCTTTCGA CTATCAGTTG CACTGGTATC GCGCCGGGCT TGAGCACCGC
ATCCGCGATA TCCTGAAATC CCGCCAGATT GGCGCAACGT TTTATTTTTC CCGCGAGGCG
CTGCTGCGTG CCCTGAAAAC CGGTCATAAC CAGATTTTTC TGTCGGCCAG TAAAACGCAG
GCGTATGTGT TCCGCGAATA CATCATCGCC TTTGCCCGGC TGGTTGACGT TGACCTGACC
GGTGACCCGA TTGTCCTGGG CAATAACGGC GCAAAACTGA TTTTTCTCGG CACCAACTCC
AACACCGCGC AGAGCCATAA CGGCGACCTG TACGTCGACG AGATTTTCTG GATCCCGAAT
TTTCAGGTAC TGCGTAAGGT GGCATCAGGT ATGGCCTCAC AGAGTCACCT GCGCTCGACC
TATTTCTCCA CCCCGTCCAC GCTGGCGCAC GACGCCTACC CGTTCTGGTC GGGTGAACTG
TTTAACCGGG GACGCGCCAG CGCCGCCGAA CGCGTGGAAA TCGACGTCAG TCATAACGCC
CTTGCCGGTG GGCTTCTCTG TGCGGACGGC CAGTGGCGGC AGATTGTCAC CATTGAGGAC
GCCCTGAAAG GCGGCTGCAC GCTGTTCGAC ATTGAGCAGC TCAAACGCGA AAACAGCGCC
GACGATTTTA AAAACCTGTT CATGTGTGAA TTTGTTGACG ACAAGGCGTC GGTATTCCCG
TTCGAGGAGC TGCAACGCTG CATGGTCGAC ACGCTGGAAG AATGGGAAGA CTATGCGCCG
TTTGCCGCCA ATCCGTTCGG CTCCCGCCCG GTATGGATTG GTTACGACCC GTCACACCGT
GGCGACAGCG CCGGATGCGT GGTGCTGGCA CCGCCGGTGG TGGCCGGAGG TAAATTCAGA
ATACTTGAGC GTCACCAGTG GAAAGGCATG GACTTTGCCA CTCAGGCGGA ATCCATCCGC
AAACTCACCG AAAAATACAA CGTCGAATAC ATCGGAATTG ATGCCACCGG CCTCGGTGTC
GGCGTGTTCC AGCTCGTGCG CTCGTTCTAT CCCGCCGCGC GCGATATCCG CTACACGCCG
GAAATGAAAA CCGCAATGGT GCTCAAGGCA AAAGACGTCA TCCGCCGTGG CTGTCTGGAA
TATGACGTCA GCGCCACCGA CATCACCAGC TCGTTCATGG CTATCCGCAA GACCATGACC
AGCAGCGGAC GCAGCGCCAC CTATGAGGCC AGCCGTAGCG AGGAAGCCAG CCACGCCGAC
CTCGCCTGGG CGACCATGCA CGCCCTGTTA AATGAGCCAC TCACCGCCGG TATCAGCACC
CCGCTGACAT CCACCATTCT GGAGTTTTAC TGA
 
Protein sequence
MTITTDTTLL HDPRRQAALL YWQGFSVPQI AAMLQMKRPT VQSWKQRDGW DSVAPISRVE 
MSLEARLTQL IIKPQKTGGD FKEIDLLGRQ IERLARVNRY SQTGNEADLN PNIANRNKGG
RRKPKKNFFS DEAIEKLEQI FFEQSFDYQL HWYRAGLEHR IRDILKSRQI GATFYFSREA
LLRALKTGHN QIFLSASKTQ AYVFREYIIA FARLVDVDLT GDPIVLGNNG AKLIFLGTNS
NTAQSHNGDL YVDEIFWIPN FQVLRKVASG MASQSHLRST YFSTPSTLAH DAYPFWSGEL
FNRGRASAAE RVEIDVSHNA LAGGLLCADG QWRQIVTIED ALKGGCTLFD IEQLKRENSA
DDFKNLFMCE FVDDKASVFP FEELQRCMVD TLEEWEDYAP FAANPFGSRP VWIGYDPSHR
GDSAGCVVLA PPVVAGGKFR ILERHQWKGM DFATQAESIR KLTEKYNVEY IGIDATGLGV
GVFQLVRSFY PAARDIRYTP EMKTAMVLKA KDVIRRGCLE YDVSATDITS SFMAIRKTMT
SSGRSATYEA SRSEEASHAD LAWATMHALL NEPLTAGIST PLTSTILEFY
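
The sequences above are printed in blocks of ten characters, so the spacing has to be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the listed protein. It is not the method used to generate this record. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ only in permitted start codons). The sketch assumes the sequence contains only A, C, G, and T.

```python
from itertools import product

# Standard amino acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text):
    """Remove the block spacing and line breaks, and uppercase the result."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from the first base, stopping at a stop codon.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    dna = clean_sequence(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the last line of the gene sequence, as printed above.
first_bases = "ATGACCATCA CCACAGACAC CACTCTTTTA"
last_line = "CCGCTGACAT CCACCATTCT GGAGTTTTAC TGA"

print(translate(first_bases))  # MTITTDTTLL
print(translate(last_line))    # PLTSTILEFY
```

The two translated fragments match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.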