Gene EcSMS35_0939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0939 
Symbol 
ID6145601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp955827 
End bp957047 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content48% 
IMG OID641615826 
Productphage integrase family site specific recombinase 
Protein accessionYP_001743018 
Protein GI170682065 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.456123 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGAAAAG CACTAAACAA ACTGAGCGAT TCGACGTTAA AAAAATTGGC GGCTGTCCAG 
GCAGAAAAAG AGCGTTTTTA CTCCGACGGG GGCGGGCTGG AGATTAAACA CTCAAAGGGC
GGCAAATTAA CCTGGTATTT CCGGTACCGA ACGGGAGGCC GTGAGGTTGC CGCAGAGCGG
TTAAAGCTGG GGGCTTACCC TGAATTGTCG CTGAAAGCCG CAAGGGAAAA ACGCACACTG
TGCCGGGCAT GGCTGGCTGA GGGTAAAAAC CCCCGTTATG AGTTGTGCGC CACAGTACAG
GAAGCACTAA AACCCGTTAC GGTGAAGGAA GCGATTAACT ACTGGCTGGA GGAATACGCG
AAGGATAACC GCAAAGATTA CATAAAGCTG GTGCAGCGTA TGGATAAGCA CATCATTTGC
CATATTGGGG CAATTCCCCT TGATAAGTGT GATACAAGGC AGTGGATCGC ATGTTTTGAC
CGCGTACGAA AAAAAGCACC AGTAGCAGCG GGCCATGTCA TGCAGACATG CAAACAGGCG
CTAAAGTTTT GCCGCAGGCG GCGCTACGCG TTTAGCAACG CCCTGGACGA TTTGATCGTT
ACTGATGTGG GTAAGAGAGC AGAAATCCGC GAGAGAGTGC ACAGCAACAG CGAACTAAAA
GAAATTCTAC GCGCTATTGA TGGTGATGTG TTCGCTCCCT ATTACAGTGC GTTAATGCGC
TTGTTAATTG TGTTCGGGTG CAGAACGGCA GAGATCAGAC TTTCAGAGAT CAAAGAATGG
GATCTGAAAG AAATGTTGTG GACAGTGCCA AAAGAGCACA GCAAAACGAA GGTAACAATA
TTCCGACCTA TTCCTGATGG TATTTTGCCG TTCATTCAGA AGCTGGTGGA GCAAAACGCA
CACACTGGGT TATTACTCGG CGAACTGAAA AAAGATACCA CGGTGGCGCA ATATGGACGA
AATGCGCATA AGCGGCTTAA GCAGGAACAC TGGACGCTGC ATGATTTCCG ACACACGTTT
ACAACTATGC TGAATGATTT AGGTGTCGAT CCGCATATCG TGGAGCACAT CACAGCGCAT
CAGATGCCAG GTCAGCAAAA AACCTATAAC CATTCACGCT ATTTGCAGGC GAAACGGGAC
GCACTGAATC TATGGGTTGA GCGTCTTGAT ATGATTGCAG GATATAATGA AAATATTGTG
ATATTGAGAG GGATACAATG A
 
Protein sequence
MGKALNKLSD STLKKLAAVQ AEKERFYSDG GGLEIKHSKG GKLTWYFRYR TGGREVAAER 
LKLGAYPELS LKAAREKRTL CRAWLAEGKN PRYELCATVQ EALKPVTVKE AINYWLEEYA
KDNRKDYIKL VQRMDKHIIC HIGAIPLDKC DTRQWIACFD RVRKKAPVAA GHVMQTCKQA
LKFCRRRRYA FSNALDDLIV TDVGKRAEIR ERVHSNSELK EILRAIDGDV FAPYYSALMR
LLIVFGCRTA EIRLSEIKEW DLKEMLWTVP KEHSKTKVTI FRPIPDGILP FIQKLVEQNA
HTGLLLGELK KDTTVAQYGR NAHKRLKQEH WTLHDFRHTF TTMLNDLGVD PHIVEHITAH
QMPGQQKTYN HSRYLQAKRD ALNLWVERLD MIAGYNENIV ILRGIQ