Gene EcSMS35_4353 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4353 
Symbol 
ID6145049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4442260 
End bp4443588 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content54% 
IMG OID641619174 
ProductIS4 transposase 
Protein accessionYP_001746298 
Protein GI170681391 
COG category[L] Replication, recombination and repair 
COG ID[COG3385] FOG: Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.621464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACATTG GACAGGCTCT TGATCTGGTA TCCCGTTACG ATTCTCTGCG TAACCCACTG 
ACTTCTCTGG GGGATTACCT CGACCCCGAA CTCATCTCTC GTTGCCTTGC CGAATCAGGT
ACTGTAACGC TACGCAAGCG CCGTCTTCCC CTCGAAATGA TGGTCTGGTG TATTGTTGGC
ATGGCGCTTG AGCGTAAAGA ACCTCTTCAC CAGATTGTGA ATCGCCTGGA CATCATGCTG
CCGGGCAATC GCCCCTTCGT TGCCCCCAGT GCCGTTATTC AGGCCCGCCA GCGCCTGGGA
AGTGAGGCTG TCCGCCGCGT GTTCACGAAA ACAGCGCAGC TCTGGCATAA CGCCACGCCG
CATCCGCACT GGTGCGGCCT GACCCTGCTG GCCATCGATG GTGTGTTCTG GCGCACACCG
GATACACCAG AGAACGATGC AGCCTTCCCC CGCCAGACAC ATGCCGGGAA CCCGGCGCTC
TACCCGCAGG TCAAAATGGT CTGCCAGATG GAACTGACCA GCCATCTGCT GACGGCTGCA
GCCTTCGGCA CGATGAAGAA CAGCGAAAAT GAGCTTGCTG AGCAACTTAT AGAACAAACC
GGCGATAACA CTCTGACGTT AATGGATAAA GGTTATTACT CACTGGGACT GTTAAATGCC
TGGAGCCTGG CGGGAGAACA CCGCCACTGG ATGATACCTC TCAGAAAGGG AGCGCAATAT
GAAGAGATCA GAAAACTGGG TAAAGGCGAT CATCTGGTGA AGCTGAAAAC CAGCCCGCAG
GCACGAAAAA AGTGGCCGGG ACTGGGAAAT GAAGTGACAG CCCGCCTGCT GACCGTGACG
CGCAAAGGAA AAGTCTGCCA TCTGCTGACG TCGATGACGG ACGCCATGCG CTTCCCCGGA
GGAGAAATGG CGGATCTGTA CAGTCATCGC TGGGAAATCG AACTGGGATA CAGGGAGATA
AAACAGACGA TGCAACTGAG CAGGCTGACG CTGAGAAGTA AAAAACCGGA GCTTGTGGAG
CAAGAGCTGT GGGGTGTCTT ACTGGCTTAT AATCTGGTGA GATATCAGAT GATTAAAATG
GCGGAATATC TGAAAGGTTA CTGGCCGAAT CAACTGAGTT TCTCAGAATC ATGTGGAATG
GTGATGAGAA TGCTGATGAC ATTGCAGGGC GCTTCACCGG GACGTATACC GGAGCTGATG
CGCGATCTTG CAAGTATGGG ACAACTTGTG AAATTACCGA CAAGAAGGGA AAGGGCCTTC
CCGAGAGTGG TAAAGGAGAG GCCCTGGAAA TACCCCACAG CCCCGAAAAA GAGCCAGTCA
GTTGCTTAA
 
Protein sequence
MHIGQALDLV SRYDSLRNPL TSLGDYLDPE LISRCLAESG TVTLRKRRLP LEMMVWCIVG 
MALERKEPLH QIVNRLDIML PGNRPFVAPS AVIQARQRLG SEAVRRVFTK TAQLWHNATP
HPHWCGLTLL AIDGVFWRTP DTPENDAAFP RQTHAGNPAL YPQVKMVCQM ELTSHLLTAA
AFGTMKNSEN ELAEQLIEQT GDNTLTLMDK GYYSLGLLNA WSLAGEHRHW MIPLRKGAQY
EEIRKLGKGD HLVKLKTSPQ ARKKWPGLGN EVTARLLTVT RKGKVCHLLT SMTDAMRFPG
GEMADLYSHR WEIELGYREI KQTMQLSRLT LRSKKPELVE QELWGVLLAY NLVRYQMIKM
AEYLKGYWPN QLSFSESCGM VMRMLMTLQG ASPGRIPELM RDLASMGQLV KLPTRRERAF
PRVVKERPWK YPTAPKKSQS VA