Gene EcSMS35_3126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3126 
Symbol 
ID6145603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3211704 
End bp3213278 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content56% 
IMG OID641617990 
ProductIS66 family transposase orfB 
Protein accessionYP_001745140 
Protein GI170683550 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCT CACTTGCTCA TGAGAACGCC CGCCTGCGGG CACTGTTGCA GACGCAACAG 
GACACCATCC GCCAGATGGC TGAATACAAC CGCCTGCTCT CACAGAGGGT GGCGGCTTAT
GCTTCTGAAA TCAACCGGCT GAAGGCGCTG GTTGCGAAAC TGCAACGTAT GCAGTTCGGT
AAAAGCTCAG AAAAACTTCG TGCAAAAACC GAACGGCAGA TACAGGAAGC ACAGGAGCGA
ATCAGCGCAC TTCAGGAAGA AATGGCGGAA ACGCTGGGTG AGCAATATGA CCCGGTACTG
CCATCCGCCG CCCTGCGTCA GTCTTCAGCC TGTAAACAGT TACCGGCCTC ACTTCCCCGT
GAAACCCGGG TTATCCGGCC GGAAGAGGAA TGCTGTCCTG CCTGTGGTGG TGAACTCAGT
TCTCTGGGAT GTGATGTGTC AGAGCAACTG GAGCTTATCA GCAGCGCCTT TAAGGTTATC
GAAACACAAC GTCCGAAACA GGCCTGTTGC CGGTGCGACC ATATCGTGCA GGCACCAGTA
CCTTCAAAAC CCATTGCACG CAGTTATGCC GGAGCGGGGC TTCTGGCCCA TGTTGTCACC
GGGAAATATG CAGACCATCT GCCGTTATAC CGCCAGTCAG AAATATACCG TCGTCAGGGA
GTGGAGCTGA GCCGTGCCAC ACTGGGGCGC TGGACAGGTG CTGTTGCTGA ACTGCTGGAG
CCGCTGTATG ACGTCCTGCG CCAGTATGTG CTGATGCCCG GTAAAGTCCA TGCTGATGAT
ATCCCCGTCC CGGTCCAGGA GCCGGGCAGC GGTAAAACCC GGACAGCCCG GCTGTGGGTC
TACGTCCGTG ATGACCGTAA CGCCGGTTCA CAGATGCCCC CGGCGGTCTG GTTCGCGTAC
AGTCCGGACC GGAAAGGTAT CCATCCACAA AATCACCTGG CCGGTTACAG CGGTGTGCTT
CAGGCCGATG CTTACGGTGG TTACCGGGCG TTATACGAAT CCGGCAGAAT AACGGAAGCC
GCGTGTATGG CTCATGCCCG GAGAAAAATC CACGATGTGC ATGCAAGAGC GCCCACCTAC
ATCACCACGG AAGCCCTGCA GCGTATCGGT GAACTGTATG CCATCGAGGC AGAGGTCCGG
GGCTGTTCAG CAGAACAGCG TCTGGCGGCA AGAAAAGCCA GAGCCGCGCC ACTGATGCAG
TCACTGTATG ACTGGATACA GCAACAGATG AAAACACTGT CGCGTCACTC AGATACGGCA
AAAGCGTTCG CATACCTGCT GAAACAGTGG GATGCACTGA ACGTGTACTG CAGTAATGGC
TGGGTGGAAA TCGACAACAA CATCGCAGAG AACGCCTTAC GGGGAGTGGC CGTAGGCCGG
AAAAACTGGA TGTTCGCGGG TTCCGACAGC GGTGGTGAAC ATGCGGCGGT GTTGTACTCG
CTGATCGGCA CATGCCGTCT GAACAATGTG GAGCCAGAAA AGTGGCTGCG TTACGTCATT
GAACATATCC AGGACTGGCC GGCAAACCGG GTACGCGATC TGTTGCCCTG GAAAGTTGAT
CTGAGCTCTC AGTAA
 
Protein sequence
MDTSLAHENA RLRALLQTQQ DTIRQMAEYN RLLSQRVAAY ASEINRLKAL VAKLQRMQFG 
KSSEKLRAKT ERQIQEAQER ISALQEEMAE TLGEQYDPVL PSAALRQSSA CKQLPASLPR
ETRVIRPEEE CCPACGGELS SLGCDVSEQL ELISSAFKVI ETQRPKQACC RCDHIVQAPV
PSKPIARSYA GAGLLAHVVT GKYADHLPLY RQSEIYRRQG VELSRATLGR WTGAVAELLE
PLYDVLRQYV LMPGKVHADD IPVPVQEPGS GKTRTARLWV YVRDDRNAGS QMPPAVWFAY
SPDRKGIHPQ NHLAGYSGVL QADAYGGYRA LYESGRITEA ACMAHARRKI HDVHARAPTY
ITTEALQRIG ELYAIEAEVR GCSAEQRLAA RKARAAPLMQ SLYDWIQQQM KTLSRHSDTA
KAFAYLLKQW DALNVYCSNG WVEIDNNIAE NALRGVAVGR KNWMFAGSDS GGEHAAVLYS
LIGTCRLNNV EPEKWLRYVI EHIQDWPANR VRDLLPWKVD LSSQ