Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2203 |
Symbol | |
ID | 6064820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 2422154 |
End bp | 2423290 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641601609 |
Product | transposase IS4 family protein |
Protein accession | YP_001725168 |
Protein GI | 170020214 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5433] Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000366545 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACTTA AAAAATTGAT GGAACATATT TCTATTATCC CCGATTACAG ACAAGCCTGG AAAGTGGAAC ATAAATTATC GGATATTCTA CTGTTAACTA TTTGTGCCGT TATTTCTGGC GCAGAGGGTT GGGAAGATAT AGAGGATTTT GGGGAAACAC ATCTCGATTT TTTGAAGCAA TATGGTGATT TTGAAAATGG TATTCCTGTT CACGATACCA TTGCCAGAGT TGTATCCTGT ATCAGTCCTG CAAAATTTCA CGAGTGCTTT ATTAACTGGA TGCGTGACTG CCATTCTTCA GATGATAAAG ACGTCATTGC AATTGATGGA AAAACGCTCC GGCACTCTTA TGACAAGAGT CGCCGCAGGG GAGCGATTCA TGTCATTAGT GCGTTCTCAA CAATGCACAG TCTGGTCATC GGACAGATCA AGACGGATGA GAAATCTAAT GAGATTACAG CTATCCCAGA ACTTCTTAAC ATGCTGGATA TTAAAGGAAA AATCATCACA ACTGATGCGA TGGGTTGCCA GAAAGATATT GCAGAGAAGA TACAAAAACA GGGAGGTGAT TATTTATTCG CGGTAAAAGG AAACCAGGGG CGGCTAAATA AAGCCTTTGA GGAAAAATTT CCGCTGAAAG AATTAAATAA TCCAGAGCAT GACAGTTACG CAATCAGTGA AAAGAGTCAC GGCAGAGAAG AAATCCGTCT TCATATTGTT TGCGATGTCC CTGATGAACT TATTGATTTC ACGTTTGAAT GGAAAGGTCT GAAGAAATTA TGCGTGGCAG TCTCCTTTCG GTCAATAATA GCAGAACAAA AGAAAGAGCC CGAAATGACG GTCAGATATT ATATCAGTTC TGCTGATTTA ACCGCAGAGA AGTTCGCCAC AGCAATCCGA AACCACTGGC ACGTGGAGAA TAAGCTGCAC TGGCGTCTGG ACGTGGTAAT GAATGAAGAC GACTGCAAAA TAAGAAGAGG AAACGCCGCA GAATTATTTT CAGGGATACG GCACATCGCT ATTAATATTT TAACGAATGA TAAGGTATTC AAGGCAGGGT TAAGACGTAA GATGCGAAAA GCAGCCATGG ACAGAAACTA TCTCGCGTCA GTCCTTGCGG GGAGCGGGCT TTCGTAA
|
Protein sequence | MELKKLMEHI SIIPDYRQAW KVEHKLSDIL LLTICAVISG AEGWEDIEDF GETHLDFLKQ YGDFENGIPV HDTIARVVSC ISPAKFHECF INWMRDCHSS DDKDVIAIDG KTLRHSYDKS RRRGAIHVIS AFSTMHSLVI GQIKTDEKSN EITAIPELLN MLDIKGKIIT TDAMGCQKDI AEKIQKQGGD YLFAVKGNQG RLNKAFEEKF PLKELNNPEH DSYAISEKSH GREEIRLHIV CDVPDELIDF TFEWKGLKKL CVAVSFRSII AEQKKEPEMT VRYYISSADL TAEKFATAIR NHWHVENKLH WRLDVVMNED DCKIRRGNAA ELFSGIRHIA INILTNDKVF KAGLRRKMRK AAMDRNYLAS VLAGSGLS
|
| |