Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2950 |
Symbol | |
ID | 6064758 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3217156 |
End bp | 3218292 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641602362 |
Product | transposase IS4 family protein |
Protein accession | YP_001725904 |
Protein GI | 170020950 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG5433] Transposase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.179437 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACTTA AAAAACTGAT GGAACATATT TCTATTATCC CCGATTACAG ACAAGCCTGG AAAGTAGAAC ATAAATTGTC AGATATTCTA CTGTTGACTA TTTGTGCCGT TATTTCTGGT GCAGAGGGTT GGGAAGATAT AGAGGATTTT GGGGAAACAC ATCTCGATTT TTTGAAGCAA TATGGTGATT TTGAAAATGG TATTCCTGTT CACGATACCA TTGCCAGAGT TGTATCCTGT ATCAGTCCTG CAAAATTTCA CGAGTGCTTT ATTAACTGGA TGCGTGACTG CCATTCTTCA GATGATAAAG ACGTCATTGC AATTGATGGA AAAACGCTCC GGCACTCTTA TGACAAGAGT CGCCGCAGGG GAGCGATTCA TGTCATTAGT GCGTTCTCAA CAATGCACAG TCTGGTCATC GGACAGATCA AGACGGATGA GAAATCTAAT GAGATTACAG CCATTCCTGA ACTTCTTAAC ATGCTGGATA TTAAAGGAAA AATCATCACA ACTGATGCGA TGGGATGCCA GAAAGATATT GCAGAGAAGA TACAAAAACA GGGCGGTGAT TATTTATTCG CTGTAAAAGG AAACCAGGGG CGGCTTAATA AAGCCTTTGA GGAAAAATTT CCGCTGAAAG AATTAAATAA TCCAGAGCAT GACAGTTACG CAATGAGTGA AAAGAGTCAC GGCAGAGAAG AAATCCGTCT TCATATTGTT TGCGATGTCC CTGATGAACT TATTGATTTC ACGTTTGAAT GGAAAGGACT GAAGAAATTA TGCGTGGCAG TCTCCTTTCG GTCAATAATA GCAGAACAAA AGAAAGAGCC AGAAATGACG GTCAGATATT ATATCAGTTC TGCTGATTTA ACCGCAGAAA AGTTCGCCAC AGCAATCCGA AACCACTGGC ACGTGGAGAA TAAGCTGCAC TGGCGTCTGG ACGTGGTAAT GAATGAAGAC GACTGCAAAA TAAGAAGAGG AAACGCCGCA GAATTATTTT CAGGGATACG GCACATCGCT ATTAATATTT TAACGAATGA TAAGGTATTC AAGGCAGGGT TAAGACGTAA GATGCGAAAA GCAGCCATGG ATAGAAACTA TCTCGCGTCA GTCCTTGCGG GGAGCGGGCT TTCGTAA
|
Protein sequence | MELKKLMEHI SIIPDYRQAW KVEHKLSDIL LLTICAVISG AEGWEDIEDF GETHLDFLKQ YGDFENGIPV HDTIARVVSC ISPAKFHECF INWMRDCHSS DDKDVIAIDG KTLRHSYDKS RRRGAIHVIS AFSTMHSLVI GQIKTDEKSN EITAIPELLN MLDIKGKIIT TDAMGCQKDI AEKIQKQGGD YLFAVKGNQG RLNKAFEEKF PLKELNNPEH DSYAMSEKSH GREEIRLHIV CDVPDELIDF TFEWKGLKKL CVAVSFRSII AEQKKEPEMT VRYYISSADL TAEKFATAIR NHWHVENKLH WRLDVVMNED DCKIRRGNAA ELFSGIRHIA INILTNDKVF KAGLRRKMRK AAMDRNYLAS VLAGSGLS
|
| |