Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_D0063 |
Symbol | |
ID | 5585769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009788 |
Strand | - |
Start bp | 61001 |
End bp | 62572 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640913875 |
Product | IS66 family transposase |
Protein accession | YP_001451525 |
Protein GI | 157149461 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3436] Transposase and inactivated derivatives |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.394688 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACCT CACTTGCTCA TGAGAACGCC CGCCTGCGGG CACTGTTGCA GACGCAACAG GACACCATAC GCCAGATGGC TGAATACAAC CGCCTGCTCT CACAGCGGGT GGCGGCTTAT GCTTCCGAAA TCAACCGGCT GAAGGCGCTG GTTGCGAAAC TGCAACGTAT GCAGTTCGGT AAAAGCTCAG AAAAACTTCG TGCAAAAACC GAACGGCAGA TACAGGAAGC ACAGGAGCGA ATCAGCGCAC TTCAGGAAGA AATGGCGGAA ACGCTGGGTG AGCAATATGA CCCGGTACTG CCATCCGCCC TGCGCCAGTC TTCAGCCCGT AAACCGTTAC CGGCCTCACT TCCCCGTGAA ACCCGGGTTA TCCGGCCGGA AGAGGAATGC TGTCCTGCCT GTGGTGGTGA ACTCAGTTCT CTGGGATGTG ATGTGTCAGA GCAACTGGAG CTTATCAGCA GCGCCTTTAA GGTTATCGAA ACACAACGTC CGAAACAGGC CTGTTGCCGG TGCGACCATA TCGTGCAGGC ACCAGTACCT TCAAAACCCA TTGCACGCAG TTATGCCGGA GCGGGGCTTC TGGCCCATGT TGTCACCGGG AAATATGCAG ACCATCTGCC GTTATACCGC CAGTCAGAAA TATACCGTCG TCAGGGAGTG GAGCTGAGCC GTGCCACACT GGGGCGCTGG ACAGGTGCTG TTGCTGAACT GCTGGAGCCG CTGTATGACG TCCTGCGCCA GTATGTGCTG ATGCCCGGTA AAGTCCATGC TGATGATATC CCCGTCCCGG TCCAGGAGCC GGGCAGCGGT AAAACCCGGA CAGCCCGGCT GTGGGTCTAC GTCCGTGATG ACCGTAACGC CGGTTCACAG ATGCCCCCGG CGGTCTGGTT CGCGTACAGT CCGGACCGGA AAGGTATCCA TCCACAAAAT CACCTGGCCG GTTACAGCGG TGTGCTTCAG GCCGATGCTT ACGGTGGTTA CCGGGCGTTA TACGAATCCG GCAGAATAAC GGAAGCCGCG TGTATGGCTC ATGTCCGGAG AAAAATCCAC GATGTGCATG CAAGAGCGCC CACCTACATC ACCACGGAAG CCCTGCAGCG TATCGGTGAA CTGTATGCCA TCGAGGCAGA GGTCCGGGGC TGTTCAGCAG AACAGCGTCT GGCGGCAAGA AAAGCCAGAG CCGCGCCACT GATGCAGTCA CTGTATGACT GGATACAGCA ACAGATGAAA ACACTGTCGC GTCACTCAGA TACGGCAAAA GCGTTCGCAT ACCTGCTGAA ACAGTGGGAT GCACTGAACG TGTACTGCAG TAATGGCTGG GTGGAAATCG ACAACAACAT CGCAGAGAAC GCCTTACGGG GAGTGGCCGT AGGCCGGAAA AACTGGATGT TCGCGGGTTC CGACAGCGGT GGTGAACATG CGGCGGTGTT GTACTCGCTG ATCGGCACAT GCCGTCTGAA CAATGTGGAG TCAGAAAAGT GGCTGCGTTA CGTCATTGAA CATATCCAGG ACTGGCCGGC AAACCGGGTA CGCGATCTGT TGCCCTGGAA AGTTGATCTG AGCTCTCAGT AA
|
Protein sequence | MDTSLAHENA RLRALLQTQQ DTIRQMAEYN RLLSQRVAAY ASEINRLKAL VAKLQRMQFG KSSEKLRAKT ERQIQEAQER ISALQEEMAE TLGEQYDPVL PSALRQSSAR KPLPASLPRE TRVIRPEEEC CPACGGELSS LGCDVSEQLE LISSAFKVIE TQRPKQACCR CDHIVQAPVP SKPIARSYAG AGLLAHVVTG KYADHLPLYR QSEIYRRQGV ELSRATLGRW TGAVAELLEP LYDVLRQYVL MPGKVHADDI PVPVQEPGSG KTRTARLWVY VRDDRNAGSQ MPPAVWFAYS PDRKGIHPQN HLAGYSGVLQ ADAYGGYRAL YESGRITEAA CMAHVRRKIH DVHARAPTYI TTEALQRIGE LYAIEAEVRG CSAEQRLAAR KARAAPLMQS LYDWIQQQMK TLSRHSDTAK AFAYLLKQWD ALNVYCSNGW VEIDNNIAEN ALRGVAVGRK NWMFAGSDSG GEHAAVLYSL IGTCRLNNVE SEKWLRYVIE HIQDWPANRV RDLLPWKVDL SSQ
|
| |