Gene Information |
Locus tag | EcolC_3752 |
Symbol | |
ID | 6067778 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 4104731 |
End bp | 4106242 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641603167 |
Product | leucyl aminopeptidase |
Protein accession | YP_001726686 |
Protein GI | 170021732 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0260] Leucyl aminopeptidase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00102834 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGTTTA GTGTAAAAAG CGGTAGCCCG GAGAAACAGC GGAGTGCCTG CATCGTCGTG GGCGTCTTCG AACCACGTCG CCTTTCTCCG ATTGCAGAAC AGCTCGATAA AATCAGCGAT GGGTACATCA GCGCCCTGCT ACGTCGGGGC GAACTGGAAG GAAAACCGGG GCAGACATTG TTGCTGCACC ATGTTCCGAA TGTACTTTCC GAGCGAATTC TCCTTATTGG TTGCGGCAAA GAACGTGAGC TGGATGAGCG TCAGTACAAA CAAGTCATTC AGAAAACCAT TAATACGCTG AATGATACTG GCTCAATGGA AGCGGTCTGC TTTCTGACTG AACTGCACGT TAAAGGCCGT AACAACTACT GGAAAGTGCG TCAGGCTGTC GAGACGGCAA AAGAGACGCT CTACAGTTTC GATCAGCTGA AAACGAACAA GAGCGAACCG CGTCGTCCGC TGCGTAAGAT GGTGTTCAAC GTGCCGACCC GCCGTGAACT GACCAGCGGT GAGCGCGCGA TCCAGCACGG TCTGGCGATT GCCGCCGGGA TTAAAGCAGC AAAAGATCTC GGCAATATGC CGCCGAATAT CTGTAACGCC GCTTACCTCG CTTCACAAGC GCGCCAGCTG GCTGACAGCT ACAGCAAGAA TGTCATCACC CGCGTTATCG GCGAACAGCA GATGAAAGAG CTGGGGATGC ATTCCTATCT GGCGGTCGGT CAGGGTTCGC AAAACGAATC GCTGATGTCG GTGATTGAGT ACAAAGGCAA CGCGTCGGAA GATGCACGCC CAATCGTGCT GGTGGGTAAA GGTTTAACCT TCGACTCCGG CGGTATCTCG ATCAAGCCTT CAGAAGGCAT GGATGAGATG AAGTACGATA TGTGCGGTGC GGCAGCGGTT TACGGCGTGA TGCGGATGGT CGCGGAGCTA CAACTGCCGA TTAACGTTAT CGGCGTGTTG GCAGGCTGCG AAAACATGCC TGGCGGACGA GCCTATCGTC CGGGCGATGT GTTAACCACC ATGTCCGGTC AAACCGTTGA AGTGCTGAAC ACCGACGCTG AAGGCCGCCT GGTACTGTGC GACGTGTTAA CTTACGTTGA GCGTTTTGAG CCGGAAGCGG TGATTGACGT GGCGACGCTG ACCGGTGCCT GCGTGATCGC GCTGGGTCAT CATATTACTG GTCTGATGGC GAACCATAAT CCGCTGGCCC ATGAACTGAT TGCCGCGTCT GAACAATCCG GTGACCGCGC ATGGCGCTTA CCGCTGGGTG ACGAGTATCA GGAACAACTG GAGTCCAATT TTGCCGATAT GGCGAACATT GGCGGTCGTC CTGGTGGGGC GATTACCGCA GGTTGCTTCC TGTCACGCTT TACCCGTAAG TACAACTGGG CGCACCTGGA TATCGCCGGT ACCGCCTGGC GTTCTGGTAA AGCAAAAGGC GCCACCGGTC GTCCGGTAGC GTTGCTGGCA CAGTTCCTGT TAAACCGCGC TGGGTTTAAC GGCGAAGAGT AA
Protein sequence | MEFSVKSGSP EKQRSACIVV GVFEPRRLSP IAEQLDKISD GYISALLRRG ELEGKPGQTL LLHHVPNVLS ERILLIGCGK ERELDERQYK QVIQKTINTL NDTGSMEAVC FLTELHVKGR NNYWKVRQAV ETAKETLYSF DQLKTNKSEP RRPLRKMVFN VPTRRELTSG ERAIQHGLAI AAGIKAAKDL GNMPPNICNA AYLASQARQL ADSYSKNVIT RVIGEQQMKE LGMHSYLAVG QGSQNESLMS VIEYKGNASE DARPIVLVGK GLTFDSGGIS IKPSEGMDEM KYDMCGAAAV YGVMRMVAEL QLPINVIGVL AGCENMPGGR AYRPGDVLTT MSGQTVEVLN TDAEGRLVLC DVLTYVERFE PEAVIDVATL TGACVIALGH HITGLMANHN PLAHELIAAS EQSGDRAWRL PLGDEYQEQL ESNFADMANI GGRPGGAITA GCFLSRFTRK YNWAHLDIAG TAWRSGKAKG ATGRPVALLA QFLLNRAGFN GEE
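The length fields in this record can be cross-checked against the coordinates and sequences it lists. The following is a minimal sketch, not part of the original record: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to end in a single stop codon (the gene sequence above ends in TAA). The GC helper ignores the spaces used to group the sequence into blocks of ten.

```python
# Values copied from the "Start bp" and "End bp" fields of the record.
START_BP = 4104731
END_BP = 4106242


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G or C bases; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


length = gene_length(START_BP, END_BP)
print(length)                  # 1512, matching "Gene Length"
print(protein_length(length))  # 503, matching "Protein Length"

# First three blocks of the gene sequence, as an illustration only;
# pass the full "Gene sequence" field to compare with "GC content".
print(gc_fraction("ATGGAGTTTA GTGTAAAAAG CGGTAGCCCG"))
```

Applying `gc_fraction` to the complete gene sequence should give a value comparable to the rounded "GC content" field, although how the database rounds that figure is not stated in the record.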