Gene Information |
Locus tag | Ent638_3948 |
Symbol | |
ID | 5114668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | - |
Start bp | 4277635 |
End bp | 4278966 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640494162 |
Product | proline dipeptidase |
Protein accession | YP_001178654 |
Protein GI | 146313580 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
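The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4277635
END_BP = 4278966
GENE_LENGTH_BP = 1332
PROTEIN_LENGTH_AA = 443


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: the span covers 1332 bp, which is 444 codons, or 443 residues plus one stop codon.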
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.34685 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.023319 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTCAC TGGCAGCACT TTATAAAAAT CATATTGTTA CGTTGCAAGA ACGTACCCGC GACGTACTGA CTCGTTTTAA ACTCGATGCG CTGCTTATCC ACTCCGGTGA GCTGTTGAAT GTCTTCCTCG ATGACCATGC TTATCCGTTC AAGGTTAACC CACAGTTCAA AGCCTGGGTT CCGGTAACGC AGGTTCCAAA CTGCTGGTTG CTGGTTGATG GCGTGAACAA ACCGAAACTG TGGTTCTACT TGCCGGTCGA TTACTGGCAC AACGTTGAAC CGTTGCCGAC GACGTTCTGG ACGGAAGAAG TGGATGTGAT CGCGCTGCCG AAAGCGGACG GTATTGGCAG CCAGTTACCT GCTGCACGTG GCAACATCGC CTACATCGGT CCGGTTCCTG AGCGCGCGTT GGGTCTGGAT ATTCCGGCAG ACAAAATCAA CCCGAAAGGC GTGATCGATT ATCTGCATTA CTACCGCGCT TATAAGACCG ACTATGAACT GTCCTGCATG CGCGAAGCGC AGAAAACCGC CGTGAATGGT CATCAGGCGG CGCACGAAGC GTTCCTGTCC GGCATGAGCG AGTTCGATAT TAATCAGGCT TACCTGACGG CGACGGGTCA TCGTGATACC GATGTGCCTT ACGGGAATAT TGTGGCGCTG AACGAGCACG CCTCCGTTCT ACACTACACC AAACTGGATC ACCGCGCGCC TTCGGAAATT CGCAGTTTCC TGCTGGATGC GGGTGCTGAG TACAACGGTT ACGCGGCGGA TCTGACGCGT ACCTGGGCCG CAAACAGCGA TACCGATTTT GCGCATCTGA TTAAAGACGT GAACGACGAA CAGCTGGCGC TTATCAGCAC CATGAAAGCG GGCACGAGCT ATGTTGACTA TCATATTCAG TTCCATCAGC GCATCGCTAA GCTGCTGCGT AAGCATCAGA TTGTGACGGA TATGAGCGAA GAGGCGATGG TCGAAAACGA TCTCACCGGG CCTTTTATGC CGCACGGTAT TGGTCATCCG CTGGGTCTGC AGGTTCATGA TGTGGCCGGT TTTATGCAGG ATGATACGGG AACGCATCTG GCGGCACCGT CTAAATATCC GTACCTGCGT TGCACTCGCG TACTGGAACC GCGCATGGTG TTGACCATTG AGCCAGGCAT CTACTTTATT GATTCTCTGC TGAATCCATG GCGTGAAGGC CAGTTCAGCA AGCACTTCAA CTGGCAGAAA ATTGATGCGC TGAAACCGTT TGGTGGCATT CGTATTGAAG ATAACGTGGT GGTTCACGAG AACAATATCG AAAACATGAC GCGAGATCAG AAGCTGGCGT GA
|
Protein sequence | MDSLAALYKN HIVTLQERTR DVLTRFKLDA LLIHSGELLN VFLDDHAYPF KVNPQFKAWV PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPTTFW TEEVDVIALP KADGIGSQLP AARGNIAYIG PVPERALGLD IPADKINPKG VIDYLHYYRA YKTDYELSCM REAQKTAVNG HQAAHEAFLS GMSEFDINQA YLTATGHRDT DVPYGNIVAL NEHASVLHYT KLDHRAPSEI RSFLLDAGAE YNGYAADLTR TWAANSDTDF AHLIKDVNDE QLALISTMKA GTSYVDYHIQ FHQRIAKLLR KHQIVTDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDTGTHL AAPSKYPYLR CTRVLEPRMV LTIEPGIYFI DSLLNPWREG QFSKHFNWQK IDALKPFGGI RIEDNVVVHE NNIENMTRDQ KLA
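The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be processed, the code below joins the blocks, computes a GC fraction, and translates the coding sequence. It relies on translation table 11 assigning the same amino acid to each codon as the standard genetic code (the two differ in permitted start codons), and it assumes the gene sequence is already given in coding orientation, which is consistent with it starting with ATG even though the strand is "-". The helper names are illustrative only.

```python
BASES = "TCAG"
# Amino-acid assignments in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(blocked: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = join_blocks(blocked)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(blocked: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon.

    Codons containing anything other than A, C, G, T raise KeyError.
    """
    seq = join_blocks(blocked)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence above.
    print(translate("ATGGACTCAC TGGCAGCACT TTATAAAAAT"))  # MDSLAALYKN
    print(gc_fraction("ATGGACTCAC TGGCAGCACT"))
```

The first three blocks translate to MDSLAALYKN, matching the first block of the protein sequence. Passing the full Gene sequence field to `translate` and comparing the result with the joined Protein sequence field would be a natural whole-record check.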