Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_1940 |
Symbol | |
ID | 5112680 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 2101114 |
End bp | 2103153 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640492128 |
Product | dipeptidyl carboxypeptidase II |
Protein accession | YP_001176667 |
Protein GI | 146311593 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.244675 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGTTG CCAATCCGTT TTTTGAAGTC AGCTTGTTGC CTTACCGGGC GCCTCGCTTT GATATTATTG AGGACAGCCA TTATCGCCCG GCGTTTGATG TGGCGACGCG CCAGAAGCGG GCGGAAATCG CCGCCATTAT TGCAGACACG GCTGCGCCCG ATTTCACCAA TACCGTGCTG GCCCTGGAGA AAAGCGGCGT CATGCTTTCC CGCGTCAGCA GCGTATTTTT CGCCATGACG TCATCCCATA CCAACGATTA TCTTCAGGAA CTCGATGAGG CGTTCTCTAC TGAACTGGCG GGGTTATCCA ATGATATTTG GCTGAATGAC GCGCTGTTTT CTCGCGTCGA GGCCGTCTGG CAAGAGCGGG AATCGCTGGA TGGCGAGTCG CGTCGCTTGG TCGAACAGAC GTATCAGCAT TTTGTCCTGG CGGGTGCAAC GCTCAGTGAA GCGCAAAAAT TGGAGCTAAA AGCGCTCAAT ACCGAGTCAG CGTCGTTGAC CAGCCAGTTT AATCAACGTC TATTGGCAGC GGATAAAGCC GGGGGGCTGG TGGTGGATGA TGTTCATCAG CTCGATGGAT TATCGCCCGA TGAAATCGCC TCTGCTGCGC AGGCCGCCAC TGACAAAGGG CTGGCCGATC GCTGGCTGAT TCCCCTGCTG AACACCACTC AACAGCCCGC GCTGGCAGCG TTGCGCGATC GACAAACGCG CGAAAATCTG TTTATGGCCG GTTGGTTACG CACCCAAAAA GGTGATGAGC ACGATACGCA GCACATCGTT CGTCGGCTGG TGGCGTTACG CGCGCGGCAG GCACAACTGC TTGGCTTTGA CAATTACGCC AGCTGGAGCA CCGCCGATCA GATGGCGAAA ACCCCGGAGG CAGCGCTGGC ATTTATGCGC GGAATCGTTC CGGCAGCACG CGCTCGTGCT GAGCGGGAAC AGGCGGATAT CCAGACGGTA ATCGACGACC AGCAGGGCGG ATTCAGCGTG CAGGCATGGG ACTGGGCCTT TTACGCCGAG CGGGTGCGTC TGGGGAAATA CGCGCTGGAT GAATCGCAAA TCAAACCGTA CTTAGCACTT AACAGAGCGC TGGAAGATGG TGTGTTCTGG GCGGCCAGCC AGCTTTTTGG CATCCGTTTT GTCGAGCGAT TTGATATTCC CGTCTACCAC CCAGATGTCC GCGTGTGGGA GATATTCGAT CATAATGGCG AAGGTATGGC GCTGTTTTAC GGCGATTTCT TCGCGCGGGA TTCCAAAGGT GGCGGGGCGT GGATGGGCAA TTTCGTTGAG CAATCGCACG AGTTTGCCGC ACGCCCGGTG ATTTACAACG TCTGTAATTA TCAAAAACCG GCCAACGGTC AGACGGCGCT GCTCTCCTGG GACGACGTCA TCACGCTGTT CCATGAATTT GGCCATACCC TGCACGGCCT GTTTGCCAAT CAACGTTTTG CCACGTTATC CGGGACCAAT ACGCCGCGCG ATTTCGTCGA ATTCCCGTCG CAAATCAATG AGCATTGGGC CAGCCATCCG CAGGTTTTTG CCCGTTTTGC CCGGCACTAT CAGACAGGCG AACCGATGCC AGATGCCCTG CGCGAAAAAA TGCTCAATGC CACGCAGTTC AACAAAGGTT ACGACATGAC CGAACTGCTT AGCGCGGCGC TACTGGATAT GAACTGGCAC GCGATTGATG TGCAGGAAAA CGTAGAAGAT CTCGACACCT TCGAATCTGC CGCGCTGAAA AAAGAGGGTC TGGATCTGCC TGCCGTACCA CCGCGCTATC GCAGCAGTTA TTTTGCCCAT ATCTTCGGCG GTGGATACGC GGCGGGGTAT TACGCCTATT TGTGGACGCA AATGCTGGCC GACGACGGCT ATCAGTGGTT TGAAGAGCAC GGCGGATTGA CGCGCGAGAA CGGACAGAAA TTCCGTGAAG CCATTTTATC GCGCGGGAAC AGCACGGATT TAGCTGAACT TTATCGTGAT TGGCGTGGAC ACGATCCAAA GCTTGAACCG ATGCTGGTGA ATCGTGGCTT GAACGGATAA
|
Protein sequence | MSVANPFFEV SLLPYRAPRF DIIEDSHYRP AFDVATRQKR AEIAAIIADT AAPDFTNTVL ALEKSGVMLS RVSSVFFAMT SSHTNDYLQE LDEAFSTELA GLSNDIWLND ALFSRVEAVW QERESLDGES RRLVEQTYQH FVLAGATLSE AQKLELKALN TESASLTSQF NQRLLAADKA GGLVVDDVHQ LDGLSPDEIA SAAQAATDKG LADRWLIPLL NTTQQPALAA LRDRQTRENL FMAGWLRTQK GDEHDTQHIV RRLVALRARQ AQLLGFDNYA SWSTADQMAK TPEAALAFMR GIVPAARARA EREQADIQTV IDDQQGGFSV QAWDWAFYAE RVRLGKYALD ESQIKPYLAL NRALEDGVFW AASQLFGIRF VERFDIPVYH PDVRVWEIFD HNGEGMALFY GDFFARDSKG GGAWMGNFVE QSHEFAARPV IYNVCNYQKP ANGQTALLSW DDVITLFHEF GHTLHGLFAN QRFATLSGTN TPRDFVEFPS QINEHWASHP QVFARFARHY QTGEPMPDAL REKMLNATQF NKGYDMTELL SAALLDMNWH AIDVQENVED LDTFESAALK KEGLDLPAVP PRYRSSYFAH IFGGGYAAGY YAYLWTQMLA DDGYQWFEEH GGLTRENGQK FREAILSRGN STDLAELYRD WRGHDPKLEP MLVNRGLNG
|
| |