Gene Ent638_1940 details

Gene Information

Locus tag: Ent638_1940
Symbol:
ID: 5112680
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 2101114
End bp: 2103153
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 56%
IMG OID: 640492128
Product: dipeptidyl carboxypeptidase II
Protein accession: YP_001176667
Protein GI: 146311593
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
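
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2101114
end_bp = 2103153
gene_length_bp = 2040
protein_length_aa = 679

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions the three fields agree: the span is 2040 bp, which is 680 codons, giving 679 residues plus one stop codon.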


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.244675
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGTCCGTTG CCAATCCGTT TTTTGAAGTC AGCTTGTTGC CTTACCGGGC GCCTCGCTTT 
GATATTATTG AGGACAGCCA TTATCGCCCG GCGTTTGATG TGGCGACGCG CCAGAAGCGG
GCGGAAATCG CCGCCATTAT TGCAGACACG GCTGCGCCCG ATTTCACCAA TACCGTGCTG
GCCCTGGAGA AAAGCGGCGT CATGCTTTCC CGCGTCAGCA GCGTATTTTT CGCCATGACG
TCATCCCATA CCAACGATTA TCTTCAGGAA CTCGATGAGG CGTTCTCTAC TGAACTGGCG
GGGTTATCCA ATGATATTTG GCTGAATGAC GCGCTGTTTT CTCGCGTCGA GGCCGTCTGG
CAAGAGCGGG AATCGCTGGA TGGCGAGTCG CGTCGCTTGG TCGAACAGAC GTATCAGCAT
TTTGTCCTGG CGGGTGCAAC GCTCAGTGAA GCGCAAAAAT TGGAGCTAAA AGCGCTCAAT
ACCGAGTCAG CGTCGTTGAC CAGCCAGTTT AATCAACGTC TATTGGCAGC GGATAAAGCC
GGGGGGCTGG TGGTGGATGA TGTTCATCAG CTCGATGGAT TATCGCCCGA TGAAATCGCC
TCTGCTGCGC AGGCCGCCAC TGACAAAGGG CTGGCCGATC GCTGGCTGAT TCCCCTGCTG
AACACCACTC AACAGCCCGC GCTGGCAGCG TTGCGCGATC GACAAACGCG CGAAAATCTG
TTTATGGCCG GTTGGTTACG CACCCAAAAA GGTGATGAGC ACGATACGCA GCACATCGTT
CGTCGGCTGG TGGCGTTACG CGCGCGGCAG GCACAACTGC TTGGCTTTGA CAATTACGCC
AGCTGGAGCA CCGCCGATCA GATGGCGAAA ACCCCGGAGG CAGCGCTGGC ATTTATGCGC
GGAATCGTTC CGGCAGCACG CGCTCGTGCT GAGCGGGAAC AGGCGGATAT CCAGACGGTA
ATCGACGACC AGCAGGGCGG ATTCAGCGTG CAGGCATGGG ACTGGGCCTT TTACGCCGAG
CGGGTGCGTC TGGGGAAATA CGCGCTGGAT GAATCGCAAA TCAAACCGTA CTTAGCACTT
AACAGAGCGC TGGAAGATGG TGTGTTCTGG GCGGCCAGCC AGCTTTTTGG CATCCGTTTT
GTCGAGCGAT TTGATATTCC CGTCTACCAC CCAGATGTCC GCGTGTGGGA GATATTCGAT
CATAATGGCG AAGGTATGGC GCTGTTTTAC GGCGATTTCT TCGCGCGGGA TTCCAAAGGT
GGCGGGGCGT GGATGGGCAA TTTCGTTGAG CAATCGCACG AGTTTGCCGC ACGCCCGGTG
ATTTACAACG TCTGTAATTA TCAAAAACCG GCCAACGGTC AGACGGCGCT GCTCTCCTGG
GACGACGTCA TCACGCTGTT CCATGAATTT GGCCATACCC TGCACGGCCT GTTTGCCAAT
CAACGTTTTG CCACGTTATC CGGGACCAAT ACGCCGCGCG ATTTCGTCGA ATTCCCGTCG
CAAATCAATG AGCATTGGGC CAGCCATCCG CAGGTTTTTG CCCGTTTTGC CCGGCACTAT
CAGACAGGCG AACCGATGCC AGATGCCCTG CGCGAAAAAA TGCTCAATGC CACGCAGTTC
AACAAAGGTT ACGACATGAC CGAACTGCTT AGCGCGGCGC TACTGGATAT GAACTGGCAC
GCGATTGATG TGCAGGAAAA CGTAGAAGAT CTCGACACCT TCGAATCTGC CGCGCTGAAA
AAAGAGGGTC TGGATCTGCC TGCCGTACCA CCGCGCTATC GCAGCAGTTA TTTTGCCCAT
ATCTTCGGCG GTGGATACGC GGCGGGGTAT TACGCCTATT TGTGGACGCA AATGCTGGCC
GACGACGGCT ATCAGTGGTT TGAAGAGCAC GGCGGATTGA CGCGCGAGAA CGGACAGAAA
TTCCGTGAAG CCATTTTATC GCGCGGGAAC AGCACGGATT TAGCTGAACT TTATCGTGAT
TGGCGTGGAC ACGATCCAAA GCTTGAACCG ATGCTGGTGA ATCGTGGCTT GAACGGATAA
 
Protein sequence
MSVANPFFEV SLLPYRAPRF DIIEDSHYRP AFDVATRQKR AEIAAIIADT AAPDFTNTVL 
ALEKSGVMLS RVSSVFFAMT SSHTNDYLQE LDEAFSTELA GLSNDIWLND ALFSRVEAVW
QERESLDGES RRLVEQTYQH FVLAGATLSE AQKLELKALN TESASLTSQF NQRLLAADKA
GGLVVDDVHQ LDGLSPDEIA SAAQAATDKG LADRWLIPLL NTTQQPALAA LRDRQTRENL
FMAGWLRTQK GDEHDTQHIV RRLVALRARQ AQLLGFDNYA SWSTADQMAK TPEAALAFMR
GIVPAARARA EREQADIQTV IDDQQGGFSV QAWDWAFYAE RVRLGKYALD ESQIKPYLAL
NRALEDGVFW AASQLFGIRF VERFDIPVYH PDVRVWEIFD HNGEGMALFY GDFFARDSKG
GGAWMGNFVE QSHEFAARPV IYNVCNYQKP ANGQTALLSW DDVITLFHEF GHTLHGLFAN
QRFATLSGTN TPRDFVEFPS QINEHWASHP QVFARFARHY QTGEPMPDAL REKMLNATQF
NKGYDMTELL SAALLDMNWH AIDVQENVED LDTFESAALK KEGLDLPAVP PRYRSSYFAH
IFGGGYAAGY YAYLWTQMLA DDGYQWFEEH GGLTRENGQK FREAILSRGN STDLAELYRD
WRGHDPKLEP MLVNRGLNG
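
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs only in which codons may act as start codons). The helper names `clean` and `translate` are hypothetical, and only the first row of the gene sequence is used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
# Assumption: table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in tens."""
    return "".join(seq_block.split()).upper()


def translate(seq_block: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    dna = clean(seq_block)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence shown above.
first_row = "ATGTCCGTTG CCAATCCGTT TTTTGAAGTC AGCTTGTTGC CTTACCGGGC GCCTCGCTTT"
print(translate(first_row))  # MSVANPFFEVSLLPYRAPRF
```

The output corresponds to the first two blocks of the protein sequence (MSVANPFFEV SLLPYRAPRF). The same function applied to the full gene sequence would stop at the terminal TAA codon.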