Gene Ent638_3948 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3948 
Symbol 
ID5114668 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp4277635 
End bp4278966 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content52% 
IMG OID640494162 
Productproline dipeptidase 
Protein accessionYP_001178654 
Protein GI146313580 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.34685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.023319 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTCAC TGGCAGCACT TTATAAAAAT CATATTGTTA CGTTGCAAGA ACGTACCCGC 
GACGTACTGA CTCGTTTTAA ACTCGATGCG CTGCTTATCC ACTCCGGTGA GCTGTTGAAT
GTCTTCCTCG ATGACCATGC TTATCCGTTC AAGGTTAACC CACAGTTCAA AGCCTGGGTT
CCGGTAACGC AGGTTCCAAA CTGCTGGTTG CTGGTTGATG GCGTGAACAA ACCGAAACTG
TGGTTCTACT TGCCGGTCGA TTACTGGCAC AACGTTGAAC CGTTGCCGAC GACGTTCTGG
ACGGAAGAAG TGGATGTGAT CGCGCTGCCG AAAGCGGACG GTATTGGCAG CCAGTTACCT
GCTGCACGTG GCAACATCGC CTACATCGGT CCGGTTCCTG AGCGCGCGTT GGGTCTGGAT
ATTCCGGCAG ACAAAATCAA CCCGAAAGGC GTGATCGATT ATCTGCATTA CTACCGCGCT
TATAAGACCG ACTATGAACT GTCCTGCATG CGCGAAGCGC AGAAAACCGC CGTGAATGGT
CATCAGGCGG CGCACGAAGC GTTCCTGTCC GGCATGAGCG AGTTCGATAT TAATCAGGCT
TACCTGACGG CGACGGGTCA TCGTGATACC GATGTGCCTT ACGGGAATAT TGTGGCGCTG
AACGAGCACG CCTCCGTTCT ACACTACACC AAACTGGATC ACCGCGCGCC TTCGGAAATT
CGCAGTTTCC TGCTGGATGC GGGTGCTGAG TACAACGGTT ACGCGGCGGA TCTGACGCGT
ACCTGGGCCG CAAACAGCGA TACCGATTTT GCGCATCTGA TTAAAGACGT GAACGACGAA
CAGCTGGCGC TTATCAGCAC CATGAAAGCG GGCACGAGCT ATGTTGACTA TCATATTCAG
TTCCATCAGC GCATCGCTAA GCTGCTGCGT AAGCATCAGA TTGTGACGGA TATGAGCGAA
GAGGCGATGG TCGAAAACGA TCTCACCGGG CCTTTTATGC CGCACGGTAT TGGTCATCCG
CTGGGTCTGC AGGTTCATGA TGTGGCCGGT TTTATGCAGG ATGATACGGG AACGCATCTG
GCGGCACCGT CTAAATATCC GTACCTGCGT TGCACTCGCG TACTGGAACC GCGCATGGTG
TTGACCATTG AGCCAGGCAT CTACTTTATT GATTCTCTGC TGAATCCATG GCGTGAAGGC
CAGTTCAGCA AGCACTTCAA CTGGCAGAAA ATTGATGCGC TGAAACCGTT TGGTGGCATT
CGTATTGAAG ATAACGTGGT GGTTCACGAG AACAATATCG AAAACATGAC GCGAGATCAG
AAGCTGGCGT GA
 
Protein sequence
MDSLAALYKN HIVTLQERTR DVLTRFKLDA LLIHSGELLN VFLDDHAYPF KVNPQFKAWV 
PVTQVPNCWL LVDGVNKPKL WFYLPVDYWH NVEPLPTTFW TEEVDVIALP KADGIGSQLP
AARGNIAYIG PVPERALGLD IPADKINPKG VIDYLHYYRA YKTDYELSCM REAQKTAVNG
HQAAHEAFLS GMSEFDINQA YLTATGHRDT DVPYGNIVAL NEHASVLHYT KLDHRAPSEI
RSFLLDAGAE YNGYAADLTR TWAANSDTDF AHLIKDVNDE QLALISTMKA GTSYVDYHIQ
FHQRIAKLLR KHQIVTDMSE EAMVENDLTG PFMPHGIGHP LGLQVHDVAG FMQDDTGTHL
AAPSKYPYLR CTRVLEPRMV LTIEPGIYFI DSLLNPWREG QFSKHFNWQK IDALKPFGGI
RIEDNVVVHE NNIENMTRDQ KLA