Gene Ent638_2698 details


Gene Information

Locus tag: Ent638_2698
Symbol:
ID: 5114559
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 2910149
End bp: 2911510
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 55%
IMG OID: 640492886
Product: peptidase U32
Protein accession: YP_001177415
Protein GI: 146312341
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
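
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The sketch below checks that they agree, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS, minus one for the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 2910149, 2911510

length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1362
print(protein_length_aa(length))  # 453
```

Both results match the Gene Length (1362 bp) and Protein Length (453 aa) fields in the record.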


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAAAC CAGAACTCCT TTCCCCGGCG GGAACGCTGC AGAATATGCG TTACGCTTTC 
GCCTATGGCG CAGATGCCGT GTATGCGGGC CAGCCGCGTT ACTCACTGCG CGTGCGTAAT
AACGAATTCA ACCACGAGAA TTTACAGCTC GGCATCAACG AAGCGCATGC TCTGGGCAAA
AAATTCTACG TCGTGGTGAA CATTGCGCCG CACAATGCCA AGCTGAAAAC GTTTATTCGC
GATCTCAAGC CGGTCGTGGA AATGGGTCCT GACGCGCTGA TTATGTCGGA TCCGGGTTTA
ATCATGTTGG TGCGCGAGAA CTTCCCGGAA ATGGATATTC ACTTATCCGT GCAGGCGAAC
GCCGTGAACT GGGCGACGGT GAAATTCTGG AAACAAATGG GCCTGACCCG CGTGATCCTG
TCGCGTGAGC TGTCGCTCGA AGAGATCGAA GAAATCCGTA CTCAAGTCCC GGAAATGGAG
CTAGAAATCT TCGTTCATGG CGCGCTGTGC ATGGCCTATT CTGGCCGCTG CCTGCTGTCG
GGTTACATCA ACAAACGAGA TCCGAATCAG GGCACCTGCA CCAACGCCTG TCGCTGGGAA
TATAACGTCC AGGAAGGCAA AGAAGATGAT GTAGGTAATA TCGTGCATCA GCACGAACCG
ATTCCGGTGA AAAACGTTGT GCCAACGCTG GGCGTGGGCG CGCCGATCGA CAGCGTGTTT
ATGATTGAAG AAGCCAAGCG TCCGGGCGAG TACATGACGG CGTTCGAAGA CGAGCACGGC
ACCTACATCA TGAACTCCAA AGATCTGCGG GCGATTGCGC ACGTTGAGCG TCTGACCCAG
ATGGGCGTGC ATTCGTTGAA AATCGAGGGC CGCACCAAAT CGTATTACTA CTGCGCACGC
ACCGCGCAGG TGTATCGCAA AGCCATCGAC GACGCTGCAG CCGGTAAACC GTTTGACACC
AGCCTGCTGG AAACGCTGGA AGGCCTGGCG CATCGCGGCT ATACAGAAGG TTTCCTGCGT
CGCCACACGC ACGACGATTA CCAGAACTAC GAGCACGGCT ATTCCGTGTC CGAGCGCCAG
CAGTTTGTCG GTGATTTTAC CGGCGCGCGT AAAGGCCATT TAGCCGCCGT CGCGGTGAAA
AACAAATTCA CTATGGGCGA TAGCCTTGAG CTGATGACCC CGCAGGGGAA CATCAACTTT
ACGCTGGAAC ACATGGAAAA CGGCAAAGGC GAGACCATCA CGGTGGCGCC TGGTGACGGG
CACACCGTGT GGTTGCCAGT ACCGGAAGAA GTGGAGCTGG AATATGCGCT GCTGATGCGC
AATTTCGCGG GTGAAAGCAC CCGTAATCCG CACAACAAAT AG
 
Protein sequence
MFKPELLSPA GTLQNMRYAF AYGADAVYAG QPRYSLRVRN NEFNHENLQL GINEAHALGK 
KFYVVVNIAP HNAKLKTFIR DLKPVVEMGP DALIMSDPGL IMLVRENFPE MDIHLSVQAN
AVNWATVKFW KQMGLTRVIL SRELSLEEIE EIRTQVPEME LEIFVHGALC MAYSGRCLLS
GYINKRDPNQ GTCTNACRWE YNVQEGKEDD VGNIVHQHEP IPVKNVVPTL GVGAPIDSVF
MIEEAKRPGE YMTAFEDEHG TYIMNSKDLR AIAHVERLTQ MGVHSLKIEG RTKSYYYCAR
TAQVYRKAID DAAAGKPFDT SLLETLEGLA HRGYTEGFLR RHTHDDYQNY EHGYSVSERQ
QFVGDFTGAR KGHLAAVAVK NKFTMGDSLE LMTPQGNINF TLEHMENGKG ETITVAPGDG
HTVWLPVPEE VELEYALLMR NFAGESTRNP HNK
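
The gene and protein sequences above are printed in space-separated blocks of ten. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate the nucleotide sequence. It uses only the first 60 bases of the gene as input, so the printed GC value describes that fragment and not the whole gene. The helper names are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11's alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Standard codon-to-amino-acid assignments in TCAG order
# (shared by NCBI translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from the record above.
first_line = "ATGTTTAAAC CAGAACTCCT TTCCCCGGCG GGAACGCTGC AGAATATGCG TTACGCTTTC"
fragment = clean_sequence(first_line)

print(len(fragment))          # 60
print(gc_fraction(fragment))  # 0.5
print(translate(fragment))    # MFKPELLSPAGTLQNMRYAF
```

The translated fragment matches the first twenty residues of the protein sequence in the record. Passing the full gene sequence through the same functions would be the natural way to compare against the listed GC content and protein length.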