Gene Ent638_1983 details

Gene Information

Locus tag: Ent638_1983
Symbol:
ID: 5113399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 2155943
End bp: 2157907
Gene Length: 1965 bp
Protein Length: 654 aa
Translation table: 11
GC content: 54%
IMG OID: 640492171
Product: peptidase U32
Protein accession: YP_001176710
Protein GI: 146311636
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0826] Collagenase and related proteases
TIGRFAM ID:
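
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function name `cds_lengths` is hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute an amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(2155943, 2157907))  # (1965, 654)
```

Under those assumptions the result matches the reported 1965 bp and 654 aa.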


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.156367
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCTGC AATCCCATCA TCTTGAACTT TTAAGCCCGG CTCGCGACGC CTCCATCGCC 
CGTGAAGCGA TTCTTCACGG TGCGGACGCG GTCTATATCG GCGGCCCTGG CTTTGGCGCT
CGCCATAACG CCAGCAACAG CCTGCAGGAT ATTGCGGAGC TGGTGCCGTT TGCCCACCGT
TTTGGTGCAA AAGTGTTCGT GACCCTGAAC ACCATTCTTC ATGATGATGA GCTTGAACCC
GCCCAACGAC TGATTACCGA CCTGTATCAG ACTGGAGTCG ATGCGCTGAT CGTTCAGGAC
ATGGGCGTGC TCGAGCTGGA TATTCCGCCG ATTGAACTGC ATGCCAGTAC CCAGTGCGAT
ATTCGTACCG TTGAAAAAGC GAAGTTTCTG TCTGACGTAG GCTTTACCCA GATCGTTCTG
GCGCGCGAGC TGAATCTGAA TCAAATCCGC GACATTCACC AGGCCACTGA CGCCAACATC
GAATTCTTCA TTCACGGCGC GCTGTGTGTG GCGTATTCCG GCCAGTGCAA TATTTCCCAT
GCGCAGACCG GGCGCAGCGC CAACCGTGGC GATTGCTCGC AAGCGTGTCG TTTGCCTTAC
ACGCTGAAAG ACGATCAGGG CCGCGTCGTG GCGTTCGAAA AACATCTGCT GTCGATGAAA
GACAATGATC AGACGGCAAA CCTGGCGGCG CTCATCGACG CTGGCGTGCG CTCCTTCAAA
ATTGAAGGGC GCTACAAAGA CATGAGCTAC GTGAAGAACA TCACCGCGCA TTATCGCCAG
ATGCTTGACG CCATTATTGA AGATCGTGGC GACCTGGCGC GCTCGTCTGC TGGCCGCACC
GAGCATTTCT TCATTCCGTC GACGGATAAA ACGTTCCACC GCGGCAGCAC GGATTACTTT
GTGAATGCGC GTAAAGACGA TATCGGTGCG TTTGATTCGC CGAAATTTAT CGGCCTGCCG
GTGGGTGAAG TGTTAAAAGT ATCCAAAGAT TATCTGGACG TAAAAGTGAC CGAAACGCTG
GCTAACGGTG ACGGGCTGAA CGTGATGATC AAACGCGAAA TCGTCGGTTT CCGCGCCAAT
ACCGTCGAGA AAACGGGTGA GAATCAGTAT CGCGTCTGGC CGAACGAAAT GCCTGCGGAT
CTGTACAAAG CCCGCCCGAA TGCTGCGCTT AACCGTAACC TCGACCATAA CTGGCAGCAG
GCGCTGTTGA AAACCTCCAG TGAACGTCGT ATTGCGGTGG ATATGGAGCT GGGTGGTTGG
GAAGAACAGC TGATCCTGAC CATGACCAGT GAAGATGGCG TGAGCGTGAC CCATACCCTG
GACGGTCAGT TTGAGGTGGC GAATAACGCA GAGAAGGCGA TGAACAGCCT GAAAGACGGC
GTGGCGAAGC TGGGACAAAC GATCTATTAC GCCCGCGACA TTACGCTAAC GCTGCCGGAC
GCACTGTTCG TGCCGAACAG TCAGCTTAAC CAGTTCCGCC GCGAAACCGC AGAAATGCTT
GATGAGGCGC GCTTGGCCAA TTACCCGCGC GGGAGCCGCA AAGCGGTGTC TGTCCCTGCG
CCGGTTTATC CGGATTCTCA TTTGTCATTC CTGGCGAACG TGTACAACCA CAAAGCACGC
GAGTTTTATC ATCGTTACGG CGTGCAATTA ATTGATGCAG CTTATGAGGC GCACGAAGAG
AAGGGCGATG TGCCGGTGAT GATCACCAAG CACTGTTTGC GCTTCGCCTT TAACCTGTGC
CCGAAACAGG CGAAGGGCAA CATCAAAAGC TGGAAGGCCA CACCTATGCA GTTGGTGAAT
GGTGATGAAG TGTTAACGTT GAAATTTGAC TGCCGTCCCT GCGAAATGCA CGTGATTGGC
AAAATGAAAA ATCACATCTT CAAAATGCCA CAACCGGGAA GCGTTGTGGC CTCTGTTAGC
CCCGAAGATC TGATGAAAAC CCTGCCGAAG CGCAAGGGCG TTTAA
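
The gene sequence is printed in blocks of ten bases, so it needs to be cleaned before any computation. As a rough sketch (the helper names `clean_sequence` and `gc_percent` are hypothetical, and this is not necessarily how the database derived its 54% figure), GC content could be recomputed like this:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as a short example;
# paste the full sequence to compare with the reported GC content.
first_line = "ATGCGCCTGC AATCCCATCA TCTTGAACTT TTAAGCCCGG CTCGCGACGC CTCCATCGCC"
print(round(gc_percent(clean_sequence(first_line)), 1))
```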
 
Protein sequence
MRLQSHHLEL LSPARDASIA REAILHGADA VYIGGPGFGA RHNASNSLQD IAELVPFAHR 
FGAKVFVTLN TILHDDELEP AQRLITDLYQ TGVDALIVQD MGVLELDIPP IELHASTQCD
IRTVEKAKFL SDVGFTQIVL ARELNLNQIR DIHQATDANI EFFIHGALCV AYSGQCNISH
AQTGRSANRG DCSQACRLPY TLKDDQGRVV AFEKHLLSMK DNDQTANLAA LIDAGVRSFK
IEGRYKDMSY VKNITAHYRQ MLDAIIEDRG DLARSSAGRT EHFFIPSTDK TFHRGSTDYF
VNARKDDIGA FDSPKFIGLP VGEVLKVSKD YLDVKVTETL ANGDGLNVMI KREIVGFRAN
TVEKTGENQY RVWPNEMPAD LYKARPNAAL NRNLDHNWQQ ALLKTSSERR IAVDMELGGW
EEQLILTMTS EDGVSVTHTL DGQFEVANNA EKAMNSLKDG VAKLGQTIYY ARDITLTLPD
ALFVPNSQLN QFRRETAEML DEARLANYPR GSRKAVSVPA PVYPDSHLSF LANVYNHKAR
EFYHRYGVQL IDAAYEAHEE KGDVPVMITK HCLRFAFNLC PKQAKGNIKS WKATPMQLVN
GDEVLTLKFD CRPCEMHVIG KMKNHIFKMP QPGSVVASVS PEDLMKTLPK RKGV
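
The protein sequence can be cross-checked against the gene sequence by translating it. The sketch below is a minimal, dependency-free illustration: the `translate` function and `CODON_TABLE` name are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order (same assignments as the
# standard code, which table 11 shares for elongation codons).
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon.

    Whitespace is ignored, so block-formatted sequences can be pasted in.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCGCCTGC AATCCCATCA TCTTGAACTT"))  # MRLQSHHLEL
```

The output matches the first ten residues of the protein sequence; translating the full gene sequence the same way allows the whole record to be compared.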