Gene Ent638_3911 details


Gene Information

Locus tag: Ent638_3911
Symbol:
ID: 5111563
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 4217701
End bp: 4219743
Gene Length: 2043 bp
Protein Length: 680 aa
Translation table: 11
GC content: 56%
IMG OID: 640494120
Product: oligopeptidase A
Protein accession: YP_001178617
Protein GI: 146313543
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
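
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4217701, 4219743)
print(length)                     # 2043
print(protein_length_aa(length))  # 680
```

Both results match the Gene Length and Protein Length values listed in the record.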


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAATC CATTACTGAC GCCTTTCTCG TTACCGCCAT TTTCTAAAAT CCAGCCAGAG 
CATGTGGTTC CAGCCGTCAC CAAAGCGTTG GACGATTGCC GCACGGCGGT AGAAAGCGTG
GTCGCGAAGG GCGGACCGTA CACCTGGGAG AATTTGTGCC AGACGCTGGC CGAAGTGGAT
GACGTGTTGG GCCGCCTTTT TTCACCGGTC AGCCATCTGA ACTCTGTAAA AAACAGCCCG
GAACTGCGCG AAGCCTACGA ACAAACCCTG CCGCTGCTGT CGGAATACAG CACCTGGGTG
GGTCAGCACG AAGGGTTGTA CAACGCCTAT CGCGATCTGC GCGACGGTGA AAATTACGCC
AAACTGGATA TCGCGCAGAA AAAAGCGGTC GATAACGCGC TGCGTGACTT TGAACTGTCC
GGGATTGGCC TGCCGAAAGA GAAACAAACG CGCTACGGCG AAATCGCGGC GCGCCTGTCA
GAGCTGGGCA ATCAGTACAG CAACAACGTG CTTGATGCCA CCATGGGCTG GTCGAAGCTG
GTCACCGACG AATCCGAGCT GGCCGGCATG CCGGAGAGCG CGCTGGCGGC GGCTAAAGCG
CAGGCCGAAG CGAAAGAGCA GGAAGGTTTC CTGTTAACGC TCGATATCCC GAGCTATCTG
CCGGTGATGA CCTATTGCGA CAATCAGGCG CTGCGCGAAG AGCTGTACCG CGCGTACAGC
ACGCGTGCGT CAGACCAGGG CCCAAATGCC GGGAAATGGG ACAACAGCCC GGTGATGGCA
GAAATCCTCG CGCTACGCCA CGAGCTGGCA CAGCTGCTGG GCTTCGACAG CTACGCGGAT
AAATCTCTCG CCACCAAAAT GGCAGAAAAC CCGCAGCAGG TTCTGGAGTT CTTAACCGAT
CTGGCAAAAC GTGCGCGTCC GCAGGGCGAG AAAGAACTGG CACAGTTGCG CGCATTTGCG
AACGCTGAAT TTGGCGTCGA CGACCTGCAA CCGTGGGACA TCGCGTACTA CAGCGAAAAA
CAGAAACAGC ATTTGTACAG CATCAGCGAT GAACAGCTGC GCCCGTACTT CCCAGAAAAC
AAAGCCGTTA ACGGCCTGTT TGAAGTGGTA AAACGCATTT ACGGCATCAC CGCCAAAGAA
CGTACTGATG TGGATGTCTG GCATCCGGAA GTGCGTTTCT TCGAGCTGTA TGACGAGAAA
AACGAACTGC GCGGCAGTTT CTATCTGGAT CTGTATGCGC GAGAAAACAA ACGTGGCGGG
GCTTGGATGG ACGACTGCGT CGGCCAGATG CGTAAAGCCG ACGGCTCGCT GCAAAAACCG
GTTGCTTACC TGACCTGTAA CTTCAACCGT CCGGTCAGCG GTAAGCCTGC GCTGTTCACG
CACGATGAAG TGATCACCCT GTTCCACGAA TTCGGTCACG GTCTGCATCA TATGCTGACG
CGTATTGAAG CGGCGGGCGT AGCGGGTATC AGCGGTGTGC CGTGGGATGC CGTCGAGCTG
CCAAGCCAGT TTATGGAAAA TTGGTGCTGG GAGCCGGACG CGCTGGCGTT TATCTCCGGT
CACTATGAAA CGGGCGAACC CCTGCCAAAA GAGCTGTTGG ATAAAATGCT GGCGGCGAAA
AACTACCAGG CGGCGATGTT TATCCTGCGC CAGTTGGAGT TCGGTCTGTT CGATTTCCGT
CTGCACGCTG AATTCAGCCC AGAGCAGGGG GCGAAAATCC TCGAAACTCT GGCTGAAATT
AAAAAGCAGG TCGCACTGAT TCCTGGTCCA ACGTGGGGTC GATTCCCGCA CGCGTTCAGC
CATATCTTTG CTGGTGGCTA TGCGGCGGGT TACTACAGCT ACCTGTGGGC CGACGTGCTG
GCAGCCGATG CCTACTCGCG CTTTGAAGAG GAAGGTATCT TCAACCGCGA AACCGGTCAG
TCGTTCCTGG ATAACATCCT GACGCGCGGA GGTTCCGAAG AGCCAATGGA GCTGTTCAAA
CGCTTCCGTG GCCGCGAGCC GCAGCTGGAT GCGATGCTTG AGCATTACGG AATCAAAGGC
TAA
 
Protein sequence
MTNPLLTPFS LPPFSKIQPE HVVPAVTKAL DDCRTAVESV VAKGGPYTWE NLCQTLAEVD 
DVLGRLFSPV SHLNSVKNSP ELREAYEQTL PLLSEYSTWV GQHEGLYNAY RDLRDGENYA
KLDIAQKKAV DNALRDFELS GIGLPKEKQT RYGEIAARLS ELGNQYSNNV LDATMGWSKL
VTDESELAGM PESALAAAKA QAEAKEQEGF LLTLDIPSYL PVMTYCDNQA LREELYRAYS
TRASDQGPNA GKWDNSPVMA EILALRHELA QLLGFDSYAD KSLATKMAEN PQQVLEFLTD
LAKRARPQGE KELAQLRAFA NAEFGVDDLQ PWDIAYYSEK QKQHLYSISD EQLRPYFPEN
KAVNGLFEVV KRIYGITAKE RTDVDVWHPE VRFFELYDEK NELRGSFYLD LYARENKRGG
AWMDDCVGQM RKADGSLQKP VAYLTCNFNR PVSGKPALFT HDEVITLFHE FGHGLHHMLT
RIEAAGVAGI SGVPWDAVEL PSQFMENWCW EPDALAFISG HYETGEPLPK ELLDKMLAAK
NYQAAMFILR QLEFGLFDFR LHAEFSPEQG AKILETLAEI KKQVALIPGP TWGRFPHAFS
HIFAGGYAAG YYSYLWADVL AADAYSRFEE EGIFNRETGQ SFLDNILTRG GSEEPMELFK
RFRGREPQLD AMLEHYGIKG
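
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch, the code below builds a codon table from the usual compact TCAG-ordered amino acid string and translates a CDS, dropping a trailing stop codon. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted. This gene starts with ATG, so alternative start codons are not handled here. The function name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS given in coding orientation; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    # Drop the terminal stop codon, if present, so only residues remain.
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("ATGACCAATC CATTACTGAC GCCTTTCTCG"))  # MTNPLLTPFS
```

The printed residues match the first block of the protein sequence listed above. Translating the complete gene sequence the same way should give a product of the listed protein length.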