Gene ECD_03347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_03347 
SymbolprlC 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp3505526 
End bp3507568 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content55% 
IMG OID 
Productoligopeptidase A 
Protein accessionACT45148 
Protein GI253979478 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.642114 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAATC CGTTACTGAC TCCCTTTGAA TTGCCTCCGT TTTCTAAAAT TCTCCCGGAA 
CATGTCGTTC CAGCCGTGAC TAAGGCGCTG AACGACTGCC GCGAAAATGT GGAGCGCGTA
GTAGCGCAAG GAGCACCGTA CACCTGGGAA AATCTCTGCC AGCCGTTGGC GGAAGTGGAC
GATGTGCTGG GGCGTATCTT CTCCCCGGTC AGCCACCTGA ACTCGGTGAA AAATAGCCCG
GAACTGCGTG AAGCGTACGA ACAAACCCTG CCGCTGCTGT CGGAATACAG CACCTGGGTA
GGGCAACATG AAGGGCTGTA TAAAGCGTAT CGCGACCTGC GCGATGGCGA TCATTACGCC
ACGCTGAACA CGGCTCAGAA AAAAGCGGTT GATAACGCAC TGCGCGACTT CGAACTCTCT
GGCATCGGTC TGCCAATAGA GAAACAACAA CGCTACGGTG AGATTGCCAC TCGTCTTTCC
GAGCTGGGCA ACCAGTACAG CAACAACGTC CTCGATGCGA CGATGGGCTG GACCAAACTC
GTTACCGACG AAGCGGAGCT GGCGGGGATG CCTGAAAGCG CGCTGGCTGC GGCAAAAGCC
CAGGCCGAAG CGAAAGAGCT GGAAGGCTAC CTGCTGACGC TGGATATCCC AAGCTACCTG
CCGGTAATGA CCTACTGCGA CAACCAGGCC TTGCGTGAAG AGATGTATCG CGCTTACAGC
ACCCGCGCCT CCGATCAAGG CCCGAACGCC GGTAAATGGG ATAACAGCAA GGTGATGGAA
GAGATCCTCG CTCTGCGTCA CGAACTGGCG CAACTGCTGG GCTTTGAAAA CTACGCCTTC
AAATCACTTG CCACTAAAAT GGCAGAAAAC CCGCAGCAGG TGCTGGATTT CTTAACCGAT
CTGGCAAAAC GCGCGCGTCC GCAAGGCGAA AAAGAGCTGG CACAACTGCG CGCCTTCGCC
AAAGCGGAAT TTGGCGTCGA TGAGTTGCAG CCGTGGGATA TCGCGTACTA CAGCGAAAAA
CAAAAACAGC ACCTCTACAG CATCAGTGAC GAACAGCTGC GTCCGTACTT CCCGGAAAAC
AAAGCGGTTA ACGGCCTGTT TGAAGTGGTG AAACGTATTT ACGGCATCAC CGCTAAAGAG
CGTAAAGATG TTGATGTCTG GCACCCGGAT GTACGTTTCT TCGAACTGTA TGACGAGAAC
AACGAACTGC GCGGTAGCTT CTACCTCGAC CTGTATGCCC GTGAAAACAA GCGCGGCGGG
GCGTGGATGG ATGACTGCGT AGGCCAGATG CGCAAAGCTG ACGGTTCTCT GCAAAAACCG
GTCGCGTATC TGACCTGTAA CTTTAACCGC CCGGTAAATG GTAAACCGGC GCTGTTTACC
CATGACGAAG TGATCACCCT GTTCCACGAG TTCGGTCATG GTCTGCATCA TATGCTGACC
CGCATCGAAA CCGCCGGTGT ATCCGGGATC AGTGGGGTGC CGTGGGATGC GGTCGAACTG
CCGAGTCAGT TTATGGAAAA CTGGTGCTGG GAGCCTGAGG CGCTGGCGTT TATCTCTGGT
CACTACGAAA CCGGCGAACC GCTGCCGAAA GAGTTGCTGG ATAAAATGCT GGCGGCGAAG
AACTACCAGG CGGCGCTGTT TATTCTGCGC CAGCTGGAGT TCGGCCTGTT CGATTTCCGC
CTCCATGCCG AGTTCCGCCC GGATCAGGGA GCGAAAATCC TCGAAACTCT GGCAGAAATC
AAGAAACTGG TTGCCGTAGT ACCGTCTCCA TCCTGGGGCC GTTTCCCGCA CGCTTTCAGC
CATATTTTCG CCGGTGGTTA TGCCGCAGGT TACTACAGCT ACCTGTGGGC TGACGTACTG
GCGGCAGATG CTTTCTCGCG CTTTGAGGAA GAGGGCATTT TCAACCGTGA AACCGGGCAG
TCGTTCCTCG ACAACATTCT GAGCCGTGGC GGTTCAGAAG AGCCGATGGA TCTGTTCAAA
CGCTTCCGTG GTCGTGAACC GCAGCTGGAT GCGATGCTGG AGCATTACGG CATTAAGGGC
TGA
 
Protein sequence
MTNPLLTPFE LPPFSKILPE HVVPAVTKAL NDCRENVERV VAQGAPYTWE NLCQPLAEVD 
DVLGRIFSPV SHLNSVKNSP ELREAYEQTL PLLSEYSTWV GQHEGLYKAY RDLRDGDHYA
TLNTAQKKAV DNALRDFELS GIGLPIEKQQ RYGEIATRLS ELGNQYSNNV LDATMGWTKL
VTDEAELAGM PESALAAAKA QAEAKELEGY LLTLDIPSYL PVMTYCDNQA LREEMYRAYS
TRASDQGPNA GKWDNSKVME EILALRHELA QLLGFENYAF KSLATKMAEN PQQVLDFLTD
LAKRARPQGE KELAQLRAFA KAEFGVDELQ PWDIAYYSEK QKQHLYSISD EQLRPYFPEN
KAVNGLFEVV KRIYGITAKE RKDVDVWHPD VRFFELYDEN NELRGSFYLD LYARENKRGG
AWMDDCVGQM RKADGSLQKP VAYLTCNFNR PVNGKPALFT HDEVITLFHE FGHGLHHMLT
RIETAGVSGI SGVPWDAVEL PSQFMENWCW EPEALAFISG HYETGEPLPK ELLDKMLAAK
NYQAALFILR QLEFGLFDFR LHAEFRPDQG AKILETLAEI KKLVAVVPSP SWGRFPHAFS
HIFAGGYAAG YYSYLWADVL AADAFSRFEE EGIFNRETGQ SFLDNILSRG GSEEPMDLFK
RFRGREPQLD AMLEHYGIKG