Gene B21_03300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03300 
SymbolprlC 
ID8112561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3504737 
End bp3506779 
Gene Length2043 bp 
Protein Length680 aa 
Translation table11 
GC content55% 
IMG OID644849477 
Producthypothetical protein 
Protein accessionYP_003001050 
Protein GI251786746 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.762273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGAATC CGTTACTGAC TCCCTTTGAA TTGCCTCCGT TTTCTAAAAT TCTCCCGGAA 
CATGTCGTTC CAGCCGTGAC TAAGGCGCTG AACGACTGCC GCGAAAATGT GGAGCGCGTA
GTAGCGCAAG GAGCACCGTA CACCTGGGAA AATCTCTGCC AGCCGTTGGC GGAAGTGGAC
GATGTGCTGG GGCGTATCTT CTCCCCGGTC AGCCACCTGA ACTCGGTGAA AAATAGCCCG
GAACTGCGTG AAGCGTACGA ACAAACCCTG CCGCTGCTGT CGGAATACAG CACCTGGGTA
GGGCAACATG AAGGGCTGTA TAAAGCGTAT CGCGACCTGC GCGATGGCGA TCATTACGCC
ACGCTGAACA CGGCTCAGAA AAAAGCGGTT GATAACGCAC TGCGCGACTT CGAACTCTCT
GGCATCGGTC TGCCAATAGA GAAACAACAA CGCTACGGTG AGATTGCCAC TCGTCTTTCC
GAGCTGGGCA ACCAGTACAG CAACAACGTC CTCGATGCGA CGATGGGCTG GACCAAACTC
GTTACCGACG AAGCGGAGCT GGCGGGGATG CCTGAAAGCG CGCTGGCTGC GGCAAAAGCC
CAGGCCGAAG CGAAAGAGCT GGAAGGCTAC CTGCTGACGC TGGATATCCC AAGCTACCTG
CCGGTAATGA CCTACTGCGA CAACCAGGCC TTGCGTGAAG AGATGTATCG CGCTTACAGC
ACCCGCGCCT CCGATCAAGG CCCGAACGCC GGTAAATGGG ATAACAGCAA GGTGATGGAA
GAGATCCTCG CTCTGCGTCA CGAACTGGCG CAACTGCTGG GCTTTGAAAA CTACGCCTTC
AAATCACTTG CCACTAAAAT GGCAGAAAAC CCGCAGCAGG TGCTGGATTT CTTAACCGAT
CTGGCAAAAC GCGCGCGTCC GCAAGGCGAA AAAGAGCTGG CACAACTGCG CGCCTTCGCC
AAAGCGGAAT TTGGCGTCGA TGAGTTGCAG CCGTGGGATA TCGCGTACTA CAGCGAAAAA
CAAAAACAGC ACCTCTACAG CATCAGTGAC GAACAGCTGC GTCCGTACTT CCCGGAAAAC
AAAGCGGTTA ACGGCCTGTT TGAAGTGGTG AAACGTATTT ACGGCATCAC CGCTAAAGAG
CGTAAAGATG TTGATGTCTG GCACCCGGAT GTACGTTTCT TCGAACTGTA TGACGAGAAC
AACGAACTGC GCGGTAGCTT CTACCTCGAC CTGTATGCCC GTGAAAACAA GCGCGGCGGG
GCGTGGATGG ATGACTGCGT AGGCCAGATG CGCAAAGCTG ACGGTTCTCT GCAAAAACCG
GTCGCGTATC TGACCTGTAA CTTTAACCGC CCGGTAAATG GTAAACCGGC GCTGTTTACC
CATGACGAAG TGATCACCCT GTTCCACGAG TTCGGTCATG GTCTGCATCA TATGCTGACC
CGCATCGAAA CCGCCGGTGT ATCCGGGATC AGTGGGGTGC CGTGGGATGC GGTCGAACTG
CCGAGTCAGT TTATGGAAAA CTGGTGCTGG GAGCCTGAGG CGCTGGCGTT TATCTCTGGT
CACTACGAAA CCGGCGAACC GCTGCCGAAA GAGTTGCTGG ATAAAATGCT GGCGGCGAAG
AACTACCAGG CGGCGCTGTT TATTCTGCGC CAGCTGGAGT TCGGCCTGTT CGATTTCCGC
CTCCATGCCG AGTTCCGCCC GGATCAGGGA GCGAAAATCC TCGAAACTCT GGCAGAAATC
AAGAAACTGG TTGCCGTAGT ACCGTCTCCA TCCTGGGGCC GTTTCCCGCA CGCTTTCAGC
CATATTTTCG CCGGTGGTTA TGCCGCAGGT TACTACAGCT ACCTGTGGGC TGACGTACTG
GCGGCAGATG CTTTCTCGCG CTTTGAGGAA GAGGGCATTT TCAACCGTGA AACCGGGCAG
TCGTTCCTCG ACAACATTCT GAGCCGTGGC GGTTCAGAAG AGCCGATGGA TCTGTTCAAA
CGCTTCCGTG GTCGTGAACC GCAGCTGGAT GCGATGCTGG AGCATTACGG CATTAAGGGC
TGA
 
Protein sequence
MTNPLLTPFE LPPFSKILPE HVVPAVTKAL NDCRENVERV VAQGAPYTWE NLCQPLAEVD 
DVLGRIFSPV SHLNSVKNSP ELREAYEQTL PLLSEYSTWV GQHEGLYKAY RDLRDGDHYA
TLNTAQKKAV DNALRDFELS GIGLPIEKQQ RYGEIATRLS ELGNQYSNNV LDATMGWTKL
VTDEAELAGM PESALAAAKA QAEAKELEGY LLTLDIPSYL PVMTYCDNQA LREEMYRAYS
TRASDQGPNA GKWDNSKVME EILALRHELA QLLGFENYAF KSLATKMAEN PQQVLDFLTD
LAKRARPQGE KELAQLRAFA KAEFGVDELQ PWDIAYYSEK QKQHLYSISD EQLRPYFPEN
KAVNGLFEVV KRIYGITAKE RKDVDVWHPD VRFFELYDEN NELRGSFYLD LYARENKRGG
AWMDDCVGQM RKADGSLQKP VAYLTCNFNR PVNGKPALFT HDEVITLFHE FGHGLHHMLT
RIETAGVSGI SGVPWDAVEL PSQFMENWCW EPEALAFISG HYETGEPLPK ELLDKMLAAK
NYQAALFILR QLEFGLFDFR LHAEFRPDQG AKILETLAEI KKLVAVVPSP SWGRFPHAFS
HIFAGGYAAG YYSYLWADVL AADAFSRFEE EGIFNRETGQ SFLDNILSRG GSEEPMDLFK
RFRGREPQLD AMLEHYGIKG