Gene B21_00235 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00235 
SymbolpepD 
ID8113448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp258746 
End bp260203 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content52% 
IMG OID644846525 
Producthypothetical protein 
Protein accessionYP_002998098 
Protein GI251783794 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01893] aminoacyl-histidine dipeptidase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTGAAC TGTCTCAATT ATCTCCACAG CCGCTGTGGG ATATTTTTGC CAAAATCTGT 
TCTATTCCTC ACCCGTCCTA TCATGAAGAG CAACTCGCTG AATACATTGT TGGTTGGGCA
AAAGAGAAAG GTTTCCATGT CGAACGCGAT CAGGTAGGTA ATATCCTGAT TCGTAAACCT
GCCACCGCAG GTATGGAAAA TCGTAAACCG GTCGTCTTGC AGGCCCACCT CGATATGGTG
CCGCAGAAAA ATAACGACAC CGTGCATGAC TTCACGAAAG ATCCTATCCA GCCTTATATT
GATGGCGAAT GGGTTAAAGC GCGCGGCACC ACGCTGGGTG CAGATAACGG CATTGGTATG
GCGTCTGCGC TGGCGGTTCT GGCTGACGAA AACGTGGTTC ACGGCCCGCT GGAAGTGCTG
CTGACCATGA CCGAAGAAGC CGGTATGGAC GGTGCGTTCG GCTTACAGAG CAACTGGTTG
CAGGCTGATA TTCTGATTAA CACCGACTCC GAAGAAGAAG GTGAAATCTA CATGGGTTGC
GCGGGGGGTA TCGACTTCAC CTCCAACCTG CATTTGGATC GTGAAGCGGT TCCAGCTGGC
TTTGAAACCT TCAAGTTAAC CTTAAAAGGT CTAAAAGGCG GTCACTCCGG CGGTGAAATC
CACGTTGGCC TGGGTAATGC CAACAAACTG CTGGTGCGCT TCCTGGCGGG TCATGCGGAA
GAGCTGGACC TGCGCCTTAT CGATTTCAAC GGTGGCACAC TGCGTAACGC CATCCCGCGT
GAAGCCTTTG CGACCATTGC TGTCGCAGCT GATAAAGTCG ACGCCCTGAA ATCTCTGGTG
AATACCTATC AGGACATCCT GAAAAACGAG CTGGCAGAGA AAGAGAAGAA TCTGGCCTTG
TTGCTGGACT CTGTAGCGAA CGATAAAGCT GCTCTGATTG CGAAATCTCG CGATACCTTT
ATTCGTCTGC TGAACGCCAC CCCGAACGGT GTGATCCGCA ATTCCGACGT GGCAAAAGGT
GTGGTCGAAA CCTCCCTGAA CGTCGGTGTG GTGACCATGA CTGACAATAA CGTAGAAATT
CACTGCCTGA TCCGTTCACT GATCGACAGC GGTAAAGACT ACGTGGTGAG CATGCTGGAT
TCGCTGGGTA AACTGGCTGG CGCGAAAACC GAAGCGAAAG GCGCATATCC TGGCTGGCAG
CCGGACGCTA ATTCTCCGGT GATGCATCTG GTACGTGAAA CCTATCAGCG CCTGTTCAAC
AAGACGCCGA ACATCCAGAT TATCCACGCG GGCCTGGAAT GTGGTCTGTT TAAAAAACCG
TATCCGGAAA TGGACATGGT TTCTATCGGG CCAACTATCA CCGGTCCACA CTCTCCGGAT
GAGCAAGTTC ACATCAAAAG CGTAGGTCAT TACTGGACAC TGCTGACTGA ACTGCTGAAA
GAAATTCCGG CGAAGTAA
 
Protein sequence
MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE QLAEYIVGWA KEKGFHVERD QVGNILIRKP 
ATAGMENRKP VVLQAHLDMV PQKNNDTVHD FTKDPIQPYI DGEWVKARGT TLGADNGIGM
ASALAVLADE NVVHGPLEVL LTMTEEAGMD GAFGLQSNWL QADILINTDS EEEGEIYMGC
AGGIDFTSNL HLDREAVPAG FETFKLTLKG LKGGHSGGEI HVGLGNANKL LVRFLAGHAE
ELDLRLIDFN GGTLRNAIPR EAFATIAVAA DKVDALKSLV NTYQDILKNE LAEKEKNLAL
LLDSVANDKA ALIAKSRDTF IRLLNATPNG VIRNSDVAKG VVETSLNVGV VTMTDNNVEI
HCLIRSLIDS GKDYVVSMLD SLGKLAGAKT EAKGAYPGWQ PDANSPVMHL VRETYQRLFN
KTPNIQIIHA GLECGLFKKP YPEMDMVSIG PTITGPHSPD EQVHIKSVGH YWTLLTELLK
EIPAK