Gene Acry_1042 details

Gene Information

Locus tag: Acry_1042
Symbol:
ID: 5160427
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 1160734
End bp: 1161684
Gene Length: 951 bp
Protein Length: 316 aa
Translation table: 11
GC content: 71%
IMG OID: 640552960
Product: proline iminopeptidase
Protein accession: YP_001234177
Protein GI: 148260050
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
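
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable and function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1160734
end_bp = 1161684
gene_length_bp = 951
protein_length_aa = 316

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1


def record_is_consistent():
    """Return True if the coordinates, gene length and protein length agree."""
    return (
        span_bp == gene_length_bp
        and gene_length_bp % 3 == 0
        and expected_protein_length == protein_length_aa
    )


print(span_bp, codon_count, expected_protein_length, record_is_consistent())
```

Under these assumptions the span is 951 bp, which divides into 317 codons, giving 316 residues once the stop codon is excluded.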


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCGCGGG GTGACCTGTT CCCCGAGGTC GGCCCGTATC AGACGGGGTA TCTGCCGGTC 
GGCGACGGGC ATGTGATCTA CTGGGAGCAG GTGGGCAACC CGCGCGGGCG GCCGGTGCTG
TTCCTGCATG GTGGGCCGGG CGCCGGCGCG GGCGCGGTGC ACCGGCGCTT CTTCGACCCG
GCATTCTGGC GCGTGGTGAT CTTCGACCAG CGCGGCGCCG GGCGCTCGAC GCCGCTGGGC
AGCCTCGCGC GCAACACGAC GCCGGCGCTG ATCGAGGATA TCGAGGCGCT GCGCGAGCAT
CTCGGCATCA GGCAGTTCCT GCTGTTCGGC GGTTCCTGGG GATCGACCCT CGCGCTGGCC
TATGCCCAGG CGCATCCCGA GCGGGTGATG GGCATGGTGC TGCGCGGCAT CTTCCTCGGC
CGGCCGAGCG AGGTGGAATG GTTCCTCGAA GGAATCGCCC GCTTCTTCCC CGATGCGCAC
GCGGCGCTGG TGAACTTCCT GCCCGAGGCG GAGCGGGGCG ATCTGCTGGG GAGCTATTTC
CGCCGGCTCT GCGACCCCGA TCCGGCCATT CACCTGCCGG CGGCGCAGGC CTGGTCGGTC
TATGAGGGAT CGTGCTCGAC GCTGCTGCCG AGCTACGAGA CGGTGAGCGC CTTCGCGCAG
GACCGCACCT CGCTCGGGCT CGCGCGGATC GAGGCGTATT ACTTCCTGAA CAACCTGTTC
CTGCCGCCGG ACGGGCTGCT GGCCGGGATG GGACGGCTGG CCGGGGTGCC GGGCGAGATC
GTGCAGGGGC GATACGACAT GATCTGCCCG CCGAATTCCG CCTTCGACCT CGCCGACGCC
TGGCCCGCCG CGCGGCTGAC GGTGGTGCCG GATGCCGGGC ACTCGGCGCT GGAGCCGGGC
ATTCGCGCGG CCCTGCTGGC CGGGCTGGAG CGGATCCGCA ACCTGACCTG A
 
Protein sequence
MPRGDLFPEV GPYQTGYLPV GDGHVIYWEQ VGNPRGRPVL FLHGGPGAGA GAVHRRFFDP 
AFWRVVIFDQ RGAGRSTPLG SLARNTTPAL IEDIEALREH LGIRQFLLFG GSWGSTLALA
YAQAHPERVM GMVLRGIFLG RPSEVEWFLE GIARFFPDAH AALVNFLPEA ERGDLLGSYF
RRLCDPDPAI HLPAAQAWSV YEGSCSTLLP SYETVSAFAQ DRTSLGLARI EAYYFLNNLF
LPPDGLLAGM GRLAGVPGEI VQGRYDMICP PNSAFDLADA WPAARLTVVP DAGHSALEPG
IRAALLAGLE RIRNLT
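
The relationship between the gene sequence, the GC content field and the protein sequence can be explored with a short script. This is a hedged sketch rather than the method the database used: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares for every codon (the two tables differ only in which start codons are permitted). A stop codon is rendered as `*`.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1; trailing partial codons are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# The first two 10-base blocks and the last six bases of the gene sequence above.
print(translate("ATGCCGCGGG GTGACCTGTT"))
print(translate("ACCTGA"))
```

The first call yields `MPRGDL`, matching the start of the protein sequence, and the second yields `T*`, matching its final residue followed by the stop codon. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.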