Gene Gdia_3255 details

Gene Information

Locus tag: Gdia_3255
Symbol:
ID: 6976694
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3566208
End bp: 3567155
Gene Length: 948 bp
Protein Length: 315 aa
Translation table: 11
GC content: 72%
IMG OID: 643392765
Product: proline iminopeptidase
Protein accession: YP_002277597
Protein GI: 209545368
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID: [TIGR01249] proline iminopeptidase, Neisseria-type subfamily
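
The coordinate and length fields of this record are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon excluded from the protein length. The function names are illustrative only.

```python
# Values copied from the gene record.
START_BP = 3566208
END_BP = 3567155


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming one trailing stop codon is not translated."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 948 315
```

Under these assumptions the result agrees with the listed gene length of 948 bp and protein length of 315 aa.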


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.189812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCACGCC ACGACCTGTT CCCGGACATC GCGCCCTACG ACAGCGGCTA CCTGCCTGTC 
GGCGACGGGC ACGAACTCTA TTGGGAACAG GCGGGCAACC CCGCAGGGCG GACGGTGCTG
TTCCTGCATG GCGGTCCGGG CGCGGGCGCG GGCGCGGTCC ATCGCCGCTT CTTCGACCCG
GAACATTGGC GCGTCGTCCT GTTCGACCAG CGGGGGGCCG GGCGGTCGCG GCCGCATGCG
TCGATCGCGG CCAACACGAC ACCGCACCTG GTGCGCGATA TCGAAACGCT GCGCCAGGCG
CTGGGCATCG GGGACTGGCT GCTGTTCGGC GGGTCGTGGG GATCGACGCT GGCGCTGGCC
TACGCCCAGG CGCACCCCGA ACGGGTGCGC GCGATGATCC TGCGCGGGAT CTTCCTGGGC
CGGCCGCGCG AACTGGACTG GTTCTTCCAC GGCCTGGCCC ATGTCTTCCC CGACGCGCAC
GCGGCCTTCC TGTCGCACCT GCCCGAAGCG GAACGGGATG ATCCGCTGGG CGCCTATGGC
CGGCTGCTGT TCGATCCCGA CCCCGCGATC CACCTGCCGG CGGCGCGGGC CTGGTCGGCC
TACGAGGGAA CGTGCTCGAC GCTGATTCCC GCCCCCGCCG CCGTCGCCGG CTTCGCGCAG
GACCGCGCCG TCATCGGCCT GGCGCGGATC GAGGCCCATT ATTTCCGGCA CGGCCTGTTC
CTGCCGCCCG AGGGCCTGCT GGGCGCGATG GAGCGGATCG CGCATATTCC CTGCACCATC
GTCCAGGGCC GGTATGACAT GATCTGCCCC AGCGAGTCCG CCTGGGACCT GTCCCGGCAC
TGGCCGCGCG CCACCCTGGT CATGGTGCCG GATGCCGGGC ACTCGGCCCT GGAACCGGGC
ATCCGCCGCC GGCTGGTCGC GTGCGTCGAG GAGATGCGCG ACGCATGA
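
The GC content field of the record can be recomputed from the gene sequence. A minimal Python sketch follows; the function name `gc_content` is hypothetical, and the code assumes the sequence is pasted with the spaces and line breaks of the listing, which it strips. Only the first row of the sequence is used here for brevity.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and newlines used to group bases in the listing.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence; paste all rows to check the whole gene.
first_row = "ATGCCACGCC ACGACCTGTT CCCGGACATC GCGCCCTACG ACAGCGGCTA CCTGCCTGTC"
print(f"{gc_content(first_row):.1f}% GC")
```

Running the same function over the complete sequence gives a value to compare against the GC content reported in the record.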
 
Protein sequence
MPRHDLFPDI APYDSGYLPV GDGHELYWEQ AGNPAGRTVL FLHGGPGAGA GAVHRRFFDP 
EHWRVVLFDQ RGAGRSRPHA SIAANTTPHL VRDIETLRQA LGIGDWLLFG GSWGSTLALA
YAQAHPERVR AMILRGIFLG RPRELDWFFH GLAHVFPDAH AAFLSHLPEA ERDDPLGAYG
RLLFDPDPAI HLPAARAWSA YEGTCSTLIP APAAVAGFAQ DRAVIGLARI EAHYFRHGLF
LPPEGLLGAM ERIAHIPCTI VQGRYDMICP SESAWDLSRH WPRATLVMVP DAGHSALEPG
IRRRLVACVE EMRDA
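
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below builds a codon table and translates the first row of the gene sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two differ in their permitted start codons), and the helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    # Remove the spaces and newlines used to group bases in the listing.
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence from this record.
first_row = "ATGCCACGCC ACGACCTGTT CCCGGACATC GCGCCCTACG ACAGCGGCTA CCTGCCTGTC"
print(translate(first_row))  # MPRHDLFPDIAPYDSGYLPV
```

The output matches the first 20 residues of the protein sequence listed above.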