Gene Gdia_0594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0594 
Symbol 
ID6973991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp665317 
End bp666759 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content68% 
IMG OID643390125 
Productamidohydrolase 
Protein accessionYP_002275001 
Protein GI209542772 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00251478 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCGAAC AGTGGGGCCG GAAAACGTTC AATAGCGGGA AGGAAGCGGG CATGACCATC 
GCAGGACGGT TCGGCTGGGC GCTGTCGCTG ATGTTATGCG CCGGTGCAGC CAACGTGGCC
GTGGCGGCGG AGGCACCGGC GGTGCTGCTG GAGCACGCGA CGCTGATCGA CGGCACCGGG
GCGACGCCCG TGGCCGACAG CGCGGTCCTG ATCCGGGACG GCCGGATCGC TTCGGTGGGG
CGTGACGGCA CGATCGCCGT GCCGGCGGGC GTGAAGACGG TCGATCTGGT CGGGATGACC
ATCCTGCCGG GCCTGATCTC GGACCACAGC CATACCGGGC TGGTGAAGGG CACGCAGGAC
GACACGGCCA ATTATACGCG CGAGAATATC CTGGCGGCCC TGAAGCAGTA TGAACGCTAC
GGCGTGCTGT CCGTGGTGTC GCTGGGCCTG AACAAATCCC CGCTGTTCGA CCAGTTGCGG
CAGGAACAGC ACGCGGGCCG CAATCCGGGC GCCGACCTGT TCGGCGTGGA CCAGGGGATC
GGCGCGCCGG ACGGCGTGCC GCCACAGGGC ATGTTCCATC TGGGTGCCGA TCAGGTCTAT
CGCCCGACCT CGGTGCCCGA GGCCCGCGCC GCCGTCGATC GCATGGTCGA CGAGGGCACG
GACCTGGTGA AGATCTGGGT GGACGATTTC CGCAACGGCG TGCCCGGCGC CAAGGGATTC
CCCAAGATCG ATCCCGCGAT CTATCGCGCG GTGATCGAAC AGGCCCATGC GCGCGGCAAG
CGCGTCGCGG CGCATATCCA TGACCTGGCC GACGCCAAGG CGCTGGTGGC GGCCGGGGCC
GACATCGTTG CCCACGGCGT GCGCGACCAG CCGGTCGATA CCGATTTCAT CATGCTGATG
GAACAGAAGG GCGCGTGGTA TATCGCCACC CTGGACCTGG ACGAGGCGAA CTACATCTTC
GCCCTGCACC CGGAATGGCT GGACGATCCG TTTCTGTCCG CCGGCCTGAA CCCCGCCCTG
CGGGCCCGGT TCGCCGACCC GGCCTGGCGG GCCAAAATCC TGGCCGCGCC GCTGACCGAA
GCGTCGAAGA GGGCGGTGGC GCTGAACCAG CGCAACCTGA TGACCCTGTA CCGCGCGGGC
ATTCCTATCG GCTTCGGCAC TGATTCCGGG GCATCGGCCA CGCGGATTCC CGGTTTTGCC
GAACATCGCG AACTGAAGCT GATGGTCGCG GCGGGCATGA CGCCGGTCCA GGCCCTGACG
ATCGCAACGG GCCGCGCCGC CGCCCTGATG CAGTGGGACG ACCGGGGTAT CCTGCTGCCG
GGCCGGTGGG CCGACCTGGT CGTCGTGTCC GGCGACCCGG CCCATGACAT CACGGCGGTC
GACAGGATCG CCCAGGTCTG GCATCGCGGC GTACAGACCG AAGGCGCGCT GATTCCACAA
TAA
 
Protein sequence
MAEQWGRKTF NSGKEAGMTI AGRFGWALSL MLCAGAANVA VAAEAPAVLL EHATLIDGTG 
ATPVADSAVL IRDGRIASVG RDGTIAVPAG VKTVDLVGMT ILPGLISDHS HTGLVKGTQD
DTANYTRENI LAALKQYERY GVLSVVSLGL NKSPLFDQLR QEQHAGRNPG ADLFGVDQGI
GAPDGVPPQG MFHLGADQVY RPTSVPEARA AVDRMVDEGT DLVKIWVDDF RNGVPGAKGF
PKIDPAIYRA VIEQAHARGK RVAAHIHDLA DAKALVAAGA DIVAHGVRDQ PVDTDFIMLM
EQKGAWYIAT LDLDEANYIF ALHPEWLDDP FLSAGLNPAL RARFADPAWR AKILAAPLTE
ASKRAVALNQ RNLMTLYRAG IPIGFGTDSG ASATRIPGFA EHRELKLMVA AGMTPVQALT
IATGRAAALM QWDDRGILLP GRWADLVVVS GDPAHDITAV DRIAQVWHRG VQTEGALIPQ