Gene Gdia_0285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0285 
Symbol 
ID6973677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp314367 
End bp315788 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content66% 
IMG OID643389816 
ProductCytochrome-c peroxidase 
Protein accessionYP_002274697 
Protein GI209542468 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.906916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.908563 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCGGTGC GCAAGCTCGT ACTATCAGTG GCGGCCCTGG GCTGCGTCGC CTATGGCGGA 
ACAGTGGGGT ATCTGACCCA TTTCGACCAT GACACCGCGC CGACATTGGG CACGAATTCT
CCAACACTGG CCGATCCGGT CGCCTCGGCG GCCTTCGCCG CGATTCGCGA ATCCCGGTGC
GACTACTGCC ATGCCCGCAA TACCGACCTG CCGTTCTATT TCCATGTGCC GGTGGCGAAC
CAGTTGATGC AGCGCGACGT GGACCAGGGC CTGCGTCATT TCCGGATCGA GCCCGTGCTG
GCCGCGTTCC AGAGCGGCGC CGTTCCGTCC GAGGAACAAC TGGCGCGGAT CGAGGAAGTG
GTGCGCCAGA ACCGCATGCC GCCGACCCTG TACCTGCTGC TGCACTGGCA CGCCCATCTG
TCGCAGGCGC AGCGTGACGC GCTGCTGACC TGGATCGCGG CGGAACGGCG CGCGCATTAC
GCGACCCCGG GGGTCGCCCC ACGCTTCGCG GCCGAACCGG TGCAGCCGGT GCCCGAGACG
CTGCCGGTGG ATGCGGCCAA GGTCGCGCTG GGCCAGCGCC TGTTCTTCGA CAAGCAATTG
TCGGGGGACG GCACCCTCAA TTGCGCCAGT TGCCATGCGC TGGACCATGG CGGCGTGGAC
GGCCGGGTCA CGGCGCTGGG CATCGACAAC CGCCACGGCC CGATCAACGT GCCCACCGTC
TATGACGCCG CGTTCAATCA GAGCCAGTTC TGGAACGGCC GCGCCGCGAC CCTGGCCGAC
CAGGCGGCGG GACCGGTGAT GAACCCGCTG GAAATGGGAT CGCACGACTG GACCGGCGTG
GCCGACAAGC TGAAGCAGGA CCCCACCTAC CTCACCGCGT TCCAGGGCGT CTTCGGCTCG
GACGAGATCA CCAGGGACCG GATCACGGAT GCGATCGCGG AATATGAAAA GACCCTGATC
ACCCCCGACA GCCGCTTCGA CCGCTACCTG AAGGGCGACG ACCAGGCCCT GAACGCGCAG
GAAAAGAACG GCTACGCGCT GTTCAAGAGC GTGGGATGCT CGGGCTGCCA CACCGGCGTC
TCGCTGGGCG GGCAGGCGTT CGAGGCGATG GGCCTGGAGG GCGATTACTT CGCCGCGCGC
GGCGGCACGC TGACCGATGC CGACAAGGGA CGCTATATGG TGACCCATTC GGACGCCGAC
ATGGAACGCT TCAAGGTGCC GAACCTGCGC AACATCGCCC TGACCGCGCC GTATTTCCAT
GACGGCAGCG TCAAGACGCT GGACCAGGCA GTGCGGGAAA TGGCGCGCTA CCAGACGCCC
GATCACGACC TGTCGGACCA CGACGTGGCC GATATCGTGG CCTTCCTCCA GACCCTGACC
GGCACCTACC AGGGCCACCA ACTGGCTGAA ACCACGCACT GA
 
Protein sequence
MSVRKLVLSV AALGCVAYGG TVGYLTHFDH DTAPTLGTNS PTLADPVASA AFAAIRESRC 
DYCHARNTDL PFYFHVPVAN QLMQRDVDQG LRHFRIEPVL AAFQSGAVPS EEQLARIEEV
VRQNRMPPTL YLLLHWHAHL SQAQRDALLT WIAAERRAHY ATPGVAPRFA AEPVQPVPET
LPVDAAKVAL GQRLFFDKQL SGDGTLNCAS CHALDHGGVD GRVTALGIDN RHGPINVPTV
YDAAFNQSQF WNGRAATLAD QAAGPVMNPL EMGSHDWTGV ADKLKQDPTY LTAFQGVFGS
DEITRDRITD AIAEYEKTLI TPDSRFDRYL KGDDQALNAQ EKNGYALFKS VGCSGCHTGV
SLGGQAFEAM GLEGDYFAAR GGTLTDADKG RYMVTHSDAD MERFKVPNLR NIALTAPYFH
DGSVKTLDQA VREMARYQTP DHDLSDHDVA DIVAFLQTLT GTYQGHQLAE TTH