Gene Gdia_3047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_3047 
Symbol 
ID6976481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp3336071 
End bp3337540 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content72% 
IMG OID643392555 
ProductFAD linked oxidase domain protein 
Protein accessionYP_002277392 
Protein GI209545163 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.517001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCT GCCCCGCCCC GACCGCCGAC CTGCTGGACC GTCTGGCACA TCTGCTGGGT 
CCCTCGGGCA TCCTGACCGC CCCGTCGGAC ACCGATCCGT ACTGCACGGA CTGGCGGGCG
CTCTATCACG GCCGCACCGC CGCCGTGCTG CGCCCCGCCG ACACCGCCGA ACTGGCCCAG
GCGGTCACGC TGTGCGCCCA GGCGGGGGTG GCGATGGTGC CGCAGGGGGG CAACACAAGC
ATGGTGGGGG GCGCCACCCC GGACGACAGC GGGCGCGAGG TCGTGATCTG CCTGTCGCGC
ATGAACCGCG TGCGCCGGAT CGATCCGCTG GACCACACGA TGGAAGTCGA GGCCGGGGTG
ACGCTGAAGG CCGCGCAGGA CGCCGCGCGC GAGGCCGGGC TGATGCTGCC GCTGTCGATT
TCATCGGAAG GCTCGGCCCA GATCGGCGGC GTGCTGGCCA CCAATGCCGG CGGCAACAAC
ACCGTGCGCT ACGGCAACGC GCGGGAACTG GTCCTGGGGC TGGAGGTCGT GCTGCCGGAC
GGCAGCGTGT TCCACGGGCT GCGCCGCCTG CGCAAGGACA ACACCGGATA CGCCCTGCGC
CAGCTGTTCG TGGGGTCCGA GGGCACGCTG GGCTTCATCA CCGCCGCCAT CCTGCAGCTT
CAGCCGCAGC CGCGCGCGAC CGAGGCCGCC CTGTGCGCGG TCGCCGACGC CAAGGGCGCG
CTGGCGCTGT TCGCGGCCTT CCGCGCGCAG GACCCGGCGC TGATCCAGGC GTTCGAATAC
ATGTCCGGCG CCGGCATGGA CCTGGTCACG CGACTGGTCC CGGGCGCCAG CCTGCCGCTG
GCCGAACCGG CACCGGCCTA CGTGCTGGTC GAACTGGCCA CCCCGCGCGC CAATGCCGAC
CTGCGCGGGG CGCTGGAGGA CGTGCTGGGC GCGGCGCTGG AGGACGGCAC GGTCACCGAT
GCGGTGATCG CCGAGAGCGA GGGCCAGCGC ATGGCCCTGT GGAAGCTGCG CGAGGAACAT
GCCGAGGCCC AGCGCCGCGC CGGCGCCAGC GTGAAGAACG ACGTCTCGGT CCCGGTCTCG
CACGTGCCGG AACTGATCGA CCGGGCCACC GCCGCCTGCG CGGCCCTGAT CCCGGGCATC
CGCCCGGCGC CGTTCGGCCA TATCGGCGAC GGGAACATCC ATTTCAACCT GGTCCAGCCC
GAAGGCATGG ACCCGGCCGC CTTCCTGGCA CTGGACCACC GGATCATGGA TACGGTGGGG
GCCATCGTGC GCGACCTGGA CGGCTCCTTC TCCGCCGAAC ACGGGGTGGG GCGGCTGAAG
CCCTACATGA TGCCGGACTG GCGCGGCGGC GCCGAACTGG CCACGATGCG CCGGATCAAG
GACGCGATCG ACCCGCGCGG CCTGCTGAAC CCCGGCGCCA TATTTCCCCC CGATACGGGG
AATCCGGCCC GCGTGACCCC GGACGCATGA
 
Protein sequence
MTACPAPTAD LLDRLAHLLG PSGILTAPSD TDPYCTDWRA LYHGRTAAVL RPADTAELAQ 
AVTLCAQAGV AMVPQGGNTS MVGGATPDDS GREVVICLSR MNRVRRIDPL DHTMEVEAGV
TLKAAQDAAR EAGLMLPLSI SSEGSAQIGG VLATNAGGNN TVRYGNAREL VLGLEVVLPD
GSVFHGLRRL RKDNTGYALR QLFVGSEGTL GFITAAILQL QPQPRATEAA LCAVADAKGA
LALFAAFRAQ DPALIQAFEY MSGAGMDLVT RLVPGASLPL AEPAPAYVLV ELATPRANAD
LRGALEDVLG AALEDGTVTD AVIAESEGQR MALWKLREEH AEAQRRAGAS VKNDVSVPVS
HVPELIDRAT AACAALIPGI RPAPFGHIGD GNIHFNLVQP EGMDPAAFLA LDHRIMDTVG
AIVRDLDGSF SAEHGVGRLK PYMMPDWRGG AELATMRRIK DAIDPRGLLN PGAIFPPDTG
NPARVTPDA