Gene Gdia_0499 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0499 
Symbol 
ID6973895 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp549675 
End bp550871 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content71% 
IMG OID643390032 
ProductHI0933 family protein 
Protein accessionYP_002274909 
Protein GI209542680 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0205832 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCTTAG CCATGGATTT CGACGTCATC GTCCTGGGGG CCGGCGCGGC GGGGCTGATG 
GCCGCGCTGG CGGCCGGCCA GCGCGGCCGC CGGGTGGCCG TGCTGGACCA CGCCGCCGAA
GCCGGGCGCA AGATCCTGAT TTCCGGCGGC GGACGCTGCA ATTTCACCAA CCTGGACACC
GATGCCGGGC GGTTCCTGTC GGGCAACCCG CATTTCGCCA AGTCGGCGCT GGCCCGGTTC
GGTCCCGCCG ACATCCTGGC GATGATCGAG CGGCATGGAA TTGCCTGGCA CGAAAAGACG
CTGGGGCAGT TGTTCTGCGA CGGATCGGCG CGCGCCGTCG TTGCCATGCT GCTGGCGGAA
TGCGACGCGG GCGGGGTGGA TATCCGCCTG TCGCATCGCA TCACCGATGT CGCGGGGGGC
GACGGCTTCA GCGTTGCGAC CGATCACGGC ACGTTCCGCG CGCCCGCGCT GGTGCTGGCG
ACCGGCGGGC TGTCGATTCC CAAAATGGGG GCCACGGGCC TGTCGCACGC CATCGCCCGG
CAGTTCGGCC TGCGGGTCAC GCCCACCCTG CCGGCGTTGG TCCCGCTGAC GATCGACGCC
GCCACCCCCG GGTGGGTGCC CGACCTGGCC GGGGTGTCGC TGGATGTCGT CGCGCGATGC
GGGGCCACCC GCTTCCGCGA GGCGATGCTG TTTACCCATC GCGGCCTGTC GGGACCGGCG
ATCCTGCAGG TTTCGTCCTA TTGGGATTCG GGGCGCGAGA TCTGCCTGGA CCTGATGCCG
GGGCGTGATG CGGCCGAAGC CCTGCGGGCG GCCAAGCGGC AGCGCCCGCG CGTCGAGGGG
CGGACGATGC TGTCCGGCTT CCTGCCGCAG CGGCTTGCGC AGGCCATGGC GATGCACCAT
CTGCCCGACG GGCCGGTCGG CGACATGCCG GACGCGGTGC TGAACCGGCT GGGTCAGCAT
CTGGGACGCT GGTGCCTGGT GCCCGCGGGA ACCGAGGGCT TCGCCAAGGC CGAGGTCACG
CGGGGCGGCA TCGACACGCG CGACCTGTCG TCGCGCACCA TGGAGGCACG GTCGGTCCCC
GGCCTGTACG CGGTCGGCGA GGCCGTGGAC GTGACGGGCT GGCTGGGCGG CTATAATTTT
CAATGGGCCT GGGCCAGCGG CTACGCGGCG GGACGTGCCA TCGCCGAAGG CGGCTGA
 
Protein sequence
MSLAMDFDVI VLGAGAAGLM AALAAGQRGR RVAVLDHAAE AGRKILISGG GRCNFTNLDT 
DAGRFLSGNP HFAKSALARF GPADILAMIE RHGIAWHEKT LGQLFCDGSA RAVVAMLLAE
CDAGGVDIRL SHRITDVAGG DGFSVATDHG TFRAPALVLA TGGLSIPKMG ATGLSHAIAR
QFGLRVTPTL PALVPLTIDA ATPGWVPDLA GVSLDVVARC GATRFREAML FTHRGLSGPA
ILQVSSYWDS GREICLDLMP GRDAAEALRA AKRQRPRVEG RTMLSGFLPQ RLAQAMAMHH
LPDGPVGDMP DAVLNRLGQH LGRWCLVPAG TEGFAKAEVT RGGIDTRDLS SRTMEARSVP
GLYAVGEAVD VTGWLGGYNF QWAWASGYAA GRAIAEGG