Gene Gdia_0510 details


Gene Information

Locus tag: Gdia_0510
Symbol:
ID: 6973906
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 562442
End bp: 563587
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 70%
IMG OID: 643390042
Product: hypothetical protein
Protein accession: YP_002274919
Protein GI: 209542690
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2706] 3-carboxymuconate cyclase
TIGRFAM ID:
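
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 562442
end_bp = 563587


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1146 bp) and Protein Length (381 aa).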


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid covering clones (Num covering fosmid clones): 35
Fosmid unclonability p-value: 0.0476754
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTTTG GATTGCTGTC CCTTTTGTGC CTGTCGCTGA TGATCGCCCC GGCGGCACAG 
GCACAGACCG TGCTTTCGGC CAATGATGCC CATTCCATCC TGCGAAACGG CGTGCAGGTC
CTGCCGAAAC CGTTGAAGCC CGATACGCTG TCCGTCCTGG ACCGGGGGGC GGACGGGCTG
TGGACCGTGC GCGCCAGCGT TGCCGTGCCC GTCAGCGTCG TGGGGCCGCC CACCGCTCTG
GTCCTGACGC CGGATGGCCG GACCGCGCTG GTGTCTTCCG CCAGCAAGGC GGACCCGGCC
CGGGACAGGA TCGTCCCGGA TGACCGGATC AGCGTCATCG ACCTGTCCGG CGACAGGCCG
CGCGTGGTCC AGCAGGTCAC GTCCGCCCCC GGCGCCACCA CGCTGCGCAT CACGCCGGAC
CAGAAGCATG TGCTGGTCGC CAATGGCGCC GGCGGCGTGG TGACATGGTT CCGCTTCGAC
GGGCGCCGGC TGAGCGACCG CACGGTGATC ACCCTGCCGG GGGCCGCCGG ATTCCCGGGT
GGGCTGGCGA TCACCCCGGA CGGACGGCGC GCGCTGGTCA GCCTGTGGAA GGGCGACCGG
GTCTTCGTCC TGCATCTGGA CGGGGATCGG GTCACGGTCG ATCCGCATCC GCTGGAGATC
GGGCCGGGAC CGTGGAATAT CCGCCTGACC GGGGACGGGC ATTACGCGGT CATGGGCATC
CTGGGCCATG GCGAGGGATT GCCCGGCGCG CTGTCGGTCC TGGACCTGAC GGCCGCGCCG
ATCCGCGAAA TCCAGCGCGT CGTGGTTCCC AACGCGCCCG AGGGGCTGGA CATTTCCAGC
GACGGGCGGT TCGTCGCCGT CGTGTCGCAG AACGGGTCGG CCGTCGTTCC GACCTCGCCG
CACTATCATG ATCGCGGGAT CGTCACCGTG TTTTCACTGT CCGGCGGCCA TCTGACGCAA
CTGGCGCAGG CGCCGGGCAC GCTCTGGCCA CAGGGGCTGG TCTTCGCGCC GGACGGGACG
TCGATCCTGG TCCAGGGCGT CATGGACCGC ATGCTGCGCA CCCTGTCCTG GGACGGCACC
ACCCTGGCGG TGAAGGGGGA CACGCCCCTG CCCGGCGGGG GCGCCGACAT CGAACGCCAG
CGCTGA
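
The gene sequence is printed in space-separated blocks of ten bases, so it needs cleaning before any computation. The sketch below shows one way to strip the layout and compute a GC percentage, which could be compared with the GC content field of the record; the helper names are hypothetical, and only the first block of the sequence is used here for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First ten-base block of the gene sequence above, as an illustration.
first_block = "ATGCGTTTTG"
print(round(gc_percent(first_block), 1))
```

To check the record-level figure, paste the full gene sequence (all lines, spaces included) into `gc_percent`.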
 
Protein sequence
MRFGLLSLLC LSLMIAPAAQ AQTVLSANDA HSILRNGVQV LPKPLKPDTL SVLDRGADGL 
WTVRASVAVP VSVVGPPTAL VLTPDGRTAL VSSASKADPA RDRIVPDDRI SVIDLSGDRP
RVVQQVTSAP GATTLRITPD QKHVLVANGA GGVVTWFRFD GRRLSDRTVI TLPGAAGFPG
GLAITPDGRR ALVSLWKGDR VFVLHLDGDR VTVDPHPLEI GPGPWNIRLT GDGHYAVMGI
LGHGEGLPGA LSVLDLTAAP IREIQRVVVP NAPEGLDISS DGRFVAVVSQ NGSAVVPTSP
HYHDRGIVTV FSLSGGHLTQ LAQAPGTLWP QGLVFAPDGT SILVQGVMDR MLRTLSWDGT
TLAVKGDTPL PGGGADIERQ R
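
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own method. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence up to, but not including, the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGCGTTTTG GATTGCTGTC CCTTTTGTGC"))
```

Translating the first 30 bases gives the first ten residues of the listed protein sequence, and the final four codons of the gene (CGC CAG CGC TGA) give the closing residues followed by the stop.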