Gene Gdia_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0033 
Symbol 
ID6973422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp38606 
End bp39913 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content67% 
IMG OID643389566 
Producthypothetical protein 
Protein accessionYP_002274450 
Protein GI209542221 
COG category[S] Function unknown 
COG ID[COG2718] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.19629 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.0885106 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATATTA TCGACAGACG TCCCAATCCC CATGGGAAGA ATATCGAAAA TCGGCGGCGC 
GTGCTGCAGC GCGCCCGTGC CGCCGTGCAG AAGGCCGTGC GCGATTCGGT CGCGCGCGGC
AAGATCAGGG AGGTCGGGCA GGGAAATGCG GTGTCGATCC CGTCGGACGC GCTGCATGAG
CCCACCTTCC ACCGCGTCTT TTCCGAGGGT GTGCGCGAGA TCGTGCTCAC CGGTAACCGC
GAGTTCTCGC GCGGGGACCG GCTGCAACGC CCGCCGGGCG GCGTAGGGCA GGGCGGTGGC
GGCGAAGGGC AGGGCGGCGA AAAGGGGGGC GGCGAGGATT CCTTCCGCTT CGTGCTGAGC
CGCGACGAAT TCCTGGACCT GTTCTTCGAC GACCTGGAAC TGCCGGACCT GGTCAAGCGC
GAGATCGCGG CCACCGAGAC CAGCAAGCCC ACCCGCAGCG GGCTGAGCAA CGAGGGCGCG
CCGTCGCACC TGGATCTGGG GCGGACGATG CGGCGGTCGA TCGCCCGGCG CATCGGGCTG
GGCCGTCCCA AGCCCGACGA CCTGCGCGAG ATCGAGGCCC GGATCGCCGA ACTGGAGGAC
CGCCGGCCCC TGGGGACCGA GGATCTGGAC GAACTGGAAT CCCTGTGCGA ACGCCGGGGC
ACGCTGCGCC AGCGCCTGGC ACGGATTCCG TGGGTCGATC CGGTCGATCT GCGGTTCCGC
CGCTATACCG TCGTGCCGCA GCCGGCGACG CGGGCCGTGA TGTTCTGCAT CATGGACGTC
TCGGGCTCGA TGACCGAGCG GATGAAGGAC CTGGCGAAGC AGTTCTTCCT GCTGCTGCAC
GTCTTTCTGG AGCGGCGGTA CAAGAAGGTC GAAATCGTCT TCATCCGCCA CGCCGAAAGC
GCCGAGGAGG TGGACGAGGA AACCTTCTTC CACGATCCGC GCACCGGCGG GACCATCGTG
TCCTCGGCGC TGGAACTGAT GTCCAGCATC CAGCGCGAGC GCTTTCCGTC GGGAAGCTGG
AACATCTATG TCGCGCAGGC GTCGGACGGC GACAATGCGT CGTCCGACAC CACCCGGACC
GCCAGCCTGC TGCGGGACGA GATCCTGCCG GCGGTCCAGT ACTATGCCTA TATCGAAATC
ACGGGCTCGG GGGCGGTTAT CCGTGGCGAG ACCGACCTGT GGCGCAGCTA CCGGACGATC
GCGGATGAAA ACGACCACCT GGCGATCCGG CAGGTCGGCG ACCGCAAGGA AATCTTCCCC
GTTTTCCGCG AACTGTTTTC CCGACAGCAC GACACGGCGG AGGCATGA
 
Protein sequence
MDIIDRRPNP HGKNIENRRR VLQRARAAVQ KAVRDSVARG KIREVGQGNA VSIPSDALHE 
PTFHRVFSEG VREIVLTGNR EFSRGDRLQR PPGGVGQGGG GEGQGGEKGG GEDSFRFVLS
RDEFLDLFFD DLELPDLVKR EIAATETSKP TRSGLSNEGA PSHLDLGRTM RRSIARRIGL
GRPKPDDLRE IEARIAELED RRPLGTEDLD ELESLCERRG TLRQRLARIP WVDPVDLRFR
RYTVVPQPAT RAVMFCIMDV SGSMTERMKD LAKQFFLLLH VFLERRYKKV EIVFIRHAES
AEEVDEETFF HDPRTGGTIV SSALELMSSI QRERFPSGSW NIYVAQASDG DNASSDTTRT
ASLLRDEILP AVQYYAYIEI TGSGAVIRGE TDLWRSYRTI ADENDHLAIR QVGDRKEIFP
VFRELFSRQH DTAEA