Gene Gdia_0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0022 
Symbol 
ID6973411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp26417 
End bp27526 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content68% 
IMG OID643389555 
Producthypothetical protein 
Protein accessionYP_002274439 
Protein GI209542210 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.205908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.027287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCATG CCCCGACCGC CGCTTCCAGC CGGGCCAAGA CGGGGGGGGA TGCGGCACGG 
CAGCCCGCGC CGCTGTCGAT GTTCGAATTC TGGCCGGGCT GGGCCATCTA TACCCCGGTC
GTGCTGTACT GGATCCTGCT GGGCCTGTGG CACCGGGATT TCAGCCTGCC GACCGCGGCC
AATCCCCGCA TCCTGACCGG CGGCCTGTGC GGCGAAAGCA AGACCAGCAT CCTGGACATG
GCGGGCGAGA CGGCGCGGCG CTGGATCGCG CCCTATGTCT CGGTCACCAC CGGTTCGGCC
GATGACGGCG CGGCCGCCCT GGCGGCGCTG GATCGCGGCG GGCTGGCGCT GCCGGTGGTG
GTGAAGCCCG ATATCGGCTG CAACGGCGCC GGGGTGAAGC TGGTCACGAC CCCGGACGAA
CTGGTCGCGG CGGTGGCGCT GTACCCGCCC GATACCCCCC TGGTCATGCA GCGGCTGATC
CCGTTCGAGC ACGAGGCCGG CGTGTTCTAT ATCCGCCACC CAGACGAGGA CCGGGGCCGG
ATATCCTCGC TGACCTACAA GGAGGCACCG GTCATCGTCG GCGACGGCCG GTCCACGGTG
CGACAACTGA TCGATGCCGA CGCGCGCACG CGCCTGGTGC CGCATCTGTA TCTGCCCCGC
CTGGGCGATC GGGTGCATGA GGTCCTGCCG GCGGGCATGC CGCTGCGGCT GGTCTTCGCC
GGGAATCACA GCAAGGGGTC GATCTTCCGC AACGGCGCGG ACGACATCAC CCCGGCCCTG
GTCGAGCAGA TCGACCGGAT CATGCAGGAT ATCCCCGATT TCCATTTCGG CCGGATCGAC
CTGAAGTTCG AATCCATCGC CGCCCTGCGC CTGGGCCGGG GGTTCGAAAT CATCGAGATC
AACGGCGTGG GGTCCGAAGC GACCCATATC TGGGATTCGC GCACCACCCT GCGCGAGGCC
TATGCGGCGC AGTTCACGCA TTACCGCGAG ACCTTCCGCA TCGGCGCCAA GAAGAAGAAG
GCCGGATGGC GGACCAGCGG CGCCTTCACC ATGCTGCATT ACTGGCGCCA GCAGAGGCGG
CTGCTCGCCT CCTACCCCCT GAACGACTAG
 
Protein sequence
MNHAPTAASS RAKTGGDAAR QPAPLSMFEF WPGWAIYTPV VLYWILLGLW HRDFSLPTAA 
NPRILTGGLC GESKTSILDM AGETARRWIA PYVSVTTGSA DDGAAALAAL DRGGLALPVV
VKPDIGCNGA GVKLVTTPDE LVAAVALYPP DTPLVMQRLI PFEHEAGVFY IRHPDEDRGR
ISSLTYKEAP VIVGDGRSTV RQLIDADART RLVPHLYLPR LGDRVHEVLP AGMPLRLVFA
GNHSKGSIFR NGADDITPAL VEQIDRIMQD IPDFHFGRID LKFESIAALR LGRGFEIIEI
NGVGSEATHI WDSRTTLREA YAAQFTHYRE TFRIGAKKKK AGWRTSGAFT MLHYWRQQRR
LLASYPLND