Gene Gdia_2047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_2047 
Symbol 
ID6975474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp2269401 
End bp2270489 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content68% 
IMG OID643391577 
Productcytochrome oxidase assembly 
Protein accessionYP_002276422 
Protein GI209544193 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1612] Uncharacterized protein required for cytochrome oxidase assembly 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTGC GTCCGGCCGG CAGCCGCGAC GATGCCTCGC CCATCGCGCT TCGCGACCGC 
AGGCGTATCT CGACCTGGCT GTTCGTCATC TGCTTCATGC TGATCGGCCA GATCGCGCTG
GGCGGATACA CCCGGCTGAC CGGTTCGGGC CTGTCGATCA TGGACTGGCG GCCGGTCACC
GGCATCATCC CGCCCTTGTC CCACGCGGAA TGGGAGCGGC AGTTCGCGCT GTACCAGACC
ATTCCGCAGT ACAAGATCCT GCATGACGGA TTCGGGCTGG CCGGGTTCCA GAAGATCTTC
TGGGCGGAAT GGACCCATCG GTTCTGGGCC CGGGTCATGA GCCTGGCGCT GCTGGCGCCG
CTGATCTGGT TCGCCGTGAC CGGCGCGCTG ACGCGGGGAC TGATCGCGCG GCTGCTGCTG
TATTTCGTGC TGGGCGGGCT GCAGGGGGCG ATCGGCTGGT TCATGGTCGC ATCGGGCTTC
GACCAGAACA GCACGGCGGT CGAGCCGGTG CGGCTGGTCC TGCATCTGGG CTGCGCCTTC
GCGCTGTATA TCGCCATCCT GTGGACCGCG CTGTCGGTCC GCACGCCCCG CGCCGCCTTC
ATCCCCGCCA CGGCGGCGGT GGTGCGGACG AAGCGGCTGG TGTGGTGCGC CACGATCCTG
ATCGGCATCA CCATCACCGC TGGCGGCTTT ACCGCCGGGA CCCACGCGGG TTTTTCCTAC
AACACCTTTC CGCTGATGGA CGGGCGCCTG ATTCCCCATG GCTACGCCCG GCTGTCGCCG
TTCTGGCTGA ACTGGTTCGA GAACGTGCCG GCCGTCCAGT TCGACCACCG GCTGCTGGCG
ACCGTGACCG CGCTGGCCAT CGGGGCCTGC CTGTTCGCGG GCCTGCGCAC GCCGCAACTG
GGCAAGCCGG CGCAGGACGC GCTGATGCTA ATGGGCTGGG CGGTCCTGAT TCAGTACGCG
CTGGGCATCA CCACCCTGCT TCTGGTCGTT CCCGCCTGGG CCGGAACCGT GCACCAGACC
TGGGCCGCCG TCCTTCTGAC CATCGCCATC GTGACCCTGC ACCGGCTTCG CGGCGTCGGC
CGCGTCTGA
 
Protein sequence
MSLRPAGSRD DASPIALRDR RRISTWLFVI CFMLIGQIAL GGYTRLTGSG LSIMDWRPVT 
GIIPPLSHAE WERQFALYQT IPQYKILHDG FGLAGFQKIF WAEWTHRFWA RVMSLALLAP
LIWFAVTGAL TRGLIARLLL YFVLGGLQGA IGWFMVASGF DQNSTAVEPV RLVLHLGCAF
ALYIAILWTA LSVRTPRAAF IPATAAVVRT KRLVWCATIL IGITITAGGF TAGTHAGFSY
NTFPLMDGRL IPHGYARLSP FWLNWFENVP AVQFDHRLLA TVTALAIGAC LFAGLRTPQL
GKPAQDALML MGWAVLIQYA LGITTLLLVV PAWAGTVHQT WAAVLLTIAI VTLHRLRGVG
RV