Gene Gdia_2254 details


Gene Information

Locus tag: Gdia_2254
Symbol:
ID: 6975683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2502460
End bp: 2503476
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 72%
IMG OID: 643391781
Product: Thioredoxin domain
Protein accession: YP_002276624
Protein GI: 209544395
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG3118] Thioredoxin domain-containing protein
TIGRFAM ID: [TIGR01068] thioredoxin
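
The length fields in this record agree with its coordinates. The following is a minimal sketch of that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive, 1-based coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(2502460, 2503476)
    print(length)                  # 1017
    print(protein_length(length))  # 338
```

Both values match the Gene Length (1017 bp) and Protein Length (338 aa) fields listed above.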


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.708544
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.526752
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACA TCATCGGTCA GTCCCGCGGC GGCCAGCGCG CTACCGGCGG CCTGGTGGAC 
GAGGCAGGCG TTCCCGCCGC GCCGTCGGGG ATGGCCGGAG CGCCGGCCGG GCCCGGCGGG
AATGGTGCGA TGATCGTCGA CGGCACCCAG GACACCTTCA TGCAGGATGT CCTGGAGGCC
AGCCGTACCC TGCCGGTCCT GGTCGATTTC TGGGCCACCT GGTGCGGGCC GTGCCGCCAG
TTGACCCCGG TGCTGGAAAA GATCGTCCGG TCGGCCGGCG GCCGCGTGAA GCTGGTCAAG
ATCGATGTCG ACGCCAACCG GGCCCTGGCC CAGCAACTGA CCCAGGTCGG GCTGCCGCTG
CAGTCCATCC CGCTGGTGGC CGCCTTCTGG CAGGGGCAGA TCCTGGACCT GTTCCAGGGC
GCGCAGCCGG AAAGCGAGAT CAAGCGCTTC GTCGAAGGAC TGCTGAAGGC CGCGGGCGGC
GGCAGCATGC CGGCGGCGGA CCTGATCGTC GCGGCCCGCG CGGCGCTGGA GGCCGGCAGC
GCCGAGGAAG CGGCCGGCCT GTATGCCCAG ACCCTGGAGA TCGAACCCGA AAACGCCGCC
GCCTGGGGCG GCCTGGTGCG GGCGCTGATC GTGATGGGGG ACGAGGACGC CGCCGAGGCC
GCACTGGCCG ACGTGCCGGC CAGGATTGCC GACCATGCCG AGATCACCGG CGCCCGCGCC
GCGCTGGACC TGAAGCGCGA GGGCCGCAAG GCCGCCGAGG CATCCGAAGG GCTGCGCCGG
CGGCTGGCCG CGAATCCGGC GGACCACGAG GCCCGCTACG AACTGGCCGC CGCCCTGAAC
GCGGCCGGCC ACCGGCAGGA AGCCGCCGAC GAACTGCTGA CCATCATGCG CCAGGACCGT
GCCTGGAACG ACGATGCGGC GCGGCTGCAA TTGATCCGGC TGTTCGAGTC CTGGGGCCAT
GACGACCCGG CGACCCTGCA GGCCCGGCGG CGTATGTCCG CGCTGCTGTT TTCATGA
 
Protein sequence
MDYIIGQSRG GQRATGGLVD EAGVPAAPSG MAGAPAGPGG NGAMIVDGTQ DTFMQDVLEA 
SRTLPVLVDF WATWCGPCRQ LTPVLEKIVR SAGGRVKLVK IDVDANRALA QQLTQVGLPL
QSIPLVAAFW QGQILDLFQG AQPESEIKRF VEGLLKAAGG GSMPAADLIV AARAALEAGS
AEEAAGLYAQ TLEIEPENAA AWGGLVRALI VMGDEDAAEA ALADVPARIA DHAEITGARA
ALDLKREGRK AAEASEGLRR RLAANPADHE ARYELAAALN AAGHRQEAAD ELLTIMRQDR
AWNDDAARLQ LIRLFESWGH DDPATLQARR RMSALLFS
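
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that layout, translate the nucleotide sequence, and compute its GC fraction. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs only in the permitted start codons), so alternative start codons are not handled here. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
# Codon table in TCAG order; amino-acid assignments are shared by the
# standard code and translation table 11. '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Trailing bases that do not form a complete codon are ignored.
    """
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 21 bases of the gene sequence, copied with the page's spacing.
    print(translate("ATGGATTACA TCATCGGTCA G"))  # MDYIIGQ
```

Applied to the full gene sequence, the output of `translate` can be compared with the protein sequence listed above, and `gc_content` with the GC content field of the record.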