Gene Gdia_1119 details


Gene Information

Locus tag: Gdia_1119
Symbol:
ID: 6974523
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1256919
End bp: 1258709
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 72%
IMG OID: 643390648
Product: DEAD/DEAH box helicase domain protein
Protein accession: YP_002275517
Protein GI: 209543288
COG category: [J] Translation, ribosomal structure and biogenesis
              [K] Transcription
              [L] Replication, recombination and repair
COG ID: [COG0513] Superfamily II DNA and RNA helicases
TIGRFAM ID:
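
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(1256919, 1258709)
print(length, protein_length_aa(length))
```

Under those assumptions the results agree with the Gene Length (1791 bp) and Protein Length (596 aa) fields.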


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.538016
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTTTC CCGAGACCCA TCCCGCCCTT CGGCGCGCGC TAGACGCGCG CGGCTATGAA 
GAGCCGACCC CCGTCCAGAA AGCCGTGCTG GATGTCGCGG CGGACGGAAG GGACCTGCTG
GTGTCGGCGC AGACCGGATC GGGCAAGACG GTGGCGTTCG GCCTGGCGAT GGCCGACACG
CTGCTGGGCG GGGCCGAACG GTTCGGCCCG GCCGGCGCCC CGCTGGCCGT CATCATCGCC
CCCACCCGCG AACTGGCCAT GCAGGTCCAG CGCGAACTGT CCTGGCTGTA CGCCCCGGCC
GGCGCGCGCA TCGTGTCGTG CATCGGCGGC ATGGATGCGC GGCGCGAAGC CCGGGCCCTG
GAAATCGGCG CGCATATCGT CGTCGGCACC CCCGGGCGGC TGTGCGACCA CCAGTCGCGC
GGACGCCTGG TCCTGTCCGA ACTGCGTGTC GTGGTGCTGG ACGAAGCCGA CGAAATGCTG
GACCTGGGCT TCCGCGACGA ATTGCAGCAA TTGCTCGATG CGATGCCCGA CACGCGGCGG
ACGCTGCTGT TTTCGGCCAC CATCGCCAAG GACATCGCCA CCCTGGCCCG CCGCTACCAG
CGCGACGCGC TGCGGATCGA CACGCTGTCC GGCGCCCGGC AGCACGCCGA CATCACCTAT
CGCGCCGTGA TGGCCGACCC GCGTGAGATC ATTCCGGCCG TCGTCAACAT CCTGCGCTTC
ACCGACAGCC CGACCGCCAT GGTGTTCTGC GCCACGCGCG AACTGGTGCG CCACATGCAG
GGCGCGCTGC TGGAACGCGG CTTTTCGTCG GTCGCGCTGT CGGGCGAACT GGGGCAGAGC
GAGCGTACCC GCGCGATCGA AAGCCTGCGC ACCGGGCAGG CGCGGGTCTG CGTCGCCACC
GACGTCGCAG CGCGCGGCAT CGACATTCCC GCGCTCAGCC TGGTGGTGCA TGCCAGCCTG
CCGACCGACA GCGCCACCCT GCTGCATCGT TCGGGCCGTA CCGGCCGCGC CGGGCGCAAG
GGCGTATGCG CCCTGATCGT GCCGGTTTCG ATGCGCCGGC GCGCCGAGCG GCTGCTGACG
ATGGCGAAGG TCTCGGCCGA ATGGACCGCC GTGCCGACCG CCGAGGCGAT CCGCGCCCAG
GACGCCGAGC GCCTGCTGAC CGACCCGATC CTGACCGAGG GCGCGGCCCC GGGCGACGAA
GACCTGGTCG CCCGCCTGGC CGAAAACCGC AGCGCCGAGC AGCTGGCCGC CGCCCTGCTG
CAGATGTACC GCGCCCGCCT GCCGGACCCC GAGGATATCC GCCCGCTGCG CGTCGAGGCA
CCCCGTGCCC CGCGCGAGGG CGATTATGCC CGCCGCAAGG AACAGGGCCC CCGCGAGGAC
CATGCGCCGC GTGGCGGACC GGGTGCGTGG TTTTCCATGA GCGTCGGCCG CCAGGACAAG
GCCGACCCGA AATGGCTGGT GCCGCTGATC TGCCGTCTGG GCGGGGTGCG CAAGAACGAT
ATCGGCGCGA TCCGCATCGC CGACGACCAC ACGCTATTCG AAATCGCGAC GGAATCGGCC
GAGCGCTTCA CCGCCTGCGT CGCCGCGACC GATTCCGACG AGGTGCGGAT CAGCGCGGCC
AAGGCCCCGG CGGGCGGCCC CCGTGGCGTA AGTGGCGAAC GCGGCTATGC CCCGCGCAAG
CCCGCGGGCA AGGCCGGCTA TCCGCACGGA CCGCGTGCCG GTCGTGCACC GGCGGGCGCC
ACGCGCGGCG CGCCCTCGTC ACGCAAGCGG GCGCCGTCGC GCGCGTCGTA G
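
The GC content reported for this gene is the fraction of G and C bases in the sequence above. A minimal sketch of that calculation follows; the helper name `gc_content` is hypothetical, and the function simply ignores the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C among the A/C/G/T characters of a DNA string."""
    # Keep only nucleotide letters so block spacing and newlines are skipped.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T characters")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# Example on the first two blocks of the gene sequence only.
print(gc_content("ATGCCGTTTC CCGAGACCCA"))
```

Passing the full gene sequence to the same function gives the gene-wide value to compare with the GC content field.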
 
Protein sequence
MPFPETHPAL RRALDARGYE EPTPVQKAVL DVAADGRDLL VSAQTGSGKT VAFGLAMADT 
LLGGAERFGP AGAPLAVIIA PTRELAMQVQ RELSWLYAPA GARIVSCIGG MDARREARAL
EIGAHIVVGT PGRLCDHQSR GRLVLSELRV VVLDEADEML DLGFRDELQQ LLDAMPDTRR
TLLFSATIAK DIATLARRYQ RDALRIDTLS GARQHADITY RAVMADPREI IPAVVNILRF
TDSPTAMVFC ATRELVRHMQ GALLERGFSS VALSGELGQS ERTRAIESLR TGQARVCVAT
DVAARGIDIP ALSLVVHASL PTDSATLLHR SGRTGRAGRK GVCALIVPVS MRRRAERLLT
MAKVSAEWTA VPTAEAIRAQ DAERLLTDPI LTEGAAPGDE DLVARLAENR SAEQLAAALL
QMYRARLPDP EDIRPLRVEA PRAPREGDYA RRKEQGPRED HAPRGGPGAW FSMSVGRQDK
ADPKWLVPLI CRLGGVRKND IGAIRIADDH TLFEIATESA ERFTACVAAT DSDEVRISAA
KAPAGGPRGV SGERGYAPRK PAGKAGYPHG PRAGRAPAGA TRGAPSSRKR APSRAS
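
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is not the pipeline that produced this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the input contains only A, C, G and T. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and newlines
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCCGTTTC CCGAGACCCA TCCCGCCCTT"))  # MPFPETHPAL
```

The output matches the first ten residues of the protein sequence listed above.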