Gene Gdia_1747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1747 
Symbol 
ID6975162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1929685 
End bp1930764 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content55% 
IMG OID643391270 
Producttransposase IS4 family protein 
Protein accessionYP_002276127 
Protein GI209543898 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.25501 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAGC CGGGATTTTT TGACGTTGAA GAGCGGCTTG CCCGGTTAAG CGGGCTTGGC 
GATCAGCTCG AAGCATTTTC CCGGACTGTA GATTTTGAAG CGTTCCGTCC TGATCTGGAG
AAGGCTCTGG CCTATTCAGA TGGAAGCAAA GGCGGGCGAC CGCCATTTGA TCCGTTGCTA
ATGTTCAAGA TCCTGGTCAT CCAGACGCTC AACAATTTGT CTGATGAGCG CACGGAGTAT
CTGATCAACG ACCGCCTGTC CTTCATGCGC TTCCTTGAGC TGGGGCTTTC AGATCGAGTT
CCGGATGCCA AAACAATCTG GCTGTTCCGT GAACGCCTGA CCCAGGCGGG AGCGATCGAG
GGTCTGTTCA ATCGCTTTGA TACAATGCTG CGGCACGCAG GCTATCTGCC GATGTCGGGC
CAGATCCTGG ATGCCACACT GGTGGCTGCT CCAAAGCAGC GCAATACCAA CGCCGAGAAA
GCCGACCTCC GGGCAGGCCG TATTCCCGAA AACTGGCAGT ACAAGCCGTC AAAGCTGTCG
CACAAGGATC GTCATGCGCG CTGGACACTG AAGTTTACGA AGGCGAAGCG TCAGGATGAC
GGAACAACCC CCACAACGGA TCTCGCTATC CCGTTCTTTG GCTATAAATC GCATGTTTCC
ATCGATCGGA AATACCGGTT CATCCGGAAA TGGAAAACAA CGCATGCCGC CGCCAATGAT
GGCGCGCGAT TGAGAGAGGG GCTGCTGGAT AAAACCAATA CGGCCTCAAA CGTCTGGGCT
GACACAGCCT ATCGCTCAAA AGCCAACGAA GACTTCATGG AAAAGCAGGT CTTTGTCTCA
AAGGTTCACA GGAAGAAGCC GCATCTCAAA CCCATGCCCC GCCATATCCA GCGGTCCAAT
GCAGGAAAGT CCGTGATCCG GTCCCGTGTC GAGCATGTCT TTGCCGATCA GAAGTCGCAG
ACGGGACTGT TCATCCGAAC TGTCGGTATC ACCCGGTCCA CCATGAGGAT CGGGCTGGCC
AATATCGTCT ACAATATGCG CCGCTTTCTC TTCCTGCAGA AGATCAGCGC GAGCGCGTAG
 
Protein sequence
MKQPGFFDVE ERLARLSGLG DQLEAFSRTV DFEAFRPDLE KALAYSDGSK GGRPPFDPLL 
MFKILVIQTL NNLSDERTEY LINDRLSFMR FLELGLSDRV PDAKTIWLFR ERLTQAGAIE
GLFNRFDTML RHAGYLPMSG QILDATLVAA PKQRNTNAEK ADLRAGRIPE NWQYKPSKLS
HKDRHARWTL KFTKAKRQDD GTTPTTDLAI PFFGYKSHVS IDRKYRFIRK WKTTHAAAND
GARLREGLLD KTNTASNVWA DTAYRSKANE DFMEKQVFVS KVHRKKPHLK PMPRHIQRSN
AGKSVIRSRV EHVFADQKSQ TGLFIRTVGI TRSTMRIGLA NIVYNMRRFL FLQKISASA