Gene Gdia_0649 details

Gene Information

Locus tag: Gdia_0649
Symbol:
ID: 6974046
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 739816
End bp: 740933
Gene Length: 1118 bp
Protein Length: 372 aa
Translation table: 11
GC content: 63%
IMG OID: 643390180
Product: transposase IS3 family protein
Protein accession: YP_002275056
Protein GI: 209542827
COG category: [L] Replication, recombination and repair
COG ID: [COG2801] Transposase and inactivated derivatives
TIGRFAM ID:
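
The start, end and gene length fields above are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive; the record does not state this, but it matches the listed gene length. The function name `gene_length` is hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end_bp - start_bp + 1


length = gene_length(739816, 740933)
print(length)      # 1118, matching the "Gene Length" field
print(length % 3)  # 2, so the gene is not a whole number of codons
```

A length that is not a multiple of three cannot be translated in a single reading frame, which fits the record flagging the gene as spliced. The 372 residues plus a stop codon make 373 codons (1119 bases), one more than the gene length. That would be consistent with a single -1 frameshift, although the record itself does not say so.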


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.343729
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAG CGCGATTTAC GCAGGACCAG ATTATCGGGG TCCTGAAAGA GCATCAGGCG 
GGCGCTACGG CTGCGGATCT GTGCCGCAAG CACGGGATCA GTGACGCGAC CTTCTACACC
TGGCGGTCGA AATACGGCGG GATGGAGGTG TCGGAAGCGC GGCGCCTCAA GGCTCTTGAA
GAAGAGAACG CGAAGCTGAA GCGGCTTCTG GCGGAGAGCG TGATGGACGT CTCGACGCTG
AAGGAACTAC TGGCAAAAAA CTCGTGACGC CCGGTTTGCG GCGGGAAGCC GTGACCTGGG
CGATCCGGGA GAAAGAGTAT TCGCAGCGAC GGGCCTGCCG GCTGATCGGC ATGGACCCGA
AGACCTGGCG CTATGCGTCA CGCCGCCCGG ATGATGCCGC AGCGCGCGGG CGGCTGCGCG
AACTGGCTGG GGAGCGACGG CGATTTGGCT ACCGGCGACT GCATATCCTG CTCGGCCGGG
AAGGAATGAC GATGAACCAC AAGAAGCTGT TCCGGCTGTA TCGCGAAGAG GGGCTGTCGG
TCCGCAAGCG TGGCGGCCGG AAACGGGCGC TGGGCACGCG CTCGCCGATG ATGCTGCCCG
ACGGGCCGAA CCAGCGCTGG AGCCTGGATT TCGTCTCGGA TGCATTGAAC AACGGACGGC
GCTTCCGGGT GCTGACGGTG GTCGACGACT ACACGCGCGA ATGTCTGGCG CTGGTGGCGG
ACACCTCGTT ATCAGGCGAA CGCCTCGGTC GTGAACTCGA CCGGATCGGC GAGCATCGCG
GCTGGCCGCT GATGATCGTT AGCGACAATG GCACCGAGAT GACATCGAAC GCGATCCTGG
CCTGGCAGCA GAAGCGATCG GTGCTGTGGC ACTATATCGC ACCGGGCAAG CCGCAGCAGA
ACGGGTTCGT CGAGAGCTTC AACGGCCGGT TCCGCGACGA ATGCCTCAAT GAGCATCTGT
TCCGTAACAT CGCCCACGCT CGGACGGTCA TCGAGGACTG GCGGGCCGAC TACAACGCCG
TCAGGCCTCA CACCAGCCTC AATGGCATGA CGCCAGAGGC TTTCGCTCAA CACGCCACCA
AGGCATACAA CAATACACAG ACCCTAACTC AAAACTGA
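
The GC content field can be rechecked from the gene sequence as printed. This is a minimal sketch, not the database's own method: the helper name `gc_percent` is hypothetical, and it assumes the sequence holds only A, C, G and T separated by whitespace.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; spaces and newlines between blocks are ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, pasted with its spacing intact.
first_line = "ATGAAGAAAG CGCGATTTAC GCAGGACCAG ATTATCGGGG TCCTGAAAGA GCATCAGGCG"
print(f"{gc_percent(first_line):.1f}% GC over the first 60 bases")
```

Passing the full 1118-base sequence in place of `first_line` gives the value to compare against the rounded figure in the record.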
 
Protein sequence
MKKARFTQDQ IIGVLKEHQA GATAADLCRK HGISDATFYT WRSKYGGMEV SEARRLKALE 
EENAKLKRLL AESVMDVSTL KELLAKKLVT PGLRREAVTW AIREKEYSQR RACRLIGMDP
KTWRYASRRP DDAAARGRLR ELAGERRRFG YRRLHILLGR EGMTMNHKKL FRLYREEGLS
VRKRGGRKRA LGTRSPMMLP DGPNQRWSLD FVSDALNNGR RFRVLTVVDD YTRECLALVA
DTSLSGERLG RELDRIGEHR GWPLMIVSDN GTEMTSNAIL AWQQKRSVLW HYIAPGKPQQ
NGFVESFNGR FRDECLNEHL FRNIAHARTV IEDWRADYNA VRPHTSLNGM TPEAFAQHAT
KAYNNTQTLT QN
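
The two sequences above can be compared directly to see why the gene is flagged as spliced. The sketch below translates bases 241-280 of the gene (the first 40 bases of the fifth sequence line) in the original frame and again with a one-base backward shift. The shift position is inferred here by comparing the two sequences, not stated in the record, and any slip within the run of A bases gives the same protein. The `translate` helper is hypothetical and uses only the amino-acid assignments shared by the standard code and translation table 11.

```python
from itertools import product

# Codon table with bases ordered T, C, A, G ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate from the first base until a stop codon or the last full codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


fragment = "".join("AAGGAACTAC TGGCAAAAAA CTCGTGACGC CCGGTTTGCG".split())

# Reading straight through hits a TGA stop after eight residues.
print(translate(fragment))  # KELLAKNS

# Re-reading one base after the sixth codon (a -1 shift) avoids the stop.
print(translate(fragment[:18]) + translate(fragment[17:]))  # KELLAKKLVTPGL
```

The shifted reading reproduces residues 81-93 of the protein sequence (KELLAKKLVT PGL), while the unshifted reading does not.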