Gene Gdia_1719 details


Gene Information

Locus tag: Gdia_1719
Symbol:
ID: 6975134
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1900984
End bp: 1902101
Gene Length: 1118 bp
Protein Length: 372 aa
Translation table: 11
GC content: 63%
IMG OID: 643391246
Product: transposase IS3 family protein
Protein accession: YP_002276103
Protein GI: 209543874
COG category: [L] Replication, recombination and repair
COG ID: [COG2801] Transposase and inactivated derivatives
TIGRFAM ID:
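
The Start bp, End bp, and Gene Length values above agree if the coordinates are read as 1-based and inclusive. A minimal Python sketch of that check follows; the inclusive convention is an assumption inferred from the numbers, and the function name `span_length` is hypothetical, not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = span_length(1900984, 1902101)
print(gene_length)  # 1118
```

The result matches the listed Gene Length of 1118 bp.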


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.662695
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAG CGCGATTTAC GCAGGACCAG ATTATCGGGG TCCTGAAAGA GCATCAGGCG 
GGCGCTACGG CTGCGGATCT GTGCCGCAAG CACGGGATCA GTGACGCGAC CTTCTACACC
TGGCGGTCGA AATACGGCGG GATGGAGGTG TCGGAAGCGC GGCGCCTCAA GGCTCTTGAA
GAAGAGAACG CGAAGCTGAA GCGGCTTCTG GCGGAGAGCG TGATGGACGT CTCGACGCTG
AAGGAACTAC TGGCAAAAAA CTCGTGACGC CCGGTTTGCG GCGGGAAGCC GTGACCTGGG
CGATCCGGGA GAAAGAGTAT TCGCAGCGAC GGGCCTGCCG GCTGATCGGC ATGGACCCGA
AGACCTGGCG CTATGCGTCA CGCCGCCCGG ATGATGCCGC AGCGCGCGGG CGGCTGCGCG
AACTGGCTGG GGAGCGACGG CGATTTGGCT ACCGGCGACT GCATATCCTG CTCGGCCGGG
AAGGAATGAC GATGAACCAC AAGAAGCTGT TCCGGCTGTA TCGCGAAGAG GGGCTGTCGG
TCCGCAAGCG TGGCGGCCGG AAACGGGCGC TGGGCACGCG CTCGCCGATG ATGCTGCCCG
ACGGGCCGAA CCAGCGCTGG AGCCTGGATT TCGTCTCGGA TGCATTGAAC AACGGACGGC
GCTTCCGGGT GCTGACGGTG GTCGACGACT ACACGCGCGA ATGTCTGGCG CTGGTGGCGG
ACACCTCGTT ATCAGGCGAA CGCCTCGGTC GTGAACTCGA CCGGATCGGC GAGCATCGCG
GCTGGCCGCT GATGATCGTT AGCGACAATG GCACCGAGAT GACATCGAAC GCGATCCTGG
CCTGGCAGCA GAAGCGATCG GTGCTGTGGC ACTATATCGC ACCGGGCAAG CCGCAGCAGA
ACGGGTTCGT CGAGAGCTTC AACGGCCGGT TCCGCGACGA ATGCCTCAAT GAGCATCTGT
TCCGTAACAT CGCCCACGCT CGGACGGTCA TCGAGGACTG GCGGGCCGAC TACAACGCCG
TCAGGCCTCA CACCAGCCTC AATGGCATGA CGCCAGAGGC TTTCGCTCAA CACGCCACCA
AGGCATACAA CAATACACAG ACCCTAACTC AAAACTGA
 
Protein sequence
MKKARFTQDQ IIGVLKEHQA GATAADLCRK HGISDATFYT WRSKYGGMEV SEARRLKALE 
EENAKLKRLL AESVMDVSTL KELLAKKLVT PGLRREAVTW AIREKEYSQR RACRLIGMDP
KTWRYASRRP DDAAARGRLR ELAGERRRFG YRRLHILLGR EGMTMNHKKL FRLYREEGLS
VRKRGGRKRA LGTRSPMMLP DGPNQRWSLD FVSDALNNGR RFRVLTVVDD YTRECLALVA
DTSLSGERLG RELDRIGEHR GWPLMIVSDN GTEMTSNAIL AWQQKRSVLW HYIAPGKPQQ
NGFVESFNGR FRDECLNEHL FRNIAHARTV IEDWRADYNA VRPHTSLNGM TPEAFAQHAT
KAYNNTQTLT QN
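
The sequences above are printed in blocks of ten characters, while most tools expect them without whitespace. The following is a minimal Python sketch for stripping the grouping and computing length and GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C characters in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence from the record, used as sample input.
first_line = "ATGAAGAAAG CGCGATTTAC GCAGGACCAG ATTATCGGGG TCCTGAAAGA GCATCAGGCG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applied to the full gene sequence, the cleaned length should match the Gene Length field (1118 bp), and the GC percentage can be compared with the GC content field. Note that 1118 is not a multiple of three and the record flags the gene as spliced, so translating the gene sequence in a single reading frame is not expected to reproduce the 372 aa protein sequence.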