Gene Gdia_1645 details


Gene Information

Locus tag: Gdia_1645
Symbol:
ID: 6975061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 1830787
End bp: 1833561
Gene Length: 2775 bp
Protein Length: 924 aa
Translation table: 11
GC content: 70%
IMG OID: 643391180
Product: DNA topoisomerase I
Protein accession: YP_002276037
Protein GI: 209543808
COG category: [L] Replication, recombination and repair
COG ID: [COG0550] Topoisomerase IA
TIGRFAM ID: [TIGR01051] DNA topoisomerase I, bacterial
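
The coordinate and length fields in this record agree with each other, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; both are assumptions about this record's conventions, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1830787
end_bp = 1833561

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes its stop codon, which does not code for an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "2775 924"
```

The result matches the reported gene length of 2775 bp and protein length of 924 aa.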


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.247282
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGACG TCGTCGTGGT CGAGTCGCCT GCCAAGGCGA AGACGATCAA CAAGTATCTG 
GGGGACGGAT TCACGGTTCT TGCCTCGTTC GGCCATGTCC GCGACCTGCC GCCGAAGGAT
GGCAGCGTCC GCCCCGACGA GAATTTCGCC ATGGACTGGC AGGCCGACGA GCGCGGCAGC
CGGCAGATGG CCGCGATCGC CAAGGCGTTG CGCGGGGCGC GGCATCTGTA CCTGGCCACT
GACCCGGATC GCGAGGGCGA GGCGATTTCC TGGCATGTCC GCGCGATGCT GGAGGAAAAG
AACCTGCTGA AGGGGGTGGA CGTCCAGCGC GTCACCTTCA ACGAGATCAC CAAGAGCGCC
ATCCGCGCCG CGATGGCCCA GCCGCGCGAC CTGGACCAGC CGCTGATCGA GGCCTATCTG
GCCCGGCGCG CGCTGGATTA CCTGGTGGGC TTCACCCTGT CGCCCGTGCT GTGGCGCAAG
CTGCCGGGGT CGCGCAGCGC CGGCCGGGTA CAGTCGGTCG CCCTGCGCCT GATCTGCGAG
CGCGAGGCGG AGATCGAGAG CTTCCGCGCC CGCGAATACT GGACGGTGGC CGCGCAGTTC
ACCACGCCCG GCGGCGCGGC CTTCACCGCG CGGCTGACGC ACCTGGCCGG CCGCAAGCTG
GACCAGTTCG ACCTGCATGA CGAAGCCGGG GCCATGGCGG CCAAGGCGGC CGTCGAGGCC
GGGCGCTTCG CCGTGCAGTC GGTGGAACGG CGCAAGGTCC GCCGCAACCC GCCGCCGCCC
TTCACGACCT CGACCATGCA GCAGGAAGCG TCGCGCAAGC TGGGCATGGG CGCGCAGGGC
ACCATGCGCA CCGCCCAGCA GCTGTACGAA GGCATCGACC TGGGCGGCGA GACGGTCGGT
CTGATCACCT ATATGCGAAC CGATGGCGTG CAGATGGCCG GCGAGGCCAT CGCCGCCATC
CGCGGCCATA TCGGCGAGAG CTTCGGCGCG CCCTACGTGC CCGAGAAGGC CCGGATCTAT
TCCACCAAGG CGAAGAACGC CCAGGAAGCG CACGAGGCGA TCCGCCCCAC CGATGTCAGC
CGCACCCCGG CGCAGATGGC CCGCTACCTG AATGACGAGC AGCGGCGGCT GTACGAACTG
ATCTGGAAAC GGTCGGTCGC CAGCCAGATG CAGTCGGCCG AACTGGACCA GGTGATCGTC
GAGATCGCGG ATGCCGGAGG GGCCGCCACC CTGCGCGCCA CCGGGTCGAT GATCGCCTTC
GACGGGTTCC TGAAGCTGTA CAGCGAAGGC CGGGACGACG CCGCGCCGAA GGACGAGCAG
GACGACGACA GCCGCATGCT GCCGCCGATG CGCGAGCGCG ACGCGCTGAA AACCGGCGAG
GTCGCGGCCG ACCAGCATTT CACCCAGCCG CCGCCGCGCT TCTCCGAAGC GTCGCTGGTC
AAGAAGATGG AAGAGATCGG GATCGGCCGG CCGTCGACCT ATGCCTCGAT CCTGACGGTG
CTGCGCGACC GCAATTACGT GCGGCTGGAT GCCCGCCGCT TCGTCCCCGA GGACCGGGGG
CGGCTGGTCA CCGCGTTCCT GACCTCGTTC TTCGAACGCT ATGTGGACAC GCAGTTCACG
GCGGGGCTGG AAGAGCAGCT GGACGACATA TCCGGTGGGC GGGCCGACTG GCGCGACGTG
ATGTCGGCCT TCTGGCAGGA TTTTTCCCGC GCGGTGGACC AGACGAAGGA TCTGAAGATC
TCCGACGTCA TCAGCGCGCT GGATGCCGAC CTGGCGCCGC ATTTCTTCCC CGCGCACCTC
GACGGCAGCG ATCCGCGCGT CTGCACCGCC TGCGGCACCG GGCGGCTGGG GCTGAAGCTG
GGGCGGTACG GCGCCTTCAT CGGCTGTTCC AACTATCCGA CCTGCCAGTT CACCCGCCGC
CTGGTGGTGG ACCCCAAGGA GGACGGCGAG GCCGACACGC TGAAGGACGG CATGCGCCTG
CTGGGCCAGA CGCCCGGCGG CGAGGATGTG ACCGTGCGGC GCGGCCCGTG GGGCCTGTAC
GTCCAGCAGG GCGAACCCGA CCCCGAGGAC AAGAAGGCCA AGCCCCGGCG CGCCACCATT
CCGCGCGGGA TCGAAGGCGA CAAGATCACG CTGGACCAGG CGCTGGGCCT GCTCTCGCTG
CCGCGGGTCG TCGGCATCCA TCCGGAAACC GGCGAGCAGA TCGAGGCCGG GCTTGGCCGC
TTCGGGCCAT ACGTGAAGAT GGGCGCGGTC TATGGATCGC TGGACAAGGA TGACGACATC
CTGACGGTCG GGCTGAACCG GGCGGTGGAC GTGCTGGCCC GCAAGCTGGC CTCGGTCCGC
ACCATCGCGC CGCACCCCAA GGATGGCGAG CCGGTGATCG TCCGCAAGGG ACGGTTCGGA
CCGTATATCC AGCATGGCAC GATGGTGGTG AACGTGCCCC GGGGCGAGGC CATGGAGGAC
GTGACCCTGG ACCAGGCGGT GGCGCTGCTG GCCGAAAAGG GCAAGCCGCT GAAGCCCAAG
GGCAAGGCCG CGGCGAAGAA GGCCCCCGCC CGGAAGACGG CCGCCAGGAA GGCGCCGGCC
AAGACCGCGG CGAAGAAAGC CGCCCCGGAC GGGGATGCCG ATACGCAGGC CAGGGCCGCG
AAACCCCCGG CGCGCAAGGC CGCCGCCCGC AAGACCCCGG CCGGCAAGGC AACGGGCAAG
ACCGCGAAGG GCAAGGCCGA ACCCGGCGAG GGTGCCGGGC CCCGCACGCG GCGCACGGCG
ACCGAGGCCG GCTGA
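
The gene sequence is printed in space-separated blocks of ten bases, so it must be joined before any computation such as the GC content reported in the gene information. The sketch below is one hedged way to do that in Python; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any tool associated with this record.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Example input: only the first line of the gene sequence shown in this record.
first_line = "ATGACTGACG TCGTCGTGGT CGAGTCGCCT GCCAAGGCGA AGACGATCAA CAAGTATCTG"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full sequence text to `clean_sequence` and then to `gc_fraction` would give a value to compare with the reported GC content.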
 
Protein sequence
MTDVVVVESP AKAKTINKYL GDGFTVLASF GHVRDLPPKD GSVRPDENFA MDWQADERGS 
RQMAAIAKAL RGARHLYLAT DPDREGEAIS WHVRAMLEEK NLLKGVDVQR VTFNEITKSA
IRAAMAQPRD LDQPLIEAYL ARRALDYLVG FTLSPVLWRK LPGSRSAGRV QSVALRLICE
REAEIESFRA REYWTVAAQF TTPGGAAFTA RLTHLAGRKL DQFDLHDEAG AMAAKAAVEA
GRFAVQSVER RKVRRNPPPP FTTSTMQQEA SRKLGMGAQG TMRTAQQLYE GIDLGGETVG
LITYMRTDGV QMAGEAIAAI RGHIGESFGA PYVPEKARIY STKAKNAQEA HEAIRPTDVS
RTPAQMARYL NDEQRRLYEL IWKRSVASQM QSAELDQVIV EIADAGGAAT LRATGSMIAF
DGFLKLYSEG RDDAAPKDEQ DDDSRMLPPM RERDALKTGE VAADQHFTQP PPRFSEASLV
KKMEEIGIGR PSTYASILTV LRDRNYVRLD ARRFVPEDRG RLVTAFLTSF FERYVDTQFT
AGLEEQLDDI SGGRADWRDV MSAFWQDFSR AVDQTKDLKI SDVISALDAD LAPHFFPAHL
DGSDPRVCTA CGTGRLGLKL GRYGAFIGCS NYPTCQFTRR LVVDPKEDGE ADTLKDGMRL
LGQTPGGEDV TVRRGPWGLY VQQGEPDPED KKAKPRRATI PRGIEGDKIT LDQALGLLSL
PRVVGIHPET GEQIEAGLGR FGPYVKMGAV YGSLDKDDDI LTVGLNRAVD VLARKLASVR
TIAPHPKDGE PVIVRKGRFG PYIQHGTMVV NVPRGEAMED VTLDQAVALL AEKGKPLKPK
GKAAAKKAPA RKTAARKAPA KTAAKKAAPD GDADTQARAA KPPARKAAAR KTPAGKATGK
TAKGKAEPGE GAGPRTRRTA TEAG
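
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following Python sketch illustrates this under stated assumptions: it uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which this sketch does not model), and the names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table in the conventional T, C, A, G ordering of the three codon positions.
# Assumption: standard amino-acid assignments, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACTGACG TCGTCGTGGT CGAGTCGCCT"))  # prints "MTDVVVVESP"
```

The output matches the first ten residues of the protein sequence. Alternative start codons such as GTG or TTG would need special handling, since this sketch translates them as ordinary codons; this gene begins with ATG, so that case does not arise here.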