Gene Gdia_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_1074 
Symbol 
ID6974471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp1204634 
End bp1206322 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content64% 
IMG OID643390596 
ProductResolvase domain 
Protein accessionYP_002275472 
Protein GI209543243 
COG category[L] Replication, recombination and repair 
COG ID[COG1961] Site-specific recombinases, DNA invertase Pin homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.324601 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCG TTGCTCTCTA TGCCCGCTAT TCCTCGGACA ACCAGCGGGC GGCTTCGATC 
GAGGACCAGT TCCGCCTCTG TGAGGAACGC GCCACGCGCG AGGGCTGGCA GGTGGTGGAG
TATTACCGCG ATGCGGCCAT CTCCGGGGCC AGCATGATCC TGCGCCCCGG CATCCAGACA
CTGTTGCAGG ACGCCCAGGC CGGACAGTTC GATGTCGTGC TGGCCGAGGC GCTGGACCGC
GTGTCCCGCG ACCAGGCGGA CGTCGCTACC CTGTTCAAGC GTCTCCAGTT TGCGGGCGTC
ACCATTGTCA CCTTGGCCGA GGGCGAGATC AGCGAGCTTC ACGTCGGCCT GAAAGGAACG
ATGAACGCCC TGTTCCTCAA GGATTTGGCC CTGAAGACCC ATCGCGGCCT GCGGGGCCGG
GTCGAGGCGG GTAAATCCGG CGGCGGCCTG TGCTACGGCT ATCGCGTCGT GCATCAGATG
GACGCGCGGG GCGAACTGAT CCGGGGCGAG CGCGAGATCG ATCCGGTCCA GGCCGAGATC
GTGCGCCGCA TCTTCCATGA GTTCGCCTCG GGCCGCAGCG CGCTTTCCAT CGCCAGCCGT
CTGAACGACG AAGGTATCCT CAGCCCCACG GGCGGCAAAT GGAACAATAC CACCCTACGG
GGCAACGCCC TGCGTGGCAC CGGCATTCTG AACAACGAGC TTTATATCGG CCGGCTGGTG
TGGAACCGGT TGCGCTACCT GAAAGACCCG CAGACCGGCA AGCGCGTGTC CCGACTCAAT
CCGCAATCGG AATGGATCAT CACCGATGTC CCCGACCTGC GGATCATGGA CGATGATCTC
TGGCAGGCGG TGCGCGATCG GCAAGCGCTG ATCGCGGAGA AGAGCGTGAA CATCAGTGCC
GGCATCCGGG CCTCGATCAA CAAGCTCAAT GGCCAGCGCC GTCCAAAATC GCTCCTGTCG
GGGTTGGTGT TCTGTGGGGT ATGCGGCGGT CCCTGCTCGA TCCGGGGCGG CGACCGCTTC
GCCTGCTCCA CCCATATGGA CAACCGCTCC TGCACCAACA GGACCACGAT CCGTCGCCCG
GAACTGGAAG GCCGGGTGCT GTGTGGCCTC AAGGACCGGC TGATGACACC GGAAGCCGCC
GCCGAAGCAA TGCGCGCCTA TGTTGAGGAA ACCAATCGCG CCAACCACGA ACGCCGCGCC
AGCACGGCGG GATGGCAGAC GGAACTGACC AAGGTGCGCA AGGGCTTGAA ACAGATGCTC
CAGGTGATCG AGGATGGCGG CTATACGCGC GGCATGGTCG AGCGCATGCG CGAAATGGAG
GCCCGTGAGG ATGAACTGGT CGCCCTGCTC GCCGCCCAGC CGCAGGACGT GCCGGACATT
CATCCCAACG TCGCCGGGAT CTTCAAGCAC AAGGTAGAGC GGCTGGCCGA GACGCTGAAC
CATCCCGAGG ACCGGCAGGA AGCCTCAGAG GCCATCCGCG CGCTGATCGA GAAGATCGTG
CTTAATCCCG GCAAGGGCCG CGGCGAGATG CATGCCACGC TCCATGGCGA ACTCGGAAAA
CTGCTGGATT TCGCCGCCTC ACGCGGTCAG GGAGCCAAAA ACACGAACAC TCCCGGAGCT
AGGGCTTCGG GAGTGTCGGT ATCGGGTATT GCGGGGGCAG GATTTGAACC TGCGGCCTTC
AGGTTATGA
 
Protein sequence
MTRVALYARY SSDNQRAASI EDQFRLCEER ATREGWQVVE YYRDAAISGA SMILRPGIQT 
LLQDAQAGQF DVVLAEALDR VSRDQADVAT LFKRLQFAGV TIVTLAEGEI SELHVGLKGT
MNALFLKDLA LKTHRGLRGR VEAGKSGGGL CYGYRVVHQM DARGELIRGE REIDPVQAEI
VRRIFHEFAS GRSALSIASR LNDEGILSPT GGKWNNTTLR GNALRGTGIL NNELYIGRLV
WNRLRYLKDP QTGKRVSRLN PQSEWIITDV PDLRIMDDDL WQAVRDRQAL IAEKSVNISA
GIRASINKLN GQRRPKSLLS GLVFCGVCGG PCSIRGGDRF ACSTHMDNRS CTNRTTIRRP
ELEGRVLCGL KDRLMTPEAA AEAMRAYVEE TNRANHERRA STAGWQTELT KVRKGLKQML
QVIEDGGYTR GMVERMREME AREDELVALL AAQPQDVPDI HPNVAGIFKH KVERLAETLN
HPEDRQEASE AIRALIEKIV LNPGKGRGEM HATLHGELGK LLDFAASRGQ GAKNTNTPGA
RASGVSVSGI AGAGFEPAAF RL