Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0471 |
Symbol | |
ID | 6973865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 516237 |
End bp | 517433 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643390003 |
Product | integrase family protein |
Protein accession | YP_002274882 |
Protein GI | 209542653 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.116709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.000616461 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCAAGCA GCAGCGATAA AACACGAGAT CAGGCCGTGA CCAGGGCCAA GGGGGACATA GACCCTGCGA CTGGTAAGAC CTTGCCGGCT GGCATCTCGT GCCGCGGCCC TAAGCAGTAT CAGGCTCGCA AGCAAGTCGA TGGCAAGCGC TACCGCAAGA CCTTCTCGAC CGCCGCGCTG GCTCGACGGT GGCTCAACGA GACCGCCGCC AAGGTCGAAC TCGGCCAATT CAAGGACACC AGCGCGCTCG ACAAGCAGAC CATCGGCGAC CTTGTGGACA GGTATGCCAA GGAGTGCATG GATGGGCGTG GCGCTGACCT GACAGGGCAT ATCCCGGCCA TTCTCCGGGA CAAGGATCTG CCTGGAGTCC GGCTGTCCAA ATTCTGCCTG GCGGATGTGC GCGGCTTCCG GGATCGGATG ATGTCTGCCG ACTATTCCCC GGCCACGGTC GTCAAACGTC TAAACCTACT TGCCTCGGTC ATCCAGCACG CCATCAGCGA GTGGGATACC TCCATCGTCA ACTACGCCTC CGGACGGTTC GTGAAGCGGC CGGAAGGTGC GGACAAGAAG CGCAATAGAC GGCTCGACGA AGACAAGGAC AAGGACGGGA AGACGGAGTT CGATAGGCTG ATCGTGGCTG TCTCGGACTC CGTCTATCCG GACGATGTGT GGCTGGTCCG CTGGTCGATC GAGCAGGGCA CAAGACGCGG TGAGGCGATC GGCCTGCGAT GGTGCGATGT CGATATCGAA CGCAGCTTGA TCAAGCTGGG GGGCGAGTCC GGCAAGACCA AGACGCACAA GACCCAGGAA GAACAAGGCC CTGAAATCCG CCCGTTGACG CCGGGAGCAA GGCGACTCCT GCTTGAGAAA CGGGACACAT ACGAGACGCC GCCGGAGCCC GGCGACAGCG TGTTCAGCGT AGGCAAAGAG TCTACATTCA GCATGCGTTA CGGGCGGATG GTCAAACGCA CCGGGCTCCA CAACCTGACG TTCCATGACC TGCGCCACGA AGCGACCAGC CGCCTCGCGC GTCTGCTGCC GAACCCGCTG GACCTCAAGA GGGTCACGGG ACATCGTGAT CTGAAGAGTC TGGACCGGTA TTATCAGCCG GTCCCGGAAT CCATCAGCAA GCAGATCGAG GAGGCTGAGC GGCTGGCCGG CATCATCGCC GCCGAGGAAG GCGACGATGA CGAGTAA
|
Protein sequence | MASSSDKTRD QAVTRAKGDI DPATGKTLPA GISCRGPKQY QARKQVDGKR YRKTFSTAAL ARRWLNETAA KVELGQFKDT SALDKQTIGD LVDRYAKECM DGRGADLTGH IPAILRDKDL PGVRLSKFCL ADVRGFRDRM MSADYSPATV VKRLNLLASV IQHAISEWDT SIVNYASGRF VKRPEGADKK RNRRLDEDKD KDGKTEFDRL IVAVSDSVYP DDVWLVRWSI EQGTRRGEAI GLRWCDVDIE RSLIKLGGES GKTKTHKTQE EQGPEIRPLT PGARRLLLEK RDTYETPPEP GDSVFSVGKE STFSMRYGRM VKRTGLHNLT FHDLRHEATS RLARLLPNPL DLKRVTGHRD LKSLDRYYQP VPESISKQIE EAERLAGIIA AEEGDDDE
|
| |