Gene Information |
Locus tag | Gdia_2948 |
Symbol | |
ID | 6976382 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3224672 |
End bp | 3227980 |
Gene Length | 3309 bp |
Protein Length | 1102 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 643392457 |
Product | hypothetical protein |
Protein accession | YP_002277294 |
Protein GI | 209545065 |
COG category | |
COG ID | |
TIGRFAM ID | |
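The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the assumption reproduces the reported gene length) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3224672
end_bp = 3227980
protein_length_aa = 1102

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A non-spliced CDS is a whole number of codons; the last codon is
# assumed to be the stop codon, which is not part of the protein.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp)       # 3309
print(remainder)            # 0
print(expected_protein_aa)  # 1102
```

Under these assumptions the computed values agree with the Gene Length (3309 bp) and Protein Length (1102 aa) fields of the record.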
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 39 |
Fosmid unclonability p-value | 0.323193 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACGCCCG GTCAAGACCC GTCTGCCCCC GATCGGCCCC GGCTTCGGCG TCCATACCGG CGCATCCTGC TGTCGCTCGC GGCCGGCTGT GCCGCCCTGC CGGTCATCGC CGTCGCCGGA CTGGCGGTGC GGCTTTCGCT GGGGCCGATG GACGTCACCC TGCCGGCCCG GCTGGTCCTG CCGGTTCCGG TCCTGTCGGG CGGCCATGGC CGCCCGGCGG CGCTGCGGCT GGACATCGGG CAGGTGCGCG TCGGCTGGGA CGTGCTGCGT GGCGGGCTGA CGGCGCCGCT GGATGTGCGG GCGGCCGATA TGCGCCTGGT GCGGCCCGAC GGCGTGGTGG CGGACCGGAT CGGCACGGCG CGGATGGTGC TGGCCGGCGC GCCGCTGCTG CATGGCCGCG TGACGCCCCG GGTGCTGGAA ATCTCGGCGG TGCGGCTGGC GCTGCGCCGG GCGGCGAACG GTTCGGTCGG GCTGGATACC GGGCCGTTCC CGGTCGTCGA TAACGGCGCC GCGCCGCTGC CGGTCGATCC CGCGCAGCTT GACCGGCTGG TGGTGGGGGA CGCGGCCGTG ACCCTGCGCG ACGTGCCCAC GGGCGGGACG TGGCAGGGGC AGGACCTGGC GGCCGACCTG CGGGCCGCGC GGCAGGGCGG CGCGTTCGGC GGCACCGTGG GGCTGGTCGG CCGGGCGGGC GTGACCCTGT CCGGGCCCGG CGTGCATGCC ATGCTGCGGG CCGAGGGACA GCGCGCGGAC GCTGCCGTGT CATGGCATGT ATCCGGCACG AAGGTGGCGA TGCAGGCGCT GTCGCCCTTC GCGCCGGCGC TGGCGCGTGT CGCCCTGCCG GTCGGGCTGG ACGGCACGGT GCGCCTGGTC GCCGACCGGT TGGGCCTGCT GGCGCGCCCC GATACCGTCG CCCTGCGGGT GCAGGCCGGG GCGGGGCGCA TCGCCACCGG GCGCGGGGGC GTGGTGGTGC TGGCCGGCGC GCAGGCGGAC CTCTCGGCCC GGTTCGGTCC CGCGGCCGAT GGCGGGGACG TGCCGGGCGC GGTCCATGTG CGGCTGGACG GGCTTCAGGC CAGCCTGCTG CCCTCGGATC ACCCGGATGC GCCCCCGGAT AACCTGGGTC CCGGCGGCCC GGTGCTGCGG GCCAGCGGCG CGCTGGACAT CGACAGCCTG GCGCGGCCGG GCCGGATCGG CGTGACGCTG GCCGGCGATA TTCCGCTGCT GGATTTCGCG ACCCTGGGCG CCTACTGGCC GGCCGGGGCG GCCCGGGGGG CCAGGACCTG GGTCACGCGA AACATCACGG CCGGCATGGC GCACGACCTG CACGTCACGG CGGGGCTGGC CAGCACGGCG GGCTGGGGGG CGATGCGCCT GCTGACGCTG GGCGGCGGGG TCGCGGGGTC GGACCTGGAC CTGCACTGGC TGCGACCGAT CCCGCCGATC CGGGGCGTGG ACGCCACCCT GACCTTCGAC GGGCCGGATG CGCTGTCCAT CGCCTTTACC CACGGGGTCC AGACGGTGGA CCGCACCGGG CGCAATGTCG ACGCGACCGG CACCGGGCGG ATCGCGGTCG GCGACGGGCA CATGCGCATC ACCGGCCTGA TGGTGAAGGA CCAGATGGGC GACATCTCCA CCACGCTGCA CGGCAACGTG CGGGACGTGC TGGCGCTGCT GGCCGAACCC CGGCTGAACC TGCTGTCGCG CCATCCGCTG TCGTTCAGCC ATCCGTCGGG CCGGGCCGAC CTCGCGTTGC ACCTGGTCCT GCCGCTGAAC GCCCATGTCC GGGTGGACGA ACTGCATCTG 
GACGGCCATG CCGACCTGGC CCAGGTCCAT CTGGGCAACG TGGTGCTGGG CCGCGCGCTG GAAGGCGGGC GGCTGGCCAT CGACGCCACC ACCGACGGGC TGGGCCTGCG GGGGACCGGC GTGCTGGGCG GCGTGCCCTC CACACTGCGC TACGACATGG ATTTCCGCAG CGATCCCGGC ATCCGGGTGC GCGAGACGGC GCAGCTCCGC GCCCATGTGA CGCCCGAGGT CGCGGAGCGC GCCGGATTCG CGGTCGCCCA GCGCTTCAGC GGCGCCGCCG ACCTGGATGT CGGCTATGAC CGCTATGCGG ACGGGACCGG ACAGGTCCGG CTGGACCTGG ACCTGGACGA CGCGGCGCTG ACCATCCCGG TCTGGAGCAA GGCGCGCGGC CAGGCGGCGC GGGCATCGGC GCGGATCGGG CTGGCGGACG GGCGCCTGGC CTCGGTCGAG GCCATTCACG CCGCCGGCCC CGACCTGCTG ATCGACGGCC GGGCGAACGT GGCGGGCGGC ACGGCGCGCA ACCTGATCCT GCGCGGGTTC CGGGTCGGCC GATCGCGGGG CGACGCCACG ATCGGCGTGC CCGCGGGGCC GCGCGATGCG GTGCGGGTGG CGATCGACGC CCCGGTGCTG GACCTCTCGC CGCTGCTGGC GCCCGATCCG GCAGCCGATC ACGGCACCGA CCCCGTCCCG CAGGGACGGG CGGCGGCGGG CTATCACCTG CCCGTGGCCG CGTCGGGCCG CGTCCATGGG CCGCCGGGGC GAAGCTGGCT GATCGATGCC AGCGTCCGCA CCCTGTTATA TGCCAAGCAG GCCGCCCTGA CCGGCGTGCA CGCGCATCTG GAGGATAACG GCGTCCGCCT GACGCGGATG CGCTTCGCCA TGGCCGGCCC GTCGCCCGCC TCCGCGATCC TGACGCCCGA GTCCGACGGG CGGCATCTGT GGGCCAGCGT CCAGGATCTG GGCCTGATGC TGCGGGGGCT GGATGTCACG ACGCAGTTCG AAGGCGGCCG GACGGTGCTG CAGGGCGTCT TCGACGACCG CCAGCCCAGC GCGCCCTTCG CCGGGGTGCT GACGATCGAC CCGATGACGC TGCACAAGGC CCCGGGGGCG GTGCGGCTGG CCAACGACGC CTCGATCTAT GGCTGGATGC AGGCGCCAAA GGGGCCGGAT TTCCTGATCC AGCGCGTGTC CCTGCCGCTG ACCTTTCGGG ACGGCACGCT GCACATCCAT GACGGGGTGC TCAACAATGC CTCGCTGGGC GTGACGCTGG AAGGGCCGCT GGACCTGGAT CACGGACGGA TGGACCTGCG CGGCACGATC GTGCCGGCCT TCGCGGTGAA CACCATTCCC GGCCACATGC CCGGGGTCGG CCGGCTGATG AGCCCGGAAA AGGGCGGCGG CCTGCTGGCC GCGACCTTCG TCGTCAGCGG GGCGATGAAT GCCCCGGCGC TGAAGGTCAA TCCGTTCTCG ATCTTCCTGC CGGGCGTGCT GCGGCGGCTG GTGCAGTAG
Protein sequence | MTPGQDPSAP DRPRLRRPYR RILLSLAAGC AALPVIAVAG LAVRLSLGPM DVTLPARLVL PVPVLSGGHG RPAALRLDIG QVRVGWDVLR GGLTAPLDVR AADMRLVRPD GVVADRIGTA RMVLAGAPLL HGRVTPRVLE ISAVRLALRR AANGSVGLDT GPFPVVDNGA APLPVDPAQL DRLVVGDAAV TLRDVPTGGT WQGQDLAADL RAARQGGAFG GTVGLVGRAG VTLSGPGVHA MLRAEGQRAD AAVSWHVSGT KVAMQALSPF APALARVALP VGLDGTVRLV ADRLGLLARP DTVALRVQAG AGRIATGRGG VVVLAGAQAD LSARFGPAAD GGDVPGAVHV RLDGLQASLL PSDHPDAPPD NLGPGGPVLR ASGALDIDSL ARPGRIGVTL AGDIPLLDFA TLGAYWPAGA ARGARTWVTR NITAGMAHDL HVTAGLASTA GWGAMRLLTL GGGVAGSDLD LHWLRPIPPI RGVDATLTFD GPDALSIAFT HGVQTVDRTG RNVDATGTGR IAVGDGHMRI TGLMVKDQMG DISTTLHGNV RDVLALLAEP RLNLLSRHPL SFSHPSGRAD LALHLVLPLN AHVRVDELHL DGHADLAQVH LGNVVLGRAL EGGRLAIDAT TDGLGLRGTG VLGGVPSTLR YDMDFRSDPG IRVRETAQLR AHVTPEVAER AGFAVAQRFS GAADLDVGYD RYADGTGQVR LDLDLDDAAL TIPVWSKARG QAARASARIG LADGRLASVE AIHAAGPDLL IDGRANVAGG TARNLILRGF RVGRSRGDAT IGVPAGPRDA VRVAIDAPVL DLSPLLAPDP AADHGTDPVP QGRAAAGYHL PVAASGRVHG PPGRSWLIDA SVRTLLYAKQ AALTGVHAHL EDNGVRLTRM RFAMAGPSPA SAILTPESDG RHLWASVQDL GLMLRGLDVT TQFEGGRTVL QGVFDDRQPS APFAGVLTID PMTLHKAPGA VRLANDASIY GWMQAPKGPD FLIQRVSLPL TFRDGTLHIH DGVLNNASLG VTLEGPLDLD HGRMDLRGTI VPAFAVNTIP GHMPGVGRLM SPEKGGGLLA ATFVVSGAMN APALKVNPFS IFLPGVLRRL VQ
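The relationship between the gene sequence and the protein sequence can be illustrated by translating a short stretch of the gene. The sketch below is a minimal example, not the tool used to produce this record. It builds the codon table from the amino-acid string in NCBI order, which is the same for the standard code and for translation table 11 (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper name `translate` and the two input snippets are chosen for illustration; the snippets are the first three 10-base blocks and the last three codons of the gene sequence above.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Translation table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    # The record prints the sequence in 10-base blocks, so drop whitespace first.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_blocks = "ATGACGCCCG GTCAAGACCC GTCTGCCCCC"  # start of the gene sequence
last_codons = "GTG CAG TAG"                        # end of the gene sequence

print(translate(first_blocks))  # MTPGQDPSAP
print(translate(last_codons))   # VQ
```

The two outputs match the first ten and the last two residues of the protein sequence in the record; the final TAG codon is a stop and contributes no residue.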