Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_0790 |
Symbol | |
ID | 6974187 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 902564 |
End bp | 903889 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643390319 |
Product | hypothetical protein |
Protein accession | YP_002275195 |
Protein GI | 209542966 |
COG category | [S] Function unknown |
COG ID | [COG5338] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.843096 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.984184 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCTCCT CCCGCCGTCC GGCGGTACTG GCCACGCTGG CGATGGCGAC GATGGCGGCA GGAAGCCATC CGGCCCGGGC CCAGTTGATC GCGCAATATT TCCCATCGGA CCTGCCGGGC TACGCATCGT CCGACCAGGC CGATTCCGTC GTCATGCGCC AGTTGCTGGG CCAGCAGCCG ACGGGAATTC CACTGGGCAG CTTCATCGTG CGTCCATCCG CCGCGCTCAA CGCCGGGTAC AACACCAATA CGCTGGCCAC CCCGCATACC CAGAGCGCGG AATTCGAGAT GGATGGCGGC CTGAAGATCA ATTCGAACTG GTCCCGCCAC GCGCTGGGCA TCTTCGCCAA CGTCAGCGAC CGCCGCTTTC CCCAGATCCC GGTCGCCAAC TACACAAACT GGGCGACCGG GGCGGGCGGG TTGCTGAACC TGGGAAACGA CACGCTGTCG CTGGGATATA CGCATTACCT GATGCACCTG TCGTCGATGG ACCTGGGCAA TTTCGGCGTA TCGCGCCCGG TGCCCTACGC CGCCGACGAT GTGCGCCTGA CCTATACCAA GCTGTTCGGC CGATTCAGCC TGATTCCGTC CGCGATCTTC GAGAACTATT CATTCGGCCG GGCCACGGGC GGGGGATTCG ATACCAACTA CGACAGCCTG AGCCATCGGC TGGAGACCGG ATCGCTGACC TCGCGCTTCG AAATATCCAC CGGCAATGCG GCGGTCATGA TCCTGCGCGG ATCGACGGCG CAATTCGTCA CGACCCCGGG CGGCACCCCC AACGACTACG TCGATGGCGC GGCCTTCGCG GGCCTGGACC TGAATACGGA TTCGCCCGTC CGCTACCGGC TGCTGGCGGG GGGCGAGACG CGCCATTTCA CCCGCAGCGC GGCACCGTCC GTCACCACCC CCACCTTCGA GATCGATACG ATCTGGACGC CGACGCGGCT GGATACGGTC AGTGTCTTCT TCTATCGCCG GATCCTGGAC CCTGCCTCGC CCTTCGCGCG CAACCAGACG GTGACGGACG GGCGCATCCA GGTCGATCAC GAATTGCGGC GGAATATCTT CCTGCGCGGC TATGCCGAGG GCGGCATGAG CCAGACCGGG CAGACCGTCC AGGGAACCCG GTCCCGCACG CAGACCCTGT TCAAGTTCGG CGTTTCCGCG AACTGGAAGG TCAACCGCAC GGTCACGGCC AGCCTGTCCT ACAGCCACGT CAACGACTAC GCCCACGGGG GTGCTGCGCC CGACCCGCTC TTTGCCGACC AGAGATCGAC TTTCGCCAGC AATTATATCG CACTCGGGGT CAGCTTCGCC GAATGA
|
Protein sequence | MRSSRRPAVL ATLAMATMAA GSHPARAQLI AQYFPSDLPG YASSDQADSV VMRQLLGQQP TGIPLGSFIV RPSAALNAGY NTNTLATPHT QSAEFEMDGG LKINSNWSRH ALGIFANVSD RRFPQIPVAN YTNWATGAGG LLNLGNDTLS LGYTHYLMHL SSMDLGNFGV SRPVPYAADD VRLTYTKLFG RFSLIPSAIF ENYSFGRATG GGFDTNYDSL SHRLETGSLT SRFEISTGNA AVMILRGSTA QFVTTPGGTP NDYVDGAAFA GLDLNTDSPV RYRLLAGGET RHFTRSAAPS VTTPTFEIDT IWTPTRLDTV SVFFYRRILD PASPFARNQT VTDGRIQVDH ELRRNIFLRG YAEGGMSQTG QTVQGTRSRT QTLFKFGVSA NWKVNRTVTA SLSYSHVNDY AHGGAAPDPL FADQRSTFAS NYIALGVSFA E
|
| |