Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_2949 |
Symbol | |
ID | 6976383 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3227999 |
End bp | 3229282 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643392458 |
Product | protein of unknown function DUF195 |
Protein accession | YP_002277295 |
Protein GI | 209545066 |
COG category | [S] Function unknown |
COG ID | [COG1322] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.358116 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.173006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGACG CCCTGGGTCC CGGCAGCATC CTGGCCATCG TGGCGCTGGG GGCGGTGGTG GTGCTCGCCG TCCTGCTGCT GCGGCGCGGC ACGGGGGCGG GGGCGGCCGG CGAGGGCGAC CTGCTGGCGC GGCTGTATGT CATGCTGGAA CGCGAAACCG CCGCGCGCAC GGCCGACAGC GAGGCCCAGC GGGCCCGGCT GGCGGAAATC GAGCGCGTAC TGGCCGCGCG GCTGGACCAG TCCCGGGCCG AGACCGCCGA CCGGCTGGCC ACGATCGCGC AGTCCATCAC CCGCGACCTG GGCGATGCGC GCGTCCGCCA GGGCGAGGCA CTGCGCGAGA TGGCCGAAGC CTCCGCCCGG CAACTGGAGA CGATCCGCAC CGCCGTCAAC GAACGGCTGC ACGAGGCCGT GGAACGGCAG ATGCAGACCT CGTTCCAGCG CGTGCTGGAA CAGTTCGCCG CGATGCAGAA GGCGATGGGC GAGGTCACGG CCATGACGGC GCAGATCGGG GACCTGAAGC GCCTGTTTTC CAACGTCAAG ACCCGGGGCG GCTGGGGCGA GGCGCAGTTG CGCGCCATCC TGGACGACGT GCTGCCGGCC GGCGCCTACC AGGCCAATTG CCGCCTGCGC GAAGGCAGCG CGGAGGTCGT GGAATTCGCC GTGCGCATGC CGGTCCGCGC CACGACGCCG CCGGTGCTGG CGATCGATTC CAAATTCCCG ACCGAGGCCT ATGAACGGCT GCTGGACGCG GTGAACCGCG TGGACGCCGA GGCCGAGCGC GCCGCCCGCC GCGCGCTGGA AACCACCCTG CGGATCGAGG CGCGCAAGAT CGCGTCCAAA TATATCGTCC CGCCGGTGAC GGTGGAATTC GCGGTGCTGT ACCTGCCGAC CGACGGGCTG TATGCCGAGG TCGCGCGCCT CCCCGGCCTG CTGGACGAAA TCGGGCGCAC CTGCCGGGTG ATCGTCATGG GCCCCGGCCT GCTGCCGGCC ATGCTGCGGA CGATTCACCT GGGCTACGTC ACCCTGGCGC TGGAGGAGCG GACCGACGGC ATCGCCCGCC TGCTGGGCGC CACCCGGCAG GAAATGCTCA AGATGGACGG GGTGCTGGAA CGCCTGGCCC GCAACGCCTC GGCCATGTCA TCCTCGATCG ACGAGGCCAG GCGGCGCACG CGGGTGGTGG CGCGGCAGCT GCGCGGCCTG GACGGCGTCG AGTCGCTGGT TCCCGAAGGC GCGGGGGATG ACGCGGCGGC CGGGACCGGT GAAACCGAAT TTAATGGTGC ATGA
|
Protein sequence | MSDALGPGSI LAIVALGAVV VLAVLLLRRG TGAGAAGEGD LLARLYVMLE RETAARTADS EAQRARLAEI ERVLAARLDQ SRAETADRLA TIAQSITRDL GDARVRQGEA LREMAEASAR QLETIRTAVN ERLHEAVERQ MQTSFQRVLE QFAAMQKAMG EVTAMTAQIG DLKRLFSNVK TRGGWGEAQL RAILDDVLPA GAYQANCRLR EGSAEVVEFA VRMPVRATTP PVLAIDSKFP TEAYERLLDA VNRVDAEAER AARRALETTL RIEARKIASK YIVPPVTVEF AVLYLPTDGL YAEVARLPGL LDEIGRTCRV IVMGPGLLPA MLRTIHLGYV TLALEERTDG IARLLGATRQ EMLKMDGVLE RLARNASAMS SSIDEARRRT RVVARQLRGL DGVESLVPEG AGDDAAAGTG ETEFNGA
|
| |