Gene Information |
Locus tag | Gdia_3378 |
Symbol | |
ID | 6976824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3700914 |
End bp | 3703805 |
Gene Length | 2892 bp |
Protein Length | 963 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 643392894 |
Product | hypothetical protein |
Protein accession | YP_002277719 |
Protein GI | 209545490 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02302] conserved hypothetical protein TIGR02302 |
|
|
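The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3700914, 3703805)
print(length)                     # 2892
print(protein_length_aa(length))  # 963
```

Both values match the Gene Length and Protein Length fields of the record.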
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.344689 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.143577 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGGCG GGACCGCTCC CCCCCAGGAT GTCGCGGATA CGGGGGCGGC GCTGTCGTCG TTTCCGGTCC GGCTGGCGGC CGCGCGGTAT CGCGCGCGGC AGGTCCTGTG GGTCGAGGGC GCCTGGCCGG TCCTGCTGCC GGTGCTGGGC GGACTTGCCG CCTATCTGAT CGCGGGCCTG CTGGGCCTGC CGCAGGACCT GCCCGATCTG GCGCATGCGG CGCTTCTGCT GGCGCTGGCC GGTGGCGCGG CCGCGTGGAT CGTCTGGCGC GGGCGCCGCG TGACCGCGCC CACCCCCGCC GGAGTGGACC GGCGGATCGA ACGCGCGTCG GGTCTGGCGC ACCGCCCGTT GCAGACCCTG GGCGACCACC CCGCCGGTGC CGATGCCGCT CCCCATGATG CCGCCGCCCG GGTCGAACGG GTCGCCCTGT GGGACGCGCA TCTGCGGCGG ACGGGGCGGG CGATCGGCCG GCTGCGGGCC GGCTGGCCGC GCCTGTCGCT GGCGGCGCAT GATCCGTGGC GCGTGGGCTA TGTGCTGCTG CCGGGATTGC TGGCCGCACT GCTGTGGGCG GGCGGCGACG CCCGCGGCCG GCTGGAAGCC GCCTTCTGGC CTGGCCTGGA CGATCCGGGG GCGCCGCGCC CGCATATCCA GGCCTGGATC ACGCCGCCAT CCTACGCCCC CGGGGCCCCG GTCTTCCTGG ATGATCGCAC GGGCCAGGCC ACCGTGCCCC AGGGGGCGGT GCTGAGCATC AGCGTGACCG ACCTGCGCGG CCGTCCGTCG CTGCGCGTCG CCGCCACCGG ATCGGCATCG GGCCCGGCAG TGGGGCCCGA CCGGTTTCGC GCCCTGGGGG CGGAAAGCTG GTCGGCGGAC GTTCCGCTGC TGGCCAGCGC GTCGCTGACC CTGCGCGGAC GGGGGCGGTC GTTCAGTCGG TGGACCGTGA CTGTCCTGCC CGATGCGGCG CCGACGGTCG CATGGGGGGC GGGTGCCGGC GCCGGACGGG GCGAATGGCG CACGCGGCTG CCCTATGCCG CGCGGCAGGC CTACGGCATC ACGTCGCTGC GGGCCGAGCT GCGTCTGGCC GGCGGCGAAA AGGGGGGAGC CCCGCGCGTG CTGACCGTGC CGATTCCGAT CGACGGCCAC CCGAAGGACG TCACGGGCAT CGCCATGCCC GACCTGTCCG CCGACCCGTG GGCGGGCGAG GAGGTCGTGG GCCGGCTGGT GGCAACCAGC GCCAGCGGCC ATGAGGGCGT CAGCCCCGAA GCCCGGTTCC ACCTGGGCGC GCGCCTGTTC CGCAGCCCGA TGGCCAAGGC GGTGCTGGAT GTCCGGCGAC GGGTCGCGAC GGGCCGCGAG CGCCGCACCG CCGCCGCGAG CGACCTGATG GCGCTGGGCG AAACGCCCGA TCCGTTCCAG AACGACGCCG GCCTGCTGCT GAACCTGACC AGCGCCGCCG CCCTGCTGGA AAGCCCGGAT GTCGATCCGC ACGCGGCGGT GGACCAGGCG GTGGCGCGGC TGTGGTACCT GGCGCTGGAG ATCGAGGACG GCCGGCAGGG CGGAAGCGCC GCGGCGCGAG CGGCCCTGGA TGTCCGCGCG GCGCAGGACG CCGTGGCGGC GCAGTTGAAC CGCATGCGCG CGCTGGGGGC GCAGGGCCAG TCGCCCGAGG AACAGGCCGA ACTGCAGCGC CGGATGGAGA CGCTGCGCCA GGCGATCATG CGCCGGATGC AGGCGCTGGC GCAGCAGGCC GTGCAGTCGC ACACCGCCAT ACCCGACCTG CAGGGCCTGA CCCGCAACGG CGACCAGGCC 
CTGTCGCGCA TGATGCAGCA GATGCAGGAC GCCGCCCGCA ACGGCCGCTC GGCCGAGGCG ATGCAGGCGT TGCAGCGCAT GGAAGACATG CTGGAGCACA TGCGCTCCGC CACGCCGCAG GACCTGGCCG ACATGGCGCG CCAGATGCAG GCCCGCCAGC AGGCCAACGA ACAGCGCGAC GCGCTGCAGG ACCTGATCCG CCGGCAATCC GGCCTGCTGG ACCACAGCCA GTCTCGCCTG GACCGCGTCC GCCACGCCCA GGAACGCGCC GAGGCCGCGC GCCGCGCCGC CCAGGGCGAG ATGCCGGGAC AGGGAATGGA TGGCGACCTG GCCAGCATGC CCACCGCCGA ACTGCTGCGC CGGCTGGGGC TGCGCCCCCC GCCGGACATG CAGGGGCCGC CGGCCGACGA GCCGCAGCCC GGAGACGCCG GGCCGGCGCA GGGGCCCGAC GCGCCGCCGT CGGGGGATGC GTCCGCACCC GGCGCGCCCA ATCCGCCGAA CGAGGAGGTC CGGCACGCCG ACCGCGCCGT GCAGCACGCG CTGGGGCGGG CGCTGGACGA ACTGGGGCAG GAATTCAAGG GCCTGACCGG AAAGGACGCG CCGTCCGGCT TCGCCGATGC CGGGGGCGCG ATGAAGGACG CGCGCGCCGC CCTGGCCCAG GGCAACGACA CCGCCGCCGC CGAGGCCCAG CGCAAGGCCC TGGCCGACCT GCAGAAGGGC GACCAGCAGA TGCGCCAGGC GATGAAGGGC TCGGGCAAGG GCGGCGCGAC CAGCTTCCTG CCCGGCTTCG CCAGCGGATC GGGCGAGGGC GGCCAGGGCG AACCGGGCGA TAGCGGCGAC AGTGCCCAGG GAAGTGATCA GGGAAGTGAC CAGGGAGGCG ACCAGGCGGA CGACCAGCAC GGCGACCGGG ACCCGCTGGG CCGCCGGACC GGCGAGGGCA AGGACGGGCT GGATTCCGAC ACCCACGTGC CGGATACGAT GTCGCGCGAA CGCGCCCGGG AGATCGAGCA GGAACTCCGC CGCCGCGACT CCGACCGCAC CCGCCCGCGC GAGGAACTGG ATTACCTGGA CCGGCTGCTG AAATCCTTCT GA
|
Protein sequence | MTGGTAPPQD VADTGAALSS FPVRLAAARY RARQVLWVEG AWPVLLPVLG GLAAYLIAGL LGLPQDLPDL AHAALLLALA GGAAAWIVWR GRRVTAPTPA GVDRRIERAS GLAHRPLQTL GDHPAGADAA PHDAAARVER VALWDAHLRR TGRAIGRLRA GWPRLSLAAH DPWRVGYVLL PGLLAALLWA GGDARGRLEA AFWPGLDDPG APRPHIQAWI TPPSYAPGAP VFLDDRTGQA TVPQGAVLSI SVTDLRGRPS LRVAATGSAS GPAVGPDRFR ALGAESWSAD VPLLASASLT LRGRGRSFSR WTVTVLPDAA PTVAWGAGAG AGRGEWRTRL PYAARQAYGI TSLRAELRLA GGEKGGAPRV LTVPIPIDGH PKDVTGIAMP DLSADPWAGE EVVGRLVATS ASGHEGVSPE ARFHLGARLF RSPMAKAVLD VRRRVATGRE RRTAAASDLM ALGETPDPFQ NDAGLLLNLT SAAALLESPD VDPHAAVDQA VARLWYLALE IEDGRQGGSA AARAALDVRA AQDAVAAQLN RMRALGAQGQ SPEEQAELQR RMETLRQAIM RRMQALAQQA VQSHTAIPDL QGLTRNGDQA LSRMMQQMQD AARNGRSAEA MQALQRMEDM LEHMRSATPQ DLADMARQMQ ARQQANEQRD ALQDLIRRQS GLLDHSQSRL DRVRHAQERA EAARRAAQGE MPGQGMDGDL ASMPTAELLR RLGLRPPPDM QGPPADEPQP GDAGPAQGPD APPSGDASAP GAPNPPNEEV RHADRAVQHA LGRALDELGQ EFKGLTGKDA PSGFADAGGA MKDARAALAQ GNDTAAAEAQ RKALADLQKG DQQMRQAMKG SGKGGATSFL PGFASGSGEG GQGEPGDSGD SAQGSDQGSD QGGDQADDQH GDRDPLGRRT GEGKDGLDSD THVPDTMSRE RAREIEQELR RRDSDRTRPR EELDYLDRLL KSF
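The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that cleanup together with a GC-fraction calculation. The helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input. Running it on the full gene sequence would give a value to compare against the GC content field of the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three ten-base blocks of the gene sequence, as displayed in the record.
first_blocks = "ATGACGGGCG GGACCGCTCC CCCCCAGGAT"
seq = clean_sequence(first_blocks)
print(len(seq))  # 30
print(f"GC fraction of sample: {gc_fraction(seq):.2f}")
```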