Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_3331 |
Symbol | |
ID | 6976774 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3642566 |
End bp | 3643816 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643392845 |
Product | NADH dehydrogenase subunit D |
Protein accession | YP_002277673 |
Protein GI | 209545444 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | [TIGR01962] NADH dehydrogenase I, D subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.150101 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.101451 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGACA TTGTCCTTCA TGAGGATATT CCGGAATCCC TGCTGCCCGG TGGCGAGGCC GCGGCGGCCG CGACCCACAC GGTGGAAATC GATTCCCACG CCCTGAATTT CGGCCCGCAG CATCCGTCCG CCCATGGTGT GCTGCGCCTG GTCCTGGAAA TGGAGGGCGA GGTCGTCGCC CGCGCCATTC CGCATATCGG CCTGCTGCAT CGCGGCACCG AAAAGCTGAT CGAATACAAG ACCTATCCCA AGGCCCTGCC GTATTTCGAC CGGCTCGATT ACGTCTCGCC GATGTGCGAG GAGCAGGCTT TCGCGCTGGC GACCGAAAAG CTGCTGGGGA TCGACATTCC CGATCGCGCG AAATGGATTC GCGTGATGTT CGCGGAAATC ACCCGGATCC TGAACCATAT CCTGAACCTG ACGGCGCTCG GGCTCGATTG CGGCGCGGTG ACCCCGGCGC TGTGGGGCTA CGAGGAACGC GAAAAGCTGA TCGAGTTCTA CGAGGCCGCG TCGGGCGCCC GGTTTCATGC CAATTACTTC CGTCCCGGCG GGGTCTCGCG TGACCTTCCG GCGGGGCTGG AGGATCGGAT CGCCGAATGG GCGCGCCAGT TCCCGGCCTG GATCGACGAT CTGGAATCGC TTCTGACCAA CAACCGGATC TGGAAGCAGC GCACGGTCGG GATCGGCATC TTCACGACCG AGCAGGCGCT GGCCTGGGGC TTCAGCGGTC CGTGCCTGCG CGCCTCGGGC GTGCCGTGGG ACCTGCGCCG CGCCCAGCCC TATGACAATT ACGACAAGGT CGAGTTCAAC ATCCCCGTCG CGCGCCAGGG CGATTGCTAC GACCGCTACC TGATCCGCGT CGCGGAAATG CGCGAGAGCG TGCGGATCGT CGAACAGTGC CTGGCCCAGA TGAAGCCCGG CCCGATCAAG ATCCAGGACC ACAAGATCAC GCCGCCGCCC CGGCGCGAGA TGAAGCGGTC GATGGAAGCC CTGATCCATC ATTTCAAGCT GTTCACGGAA GGGTACCACG TCCCGCCGGG GGCAACCTAT ACGGCGGTCG AAAGCCCCAA GGGCGAATTC GGGGTCTATC TGGTCGCGGA TGGCAGCAAC CGGCCCTACC GGTGCAAGAT CCGGCCGACC GGCTTCGCCC ATCTGCAGGC CATCGACGAG ATGTCGCGCC GCCACATGCT GGCCGACGCG GTGGCGATCA TCGGGTCGCT GGACCTGGTG TTCGGCGAGA TTGACAGGTG A
|
Protein sequence | MSDIVLHEDI PESLLPGGEA AAAATHTVEI DSHALNFGPQ HPSAHGVLRL VLEMEGEVVA RAIPHIGLLH RGTEKLIEYK TYPKALPYFD RLDYVSPMCE EQAFALATEK LLGIDIPDRA KWIRVMFAEI TRILNHILNL TALGLDCGAV TPALWGYEER EKLIEFYEAA SGARFHANYF RPGGVSRDLP AGLEDRIAEW ARQFPAWIDD LESLLTNNRI WKQRTVGIGI FTTEQALAWG FSGPCLRASG VPWDLRRAQP YDNYDKVEFN IPVARQGDCY DRYLIRVAEM RESVRIVEQC LAQMKPGPIK IQDHKITPPP RREMKRSMEA LIHHFKLFTE GYHVPPGATY TAVESPKGEF GVYLVADGSN RPYRCKIRPT GFAHLQAIDE MSRRHMLADA VAIIGSLDLV FGEIDR
|
| |