Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1518 |
Symbol | |
ID | 6974928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1694348 |
End bp | 1696666 |
Gene Length | 2319 bp |
Protein Length | 772 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643391049 |
Product | general secretion pathway protein D |
Protein accession | YP_002275912 |
Protein GI | 209543683 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1450] Type II secretory pathway, component PulD |
TIGRFAM ID | [TIGR02517] general secretion pathway protein D |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.22323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.534897 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGTG ACACGACTTC CCGTCGCGGC ATGACCTCCC GCCGTGCGCA GACCGCCTGG ACCGGCATCG CCTTGTGTTT TTCGATCGCG GGATGCAGCC CCGACAAGCC GCCGCATGTC TCGCCGCTGG GCGGTCCCCT GCCGCTGGGC GGCACGGCGC CCGACCTGAT GGGTGGCCGG ATCGAGGGCA CGGACCAGGT CGGGCCGGCG CAATACAGCT ACGGCCGCGG CGCGCTGCCG CTGATCCGGG GCCAGGGCCG CCCGTACCAG GGCGGGGGCG ACATCGCGCT GAATTTCGCC GATACCGATG TCCGTGACGC CGTCTCGCAG ATCATGACGG ACATCCTGCA CGTGAATTAC GCGATCGACC CGGCGGTGCA CGGGCGCGTG ACGCTGCATA CGTCCCAGCC CCTGACCGCC GGGCAGCTTC TGCCCGCCCT GCAGATCATC CTGGCCCAGG TGCACGCCGT CCTGATCCAG GCCGACGGGC TGTACCGCAT CGTGCCGGCC CAGGCCGATT CGGCAAATCC CCAGTCCAGC GCGGCAGGGA TCGCGGAAAA CGCGTCGCTG GGCGGCAGCG TCATGGTGCC GCTGCGCTAC GCCGATGCCG CAAACCTGGC CAAGGCACTG CAACCGTTCC TGCAGGGCGG CGCCCGCGTC ACTCCGGTGG ACAGCGCGAA CGCCGTGATC GTCAGCGGCG AGCCGTCCGC GCGCAATACG CTGGTCGATG TCATCCATGC CTTCGACGTG GACTGGCTCA GCAGCCAGTC CTATGCCCTC CTGCCGGTGG AATCGGGAAA TGCCAGGGAT ATGGCAACGG CGTTGCAAAG TGCCCTGCAT GGCGGCAACC TGGTGCAGGT CCTGCCGCTG GCGCGGGTCA ATGCCGTGCT GGTCGTCGCG CGCAGCGCGC GGCAGATCGA CGATGCACGG CGCCTCTTTG CGCTGATCGA ACGCAATCGG CGCGCGACCA TGCGGTCGTG GCATGTGTTC TACGTGCAGA ATTCCAGCGT GAACGACGTG ACCTATACAT TGCAGCAGGC CTTCACGCCG GGCGACGTGA CGGCCACGCC GCCCGAAACC ACGTCCGGCA CGGCCTCGCA ACTGGGCCAG TCCGGCTTCA CCGGCACCAT GGCCAACGCC ATGGGCGGCG GCGGGCTGGG CGGCAGCGCG GGCACGGGGA ACACCACGGG CCTGATGGGC GGCAGCCAGC AGGGCGGGGT CCAGGCCACC GCCGGCCAGG CACCGACCGG CCAGGCGGGC GGGGGGCCGG CGGCATTCGC CAATCCCCTG CTGGGCGGCC TGGACAACAC TGCCAGCGCC GAAGGCCGCC CGCAGGCGAT GCGGATCATT TCCAATTCCC AGCATAACGC GATCCTGGTC TACGGGACGG ACCAGGAAAG CGACACGGTC GAACAGATGC TGCGCCGGAT CGACGTCATG CCGCTGCAGG TCCGCATCGA TGCCGTCATC GCCGAGGTGC AGCTGAACGA CGCGCTGCAG TACGGCACGC AGTTCTTCTT CAAATCCGGC GGCATCAACG GCGTGCTCAG CACCAACAGC CAGACGATCA CCACCGGCAC CCTGGCCACC GCCGCCTTCA GCCATACGCT GCCCGGCTTC ATCATCGGGG GCGCCAGCGG CGGCGGCGCG CCCTTCGCCA TCGACGCGCT GCAGAACGTC ACCACCGTCC ACGTCCTGTC CTCGCCGCAA CTGATGGTCC AGGACAACCG GGCGGCCCGG CTGCAGGTCG GGCAACTGGT GCCGGTCCAG ATCGGCTCGC AATCCAGCAC CATCGGCACG TCGATCTACA ACCAGTTCAC CTATCAGCCG ACCGGCGTGA TCATGCAGGT GACGCCCCAT GTCAGCGAGG GCGGGCTGGT CACGCTGGAC GTGTCGCAGG AGGTCAGTTC CGTCAGCCCG ACCGCCGCCA GCACGGCCAA CGCGGCGTCC AATCCCACCT TCAACGACCG TTCGGTCAGT TCCCGCGTCG TGGTCCAGGA CGGGCAGACG GTGGGGCTGG CCGGCCTGAT CACCGACAGT TCCAGCCGCA TCAACAGCGG CATCCCGTGG CTGAAGAACA TCCCGGTCCT GGGCGTCCTG GCAGGGAACC AGAACAACAA CCGCCAGCGG ACGGAATTGC TGATCCTGAT GACGCCGCAC GTCATCCATG ACCAGCGCGA CGCGGTCGAC CTGATGGAGG ATCTGCGCGA CACCCATCCC AACGCGGCGA ACGTCCCCGA CGAATTGCGG GTCATGCGCA TGACCGGCAG CGCCGACCCC CAGCAGCGCC TGCGCGAAAA GGTCGGGCTG GGACCGTGA
|
Protein sequence | MNSDTTSRRG MTSRRAQTAW TGIALCFSIA GCSPDKPPHV SPLGGPLPLG GTAPDLMGGR IEGTDQVGPA QYSYGRGALP LIRGQGRPYQ GGGDIALNFA DTDVRDAVSQ IMTDILHVNY AIDPAVHGRV TLHTSQPLTA GQLLPALQII LAQVHAVLIQ ADGLYRIVPA QADSANPQSS AAGIAENASL GGSVMVPLRY ADAANLAKAL QPFLQGGARV TPVDSANAVI VSGEPSARNT LVDVIHAFDV DWLSSQSYAL LPVESGNARD MATALQSALH GGNLVQVLPL ARVNAVLVVA RSARQIDDAR RLFALIERNR RATMRSWHVF YVQNSSVNDV TYTLQQAFTP GDVTATPPET TSGTASQLGQ SGFTGTMANA MGGGGLGGSA GTGNTTGLMG GSQQGGVQAT AGQAPTGQAG GGPAAFANPL LGGLDNTASA EGRPQAMRII SNSQHNAILV YGTDQESDTV EQMLRRIDVM PLQVRIDAVI AEVQLNDALQ YGTQFFFKSG GINGVLSTNS QTITTGTLAT AAFSHTLPGF IIGGASGGGA PFAIDALQNV TTVHVLSSPQ LMVQDNRAAR LQVGQLVPVQ IGSQSSTIGT SIYNQFTYQP TGVIMQVTPH VSEGGLVTLD VSQEVSSVSP TAASTANAAS NPTFNDRSVS SRVVVQDGQT VGLAGLITDS SSRINSGIPW LKNIPVLGVL AGNQNNNRQR TELLILMTPH VIHDQRDAVD LMEDLRDTHP NAANVPDELR VMRMTGSADP QQRLREKVGL GP
|
| |