Gene Information |
Locus tag | Gdia_0990 |
Symbol | |
ID | 6974387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1111324 |
End bp | 1113399 |
Gene Length | 2076 bp |
Protein Length | 691 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643390512 |
Product | hypothetical protein |
Protein accession | YP_002275388 |
Protein GI | 209543159 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
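The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 1111324
END_BP = 1113399


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (2076 bp) and Protein Length (691 aa) fields.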
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00334394 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGATA TCGCAACAGG CTGGATGGCG CGGCTAGGGT CGGCAATCAT TCCCGATACA AGCAAGGCGG TCGGATGGAT CGTCGACAGC CAAGTCGACC AAGCGTTTGT GGAGCCAACA CTCAATATCT TGCAGGGTGA TCCTGCCACC GCCTGTATCT TGATCATCGC AGCGCCCGGC GCGGTGGGCA AGAGCACCTA TGCCCGCTCG ATCGGCGCCC GCGCGAATGC CGTACTCGTC GATCTCGCTC AGACCGAACC GCTGGGTGGC AACTTTTTCG TGGGCGGTAT CGCCAATGCG TTCGGCTACG AGGCGTTGGG CGACCTCGCC AACGGCCGGA TTGGCCTTGT CGTTGATGCC CTCGACGAGG CTCAGTTGCG CTCGGGCAGC GAGGGCTTTT CTGCGGGGCT GCTGGATCTG TGCACGATCG TTCGAAATTC CTCGGCCCTA CCGGCCACCC TTCTCGGCCG CGCTGCAGCG GCGGAAGAGG CGTGGTTAAT TTTGAGCGAA GCCGGACTCA ATCCGTGCCT ACTGCAGATA GAATATTTTG ACGAAGACCA GTCCAAGCAG TTCATCAGAC GCCGGCTGCC AGTGATCGCG GACGCCCGCG ACGCGATGCG GAACGCTTTC CGCAAACATG AAGAGAAGTT CGTCGAATTT GCACTCGCAA CCCGCGAGAA ACTGAAGAGC ACGCCAAGCG GCAACGAACA GCGATTCGCA GGATATGCGC CCGTTCTGGA TGCCATCTGC ACTTATGCGC TCGGCGATGA CGGCTTGAAC CCGAGCACGC GACTATCGAA CCTCGCCGCC GAGAGCGCAA TCAACCTCAT TGTCCAGATC GCGACCAGCA TCCTGCAGCG CGAGCAGACC AAACTGACCT CGCAGATCGA ATCTCCGCCT GCGGGCATCG ACCTGTCATC ACTGTACACT CCGGACGAGC AACTTGGCAG AATCGCCGCA ATCTTGCTTG ACTCTGATGC GCCTGCGGGC ATCTCGATCT CCAATGCTGC TTTCAGGCAG GCTTATGAAG GCATGGTCGC CGAGTTCACG CCGCAGCACC CGTTTCTCGA TCCACGCGGC GGTCCCTCAA ACGCGGCGTT CGCCGCTTAT CTGCTCGTCT GGGCCATCAC CACGGGAAAC GCAAAGCGGG ATGCCCGCCG CATGCTGGCA TTGAATCCGA CGTTTGGGTC CGGCTTGTTC TTCGAGATGT ACATGAGTTG GCTCGGAAGT TCGGCGGATA ACCTGCTGGA GCTCGAGGAC GTGGGGTCGC TCTACGGCTC GTTCGCGAGT CAGGCTGCAC AGGGCGAGCG CCCAGTGCTC GAGGTAAGCG CCGAACCGGG CGACGCGACC GCCGAGGTCG AGTTCGAGAT GACACCTCTG GCTGACAGCT CGATCGACGC CAGTAGGAGC TACGGACCCT ACGTAAGTGC AATCGACGGC ATCCTCGAAT TCAAGGGACC CGTCGGCGGA CTGCGAATAG TGGCCCCCGT CTCAGTCATC ATAGGCGACG GTCGTACGGC AAGCATAACC AGCCCGGTCG AAATCGACGT GGAAGTTTTC GAGATCGATG CGCGCGAGCT TCGCGTGTTC AAGTCGACTG CTGGTGAGGA TGTCGGGCTC GGGCGGGTAG TGTTGGAGGC ACTCGATGCC TCCGTAGAAC GTGTCGAGCG CATCTTTCTT CATGGAGCGG AATTGCAAGT TACCTTCCCC GGTGCGCGTG CTCATCCTTG GGCGGACTAT GCCGTCAATA GGCAGGTAAC GCAAAATCCG AGGATCGCCT TACTCAGGAG GCGCGCGCGC 
AAGGTAATCA CTTCGTTTCG TTCGCACAGC AAGGGAGCTC TGGTCCGCCT AGCCGCTAAG ATCGAGCACA CGAGGATGAT GAAGGAAGGT GAGGACGGGC CCCGACTACT TCAGCGGCTG CGCGATGACG GCATATTGAC GACGTTCGAT GCCGGAAAGT TCTACGTCTT GCATCCTGAC AAGCTCGCCC AGCATTTCAA CATGGATTAC CAAGCGCTGC ATCTGCAGCG CTGGACAGAC GAGGCGGACG CCTATCTCTC CTCGATCGGC GGGTGA
|
Protein sequence | MFDIATGWMA RLGSAIIPDT SKAVGWIVDS QVDQAFVEPT LNILQGDPAT ACILIIAAPG AVGKSTYARS IGARANAVLV DLAQTEPLGG NFFVGGIANA FGYEALGDLA NGRIGLVVDA LDEAQLRSGS EGFSAGLLDL CTIVRNSSAL PATLLGRAAA AEEAWLILSE AGLNPCLLQI EYFDEDQSKQ FIRRRLPVIA DARDAMRNAF RKHEEKFVEF ALATREKLKS TPSGNEQRFA GYAPVLDAIC TYALGDDGLN PSTRLSNLAA ESAINLIVQI ATSILQREQT KLTSQIESPP AGIDLSSLYT PDEQLGRIAA ILLDSDAPAG ISISNAAFRQ AYEGMVAEFT PQHPFLDPRG GPSNAAFAAY LLVWAITTGN AKRDARRMLA LNPTFGSGLF FEMYMSWLGS SADNLLELED VGSLYGSFAS QAAQGERPVL EVSAEPGDAT AEVEFEMTPL ADSSIDASRS YGPYVSAIDG ILEFKGPVGG LRIVAPVSVI IGDGRTASIT SPVEIDVEVF EIDARELRVF KSTAGEDVGL GRVVLEALDA SVERVERIFL HGAELQVTFP GARAHPWADY AVNRQVTQNP RIALLRRRAR KVITSFRSHS KGALVRLAAK IEHTRMMKEG EDGPRLLQRL RDDGILTTFD AGKFYVLHPD KLAQHFNMDY QALHLQRWTD EADAYLSSIG G
|
| |
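The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two relate, the code below strips that whitespace and translates codons with the standard codon assignments, which translation table 11 shares for elongation (the tables differ in permitted start codons, which this sketch ignores). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only short excerpts of the sequence are used.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First three ten-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGTTCGATA TCGCAACAGG CTGGATGGCG"
print(translate(first_blocks))
```

The excerpt translates to the first ten residues listed in the Protein sequence field. Applying `gc_fraction` to the full gene sequence would be one way to check the reported GC content; the excerpt alone is too short to reproduce that figure.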