Gene Information |
Locus tag | Gdia_0468 |
Symbol | |
ID | 6973862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 513510 |
End bp | 514790 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643390000 |
Product | helix-turn-helix domain protein |
Protein accession | YP_002274879 |
Protein GI | 209542650 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2856] Predicted Zn peptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.00592204 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACCGCTC GGTGGGCATT AAAACAATTC GGCTACGCTG CACTCCGCGA GGCGGTCGAT GAGGGTGCGA CCGTCATTAG TGTCAGTCTT GATGAGCCTG CGCGCTCGCT GCGGGAGCAG CGTGTGAGGC TCGGACTCAC GCAAGAGCAG GTGGCTCGGG CTAGCATGCT TACGGTCAAC GACGTGCGGC GCGCTGAGCA GCTAGGGGCG ATCTCGCCGG TTCGTCGCCT GCAACTCCTT GCCCAAACTC TTGGGCTCAA CGATGAGCAA CTGGGCGTCC GACCAGATGC GAGCGCCGAC CAGCAATTAG CAGTCCGCCT GCGGACGCTC GGGAGTGCCG GACACAACGT ACGCTTCTCA CCCTCCCTGG TGCTCAAGCT TGCGGAGGCG GCGTGGACGA TATCGCGGCA AAACTTGCTC GCGGTCGAGC TTGGTGTTGT TCCGAATATA CTCAAGCACT TTGATGCGAG CGATGATTAT TATGCTCCAG TATGGCGCCG GGGGTATGAT CTGGCAGAAC GAACGCGCAC CCTGCTGAAC TTAGATCCAC TTGCCCCAAT CCCATCAGTG CGTGAGCTCA TTGATCAACT GGGGATTCCG CTGATTCAGG CCGCAATGGG AGCGGCCTTC GCAGGCGCAA CGGTAGTTAA TGGTGACGAT CGAGGGATCG TCATCAATAC TGAAGGTGAC AACCAGAATG TCTGGGTGAG GCGGATGACC TTGTGTCACG AGCTTGGCCA TTTGCTTTGG GATCCACCAG CTCGTCTGCG CCGCCTTCAT GTCGACCGTT ACGATGACTT GCGCACTGCC GAGGCCGGCG GCGGCGACGA AGTAGAGGCG CGTGCTAATG CCTTCGCCAT TTCCTTTCTG GCACCGCGTG AGGCTGTAAT CGAAATCGTC AAGCGTGGTG CAAGTCCGAC TGATCAGGTG ATCGAGCTCA TGAAACGCTT CGGCGTTGGC GCAACGGCCG CGAAGTATCA CATCGCGAAT GTCTCGCGCA ACTGGGGTGC CGAGGTCGAT ACCCGGCATG TCGCATCTAA CCAGCTTCCA CCACCTGATG ATTATTGGAC AACAAACGAG AACTGGACGG CGGACTATTT TCCCGTCGCC GGCGTTCCTA TTAGTCGTCG TGGGCGCTTC TCCGGGTTGG TAGCTATTGC CGCTTCCCGA GAACTGATTT CGACCGATAC GGCGGCATCA TGGTTGCAGG CGGCACCCTC GGCTTTGGCT CAGCAGTTGG CGACAATTGC CGAACTGACT GCTCAGGATC TTGCTGTTTA G
|
Protein sequence | MTARWALKQF GYAALREAVD EGATVISVSL DEPARSLREQ RVRLGLTQEQ VARASMLTVN DVRRAEQLGA ISPVRRLQLL AQTLGLNDEQ LGVRPDASAD QQLAVRLRTL GSAGHNVRFS PSLVLKLAEA AWTISRQNLL AVELGVVPNI LKHFDASDDY YAPVWRRGYD LAERTRTLLN LDPLAPIPSV RELIDQLGIP LIQAAMGAAF AGATVVNGDD RGIVINTEGD NQNVWVRRMT LCHELGHLLW DPPARLRRLH VDRYDDLRTA EAGGGDEVEA RANAFAISFL APREAVIEIV KRGASPTDQV IELMKRFGVG ATAAKYHIAN VSRNWGAEVD TRHVASNQLP PPDDYWTTNE NWTADYFPVA GVPISRRGRF SGLVAIAASR ELISTDTAAS WLQAAPSALA QQLATIAELT AQDLAV
|
| |
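The length fields in this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the source database. It assumes that Start bp and End bp are 1-based, inclusive coordinates, and that Gene Length includes the terminal stop codon (the gene sequence above ends in TAG). The function names `span_length`, `expected_protein_length`, and `gc_fraction` are hypothetical helpers introduced only for this example.

```python
# Values copied from the record above.
START_BP = 513510
END_BP = 514790
GENE_LENGTH_BP = 1281
PROTEIN_LENGTH_AA = 426


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


if __name__ == "__main__":
    # Coordinates and stated gene length should agree.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # Stated protein length should match the codon count minus the stop.
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

To compare against the GC content field, paste the Gene sequence value into `gc_fraction`. The blocked formatting with spaces is handled by the function.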