Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_3229 |
Symbol | |
ID | 6976668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 3537497 |
End bp | 3538477 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643392740 |
Product | transcriptional regulator, AraC family |
Protein accession | YP_002277572 |
Protein GI | 209545343 |
COG category | [K] Transcription |
COG ID | [COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.382079 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.686696 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCCACG GCACTCATTC CGCCCGGAAG GCGCGCGCCG CGACACCCGT CTTTGAAGTC GTGCGTCCCG AACCGCGCAA CAGCTTCGTC TGGCATACGC ATGATTATCC GGCGCCCTGC GCGCGGTGGA ACTATCACCC CGAATATGAA CTGCACCTGA TCACCAAGGG GTGCGGCCAG TATATGGTTG GCGATTATTT CGGCTTTTTC GCGCCGGGAA ACCTCGTGCT GATCGGGCCG AACGTGCCGC ACGGCTGGTT CAGCGACGTA ACACCCGGCG AAAGCGTACC GGACCGCAAT ATCGTTCTTC AGGTCAACAA GTCCTGGTTC ACGGGCCTGA TGACGCTTTG TCCGGAACTT GACGTTCTGC ACGGGCTTCT CGCGGCGTCA GGACGTGGGG TCGAATTCCT CGGCCCGGGG GTCGCTGCCC TTGGCGCGCG CCTGGCCGGG CTCGGCACGA TGGACGATGC CGCGCGCATT CCCGCCATCA TCGGCCTGTT GCTCGACCTG GCCCAGTGCT CTTACCGGAC GTTATGCAGC GCGGGCCTGA CCCTGTCCCC GGATGACAGG GAACTGGAGA AAATAGATTC CATCATCAGG AATATAATCG ACGACAACAT CGTTTTTCGT CAGCAGGCCG AAATTGCAAA AGCGGTCGGA TTGTCCGCAC CCGCGTTTTC GCGGCAGTTC CGGCGCGCCA CCGGCGATAC GTTCGTATCC TTCATGAAAA AGCTGCGGAT CGGCAAGGCA TGTCAATTGC TGATGACAAC AGACGCCTCG ATTGCCGACA TCAGCGCGGC CACGGGGTTC GGCAACCTGT CCAATTTCAA CAGGCAGTTT CTCCAGATTC GCCAGACGAC CCCTTCGCAA TACAGGCGGG ACGTCCGACG CCTGGTGAAA CAGGATGCTG AAACAGCAAA AATTCGAAAC GACCGCCCAT ACGGAATGCA CAGTCACCAC GCACAGCGAT ACGTACAATA G
|
Protein sequence | MVHGTHSARK ARAATPVFEV VRPEPRNSFV WHTHDYPAPC ARWNYHPEYE LHLITKGCGQ YMVGDYFGFF APGNLVLIGP NVPHGWFSDV TPGESVPDRN IVLQVNKSWF TGLMTLCPEL DVLHGLLAAS GRGVEFLGPG VAALGARLAG LGTMDDAARI PAIIGLLLDL AQCSYRTLCS AGLTLSPDDR ELEKIDSIIR NIIDDNIVFR QQAEIAKAVG LSAPAFSRQF RRATGDTFVS FMKKLRIGKA CQLLMTTDAS IADISAATGF GNLSNFNRQF LQIRQTTPSQ YRRDVRRLVK QDAETAKIRN DRPYGMHSHH AQRYVQ
|
| |