Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_1577 |
Symbol | |
ID | 6974987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | - |
Start bp | 1752954 |
End bp | 1754699 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643391108 |
Product | transcriptional regulator, NifA, Fis Family |
Protein accession | YP_002275971 |
Protein GI | 209543742 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains |
TIGRFAM ID | [TIGR01817] Nif-specific regulatory protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 63 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTACGG ATACCGTTCC CTTTCGCCCC GTCAGCGGGG AAGGCCATTC GACCTGGGCT GAACGCGCCT TGTTCGGCCT TCACGAGATA TCGAAAATCC TGTGCGCGCC GACGGAGACC GTCGGAATCA TCAGGAATGT CCTGGCCGTT CTGGAAAGCT TCGTCGACCT GAACAATGCC GTCGTCGCGC TGTTCGACGC GGCGGGCAAT GTGGAGACCA TCATCGGCAC CGAGGCCGAT GACGCCGCCG CCCGGCGCTA TTTCGATTCG ATTCCCGAAC AGGCGGTGGG ACAGATCGCC GTCAGCCGCC AGCCGCTGCT GGTGCCCGAC GTAACGGGTG ACACGCGCTA TGGCTTCGCG ACGTCGCAGG CCTGGACGGT GGGGGCGGGG CGCATCGGCG TGATCGGCGT GCCGATCCTC AGCCGCGACA GGCTGGTGGG TGTCATGCTG CTCGATCGCC CGGCCATTCC GGAGGAATCG GTCGTCGGCA GCATGGATGT CTGCCTCCTG TCGATGGTGG CGGGGCTGAT GGGGCAGATG GTGCGCCTGC ATCGGCTGGT GCAGCGGGAC CGGGAAAACC TGCTGCACGA CAGCGGCCTC GCCCAGCCTG CGGCGCCGGT TGCCGATGGC GGCGGATATA TGGGGATCGT CGGCGACAGT CCAGCGCTGC GGGCCGTTCT GCAGAAGGTC GAAATCGTCG CCAGAAGCGA CGTCACCGTC CTGTTGCGCG GTGAATCGGG CACGGGCAAG GAATTGTTCG CCCAGGCGAT CCACCAGGCC TCGCCCCGGC ATCGCAAGCC GTTCGTCAAG CTGAACTGCG CTGCCCTGCC GGAAAGCGTT CTGGAATCCG AACTGTTCGG GCATGAAAAG GGCGCCTTTA CCGGCGCGAT CGGCCAGCGC AAGGGCCGGT TCGAACTGGC CGATGGCGGC ACCCTGTTCA TGGACGAGAT CGGCGAGATC TCGCCCAGTT TCCAGGCCAA GCTGCTGCGG GTGCTGCAGG AGGGCGAGTT CGAACGCGTG GGCGGCACCA GGACGCTGAA GGTCGACGTG CGCGTGGTGA CGGCGACAAA TCGCAATCTG GAGGAAGCGG TCAGCAAGGG CCGCTTTCGC GCCGACCTCT ATTACCGGCT GAGCGTCATT CCGATCTTCC TGCCGCCGCT GCGGGATCGC CCGACGGATA TTCCCATGCT GGCCAATGAA TTCCTGCGGC GGTTCAACGA AACCAACCGG ACGTCGCTGA CCATCGCGTC CGACGGCATG GACGTCCTGA CAACCTGTTA TTTTCCGGGC AACGTACGCG AACTGGAAAA CTGCATCCGG CGCACCGCGA CGCTGGCGCC GGGGCAGACC ATCGGCGCGG CCGACTTTGC CTGTCGCAGC GACGGCTGCC TGTCCTCGAT CTTCTGGCGG CCGGCCGCAC CCGCCCCGGG TCCGGGCCGC CATCCGGCGG CATCCGATGT GGCCGGGGAC ACCGGCGCGG CGACGGAACT GCCCGTTCTT CCGGCGCCCC GGTCTTCGTC CCAGTCTTCG CCCCAGTCCT CGGCCGCGCC CGACCCGGCG ACGTGCCCGT CCTCGGCCGT GTGTTCGGCC GCGCAGGGGG AATGGAGCCC CCAGCGCGAC GAACTGCTCG AAGCCATGGA AACCGCCGGC TGGGTGCAGG CCAAGGCAGC GCGGATTCTG GGCCTGACAC CCCGTCAGAT CGGCTATGCC CTGCGCAAGA ACGGGATTTC CATCAAGAAA TTCTGA
|
Protein sequence | MPTDTVPFRP VSGEGHSTWA ERALFGLHEI SKILCAPTET VGIIRNVLAV LESFVDLNNA VVALFDAAGN VETIIGTEAD DAAARRYFDS IPEQAVGQIA VSRQPLLVPD VTGDTRYGFA TSQAWTVGAG RIGVIGVPIL SRDRLVGVML LDRPAIPEES VVGSMDVCLL SMVAGLMGQM VRLHRLVQRD RENLLHDSGL AQPAAPVADG GGYMGIVGDS PALRAVLQKV EIVARSDVTV LLRGESGTGK ELFAQAIHQA SPRHRKPFVK LNCAALPESV LESELFGHEK GAFTGAIGQR KGRFELADGG TLFMDEIGEI SPSFQAKLLR VLQEGEFERV GGTRTLKVDV RVVTATNRNL EEAVSKGRFR ADLYYRLSVI PIFLPPLRDR PTDIPMLANE FLRRFNETNR TSLTIASDGM DVLTTCYFPG NVRELENCIR RTATLAPGQT IGAADFACRS DGCLSSIFWR PAAPAPGPGR HPAASDVAGD TGAATELPVL PAPRSSSQSS PQSSAAPDPA TCPSSAVCSA AQGEWSPQRD ELLEAMETAG WVQAKAARIL GLTPRQIGYA LRKNGISIKK F
|
| |