Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1067 |
Symbol | |
ID | 4709845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1159009 |
End bp | 1160445 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639855538 |
Product | GntR family transcriptional regulator |
Protein accession | YP_001002645 |
Protein GI | 121997858 |
COG category | [E] Amino acid transport and metabolism [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.315299 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTACAC CACTCTACGA ACGGGTGGCC GAGACGCTGG CTCATCAGAT CGAGCAGGGC GTCTACGCGC CCGGGGATCG TCTGCCCGGA CTGCGGCGTC TGCGCCGGCA CTTCGGGGTG AGCATGGCCA CCATCGTCGC CGCCTGCGAA CTCCTTGAGC AGCGCGAGGT CCTCGAGGCC CGACCCCGCT CGGGGTTCTT CGTCCGCGAA CGGGCGACAG CGAGCGCAGC CCCCAGCGAG CAAGTGGACG AGGTGGATGC GCCGAGTCTG GTGCTCGGCC AGGAGCGCGT CCTCGACCTG GTGCGCGCCC ACCACGAGCC GGATGTGGTC TCGCTGGGGG CGGCCGCAGC GCCGGCGGCG TTCCTCCCGG CCGCGGCCAT GGACCGCAGC TTCGCCCGGG TGCGTCGCCG CCAGCGCGAG CGGGTCAATG CCTACGACTT CCCGCCGGGG TGTGCCGAGC TGCGCACCCA GATCGCCCGC CGCATGGCCT ACGCCGGCTG CAGCCTGTCT CCGGACGACA TCGTGCTCAC CGCCGGGTGC CAGGAAGCGA TCACCCTGAG CCTGCGCGCC GTGGCCGAGC CCGGCGACGT GGTGGCCATC GAGTCGCCGA CCTTCTACGG CATCCTTCAG GCCATCGAGT CCCAGGGCAT GCAGGCCCTG GAGGTGGCCA CGGACCCGCA TTCCGGGATG ATCCCCGAGG CGCTGGAACG GGCCCTGTCG CGTTGGTCGA TCCGCGCCTG TGTGCTGATG CCCACCTTCG GCAATCCGTT GGGGCACGCC ACCCCGGAGT CGCGCAAGCA GGAGCTGGTG GGGGTGCTCA GCGCCCACGG CGTGCCCTTA ATCGAGGACG ATGTCTACGG TGAGCTGGCC TTCGACGGTA CCCGGCCGTG GGCAGCGAAG GCCTTCGACG CCCACGGCGA GGTGCTCTAC TGCAGTTCCT TTTCCAAGGT CCTCGGTGCC GGGCTGCGGG TCGGCTGGGT CGCGCCGGGC CGTTACCGGG ACCGCCTGGT CTATCTCAAG TACGCGACCA GTCAGGCCAC TTGTACCCTG AGTTCGTTGG CGGTGGCCGA CTACCTGGAG CAGGGCGGCT ACGACCGCTT CCTGCGCCGG GCCCGGCGCA ACTACGAGCG GCATGTGCGC TGGGTGGCCG GCCTGGTACG CCGCCACTTC CCGGAGGGCA CCCGGGTGAC GCGGCCGCGG GGCGGTTTCG TGGTGTGGGT GGAGCTCCCG GCCGGCTGCG ATAGCCTGGA GCTGCAGCGC CGCGCCCTGG CCGAGGGAAT CAGCATCGCC CCCGGTCCCG TCTTCTCGCC CACCGGCCGG TACGGACGCT GCCTGCGGCT CAACTGCGCC CAGGCCGACG TGCCGGCGAC AGAGTGGGCG CTGCAGCGGC TCGGCGCCCT GGCCATCGGC GAGGATCGCG GCGACGTGCG CGGCTGA
|
Protein sequence | MTTPLYERVA ETLAHQIEQG VYAPGDRLPG LRRLRRHFGV SMATIVAACE LLEQREVLEA RPRSGFFVRE RATASAAPSE QVDEVDAPSL VLGQERVLDL VRAHHEPDVV SLGAAAAPAA FLPAAAMDRS FARVRRRQRE RVNAYDFPPG CAELRTQIAR RMAYAGCSLS PDDIVLTAGC QEAITLSLRA VAEPGDVVAI ESPTFYGILQ AIESQGMQAL EVATDPHSGM IPEALERALS RWSIRACVLM PTFGNPLGHA TPESRKQELV GVLSAHGVPL IEDDVYGELA FDGTRPWAAK AFDAHGEVLY CSSFSKVLGA GLRVGWVAPG RYRDRLVYLK YATSQATCTL SSLAVADYLE QGGYDRFLRR ARRNYERHVR WVAGLVRRHF PEGTRVTRPR GGFVVWVELP AGCDSLELQR RALAEGISIA PGPVFSPTGR YGRCLRLNCA QADVPATEWA LQRLGALAIG EDRGDVRG
|
| |