Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5102 |
Symbol | |
ID | 5737060 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 132027 |
End bp | 133856 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641282267 |
Product | hypothetical protein |
Protein accession | YP_001547858 |
Protein GI | 159901612 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTATT CTCCCATGCC TGTCAAGATC CTCGCCGCCC ACGCGTTTGC TGCCGTGCCC ATTCGTACCT TGCGCACGCT GGCCCATTAT ACCCACTATC AACGGCTTGC ATCCCAAACC CGTGCGGGCC TTGCCACTGA CCTGCTGTGC CACTGGACAA CACCGGCTTA TCGTCGGCTG GTCCGGCAAT CCTTGACCGC TACCGATTAT GCCCTGCTCC ACGCCCTGTG GGCTGGCGAG CATCCCCTCC CCGACCCTAA TACCCTTGAT CTCTGGCGCT GGCAGGCTCC GTGGCCAACG CTCGCCAGCC TTTCCTCGCT CCAGCGTTTG GCGGTCTTGG GCTTCCTTCT GCCCATCCGC ACGCCGCTAG GTCGGCAAAC CGTACTGCTC CGCGATACGA CCCGCTGGCT GCGTCGCGTT CGTCCCGCGC CCGCACCGCC CACCCCCGCA TCGTTCCAAT CGCTCTTTCA GGCGGTCGCC GAGCTGCTCA TCCACGGGTC GATTCAGCCA CTGCCTGCCG CGACACCCGG ATCGATGGCG CGCACGCTTG CCCAGTCTGC TGGCTGGCTC GTCCTGCGGC TGGAGCAGTG GCGCACGACC CCGCGTGGCA TCGCGTGGGT GCAGGCGAGC CTCGCCGAGC AGGAACGGCT GCTGCAACAC CAGATCGTGC GCTGTTCCCC GCCGGACAGT GGTCTCCCCG CATGGCGCAA TCCCGATTGG GCGACGCTCT GGCAGGCCTT CGAAGCCCTC ATGCACGATC ACGCCCCACG GCGGATGTGG GATGTGCTGG CGCTGCTCTG GGCGCATCCA GCGTGGGGCA CGCTGCCGGA CGACCAGCGC GGGCGGCTGT TTGGCCAGTG GCTGCGACAG GTACTCCAGC CAGCGGGGGT GGTCAGTCTC GCGCAGGGCT GGGTGTTCTG GCATGGCTGG TCAGCGTTGA CGGTAACTGC GCCCCCTTTT GATGGACTCC TGCTGCCCGC CACGCCTCAT CTGCCCCCGC TACTGCGGTG GTGGGCGACC TACTGGGGCC AGCCGACGCA CCATGGCTGG CGGATCAGCG TGGCAGCGGT GACGGCACGG GTGCAGCAGG ATGGCGATCT AATGGGGGTG TGGGAGCCAC TCGATGCGTG GTATGCCGCC CGGCCACCAG CGGTGGAGTC CGTGGTGGCC ACGGTAGCGG CGCGACCACG CGTGCGGCTA CGGCAGGTGA TGCTGGTGGA AGGCCGTGCT GAGGCGTTGA CGGTATTGGA GCAGCAGCGC GGCATGCAGG GTGTGGTGCA GGCGGGCTGG GCGGCAACCC ACCGGGTGAT TGCGGCGGAA GCAGTGGCCC AGGTCGCGCG TGCCGTGGGG TTGCCGTCGC CGCGCCAGTC CGCCCCGCCC CGCGAGGTCG AAACGCTGGT GTTGGCGTTG CGGATTGCGG CGCAGCACGT ACCGAGCCAT GCGACGGCGT TCCAAGGGCA AGCGCAGCAG TTGCTCGCCG ATCTGTCGTT TGCGCAGCGA TGTGTGATCG ACGAGCAATG GGAGGGGTTG CACTCTAGCC TCACGCCGCC GCTGGCCATT GATGCGGAAC CGCTGGCCGT TGGGCAGCAG CCACGAGCGC AGATCACGGT GGATCATGCA CGACAGGTGG TGCGAGAGGC AATTCAGGCG GGTCATGCGC TGACGGTGCG CTATTACACG CCGTCAGCGC ATCGGATCAC GACGCGCACA ATTCGCCCGC TCGAACTGAC CAGCACCGGA GTCCGTGGCT GGTGCGAGCT GCGGCAGGAA GAGCGAGCTT TTCGCTTTGA TCGGGTGTTG GCGGTGACAG TCCACCATGA ATCGGGGTAA
|
Protein sequence | MPYSPMPVKI LAAHAFAAVP IRTLRTLAHY THYQRLASQT RAGLATDLLC HWTTPAYRRL VRQSLTATDY ALLHALWAGE HPLPDPNTLD LWRWQAPWPT LASLSSLQRL AVLGFLLPIR TPLGRQTVLL RDTTRWLRRV RPAPAPPTPA SFQSLFQAVA ELLIHGSIQP LPAATPGSMA RTLAQSAGWL VLRLEQWRTT PRGIAWVQAS LAEQERLLQH QIVRCSPPDS GLPAWRNPDW ATLWQAFEAL MHDHAPRRMW DVLALLWAHP AWGTLPDDQR GRLFGQWLRQ VLQPAGVVSL AQGWVFWHGW SALTVTAPPF DGLLLPATPH LPPLLRWWAT YWGQPTHHGW RISVAAVTAR VQQDGDLMGV WEPLDAWYAA RPPAVESVVA TVAARPRVRL RQVMLVEGRA EALTVLEQQR GMQGVVQAGW AATHRVIAAE AVAQVARAVG LPSPRQSAPP REVETLVLAL RIAAQHVPSH ATAFQGQAQQ LLADLSFAQR CVIDEQWEGL HSSLTPPLAI DAEPLAVGQQ PRAQITVDHA RQVVREAIQA GHALTVRYYT PSAHRITTRT IRPLELTSTG VRGWCELRQE ERAFRFDRVL AVTVHHESG
|
| |