Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1682 |
Symbol | |
ID | 5733566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1956733 |
End bp | 1958310 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278821 |
Product | hypothetical protein |
Protein accession | YP_001544453 |
Protein GI | 159898206 |
COG category | [S] Function unknown |
COG ID | [COG5267] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAATC GACGAAATTT TTTGAAAATC AGTGGACTGG CCGCTGCCTA TAGTGTGCTC CCCCTCTGGC TAAGCGCCTG TGATCGGGCG GCTCCTAGCG CAATCGCCCA GCCAACCTGG GAGCTTGAAG CTCCGACCAA CGGCGATCAT CGCAGCCGTG TGGCAATTCG CCATTTGCTC AATCGGCTCA GCTATGGCCC ATTGCCCGGC CAAATTGAGC AAGTTCAAGC CCTCGGTTGG GATGCCTACC TTGAACAGCA ATTAAACCCA AGCCAGCTTG ACGACTCAGC ACTTGAGCAA CAATTGGCTC AATTTACAAC CCTCAAGCTC TCTAGTGCCC ATTTAATCGA GCACTATCCC AAAGGTGCGA ACGGCCCACG CCTGATTATG CGCGAGCTTC AAGCCGCTAG TTTATTGCGG GCAGCGAGCA GCCAACGCCA ACTATTCGAG CTAATGGTCG ATTTTTGGAG CAACCATTTC AATATTTACA TTGGCAAAAA TCAGGTCAAA TGGCTCAAAA CGGCTGATGA TCGTGAAGTA ATTCGCCAGC ATGCGCTCGG TAAATTCCGT GATTTATTGC TGGCTTCGGC CAAAAGCCCA GCCATGTTGG TCTATCTCGA TAATGCCGAA AACGTGCGAC CTGGAGTTAA GGTTGGCAAG AAGATGCTTG GCTTGAATGA AAATTATGCC CGCGAACTGC TCGAACTGCA TACCGTCGGG GCTGATGCAG GCTATAGCCA AGCCGATGTC CAAGCAGTTG CCCGTGTTTT GACGGGCTGG ACAATCACCC GAGCCAACAG CGAGCAGCCT GGACTTTTCC AATTTCTGCC CAAATTTCAC GATGTTATGG CCAAACGAAT CGATTTTCTG CAGCTAGATT TGGCTGCCGA TGGCGAAATC GAAGAGGGCG AATTATTGTT AAAGCTGTTG GCTGAACACC CTAAAACTGC CCAACGATTG GCCTATAAGC TCTGCCTACG CTTCGTCAGC GATGATCCAC CAGCTGATTT AGTTGAGCGG GTTGCTCAAG CGTATCTTCA GCACGATACC GATATTCGCG CCATGCTCAA CATGTTAGTC AACTCCGCTG AGTTTTTGGC CGCTGCTCAG CAAAAAATCA AACAGCCCAT GCATCTGTTA ATTTCAGCCA TTCGCGCTAC CAATAGCAGC ATCACCAAAC AAGCCTTCAA GGGTAAAAAC AACCTCCTTG ATCAATTGGA AACCTTAGGC CAAATGTTTT TCAACTGGCC TCCACCCGAT GGCTATCCCC AAATCAGCAG CGCTTGGATC AACACTGGCG CGATGCTCAG TCGTTGGAAT CTGGCCTTTG CACTCGCTGA AGGTCGAATC GACGGCCTAA AAACCGATGT CCCTAAATTC GCCAAACAGC CAAGCCAAGC CAGCGAATTG GTCGATACGC TAGCCGATTA TCTCAATTTA AGCCTTGCCG CCGAGTCGCG AGCCAGTTTA ATCGATTATT TAAATGATAG CCAATCACCA AACCCAACCG TTGATCAAAC TAAAATCGCT GGCCTACTTG GCCTATTGCT GACCAGCCCT GAATTTCAAT TGTGCTGA
|
Protein sequence | MLNRRNFLKI SGLAAAYSVL PLWLSACDRA APSAIAQPTW ELEAPTNGDH RSRVAIRHLL NRLSYGPLPG QIEQVQALGW DAYLEQQLNP SQLDDSALEQ QLAQFTTLKL SSAHLIEHYP KGANGPRLIM RELQAASLLR AASSQRQLFE LMVDFWSNHF NIYIGKNQVK WLKTADDREV IRQHALGKFR DLLLASAKSP AMLVYLDNAE NVRPGVKVGK KMLGLNENYA RELLELHTVG ADAGYSQADV QAVARVLTGW TITRANSEQP GLFQFLPKFH DVMAKRIDFL QLDLAADGEI EEGELLLKLL AEHPKTAQRL AYKLCLRFVS DDPPADLVER VAQAYLQHDT DIRAMLNMLV NSAEFLAAAQ QKIKQPMHLL ISAIRATNSS ITKQAFKGKN NLLDQLETLG QMFFNWPPPD GYPQISSAWI NTGAMLSRWN LAFALAEGRI DGLKTDVPKF AKQPSQASEL VDTLADYLNL SLAAESRASL IDYLNDSQSP NPTVDQTKIA GLLGLLLTSP EFQLC
|
| |