Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1455 |
Symbol | |
ID | 5736866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1694190 |
End bp | 1695449 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278593 |
Product | cysteine desulfurase family protein |
Protein accession | YP_001544227 |
Protein GI | 159897980 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.00000306553 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTCG ATTTAGCGCC ATTTCGTAGT CACTTTCCAG CGTTAACCCA GACTCACGCG GGGAAACCCT TAGTTTTTTT TGATAATCCT GGTGGAACCC AAGTGCCCCA ACAAGTGATT GGCCAGATGA CCGATTATTT GCGGCGTTCG GTTGCCAATA CCCATGGTGC TTTTATCACC AGCCAGCGCA CCGATGCCGT CATTGACGAA TGTCATGCAG GGTTGGCGGC CTTGCTTGGC GGTGAGCCAG ATGAAATTGT GCTGGGAGCC AACATGACTT CGCTCACATT TGCACTCAGT CGTTCACTAG CTCGCGAATG GCAAGCTGGC GATGAGATTA TTCTGACCAC GCTTGACCAC GATGCCAACG TTACGCCATG GCTGCTGGCT GCTGAAGAAC GTGGGGTTAT CGTACACTTT GTGGATATTA ATCCTGTTGA TTGCACCCTG GTGATGAGTG ACTTTGAGCG CTATCTTTCG CCGCGCACTA AATTGGTGGC CGTTGGTTGG GCCTCAAATG CCTTTGGCAC AATTAATGAT GTTCAAACGA TTGTCAAACA AGCCCATGCT GTTGGTGCTT TGTGTTTTGT TGATGCGGTT CAGAGCGTGC CCCACATTCC ATGCGATGTC AAAGCGCTTG ATGCCGATTT TGTGGCATGT TCGGCCTATA AATTTTTTGG GCCGCATGTT GGGGTGCTCT GGGCCAAACG CGAACATCTA GAGCGCCTGT TTGCTTATAA AGTGCGACCT GCCCCCGAAA CTTTGCCTAG TCGCTGGGAA ACTGGCACGC AAAATTTTGA AGGCCAAGCG GGCATCAACG GAGCCTTGGA ATATCTCGGT GGCTTAGGTG TGGGTTATAT GGAGCGCTAC GATCAGCTGC TTGGCGAAAC GGTTGGTCAA CGGGCCGTCT TGTTGTCCGC AATGCATGCG ATTGCTGAAG CCGAGCAGAG CCTTGGCCAA TATTTGATTC AAGCGTTGCA AACGCTTAAA GGTGTGCAAT TGTATGGCAT TTTGGAGCCT GAACGTGGCC ATTTGCGCGT ACCCACCGTG GCATTTCGCA AGGCTGGAGT TACGCCCCAA GCAATTGCCA AAACCTTTGG TAATGAGGGA ATTTGTGTTT GGGATGGCCA TTATTATGCC TTGCGAGCCG TCGAACGCTT AGGCTTGCTT GATCAAGGGG GGATGGTGCG GGTTGGTTTA GCCCATTACA ACACGCGCAC TGAGATTGAT CGTATGCTGA ATGTGCTTGA ATCAATTTAG
|
Protein sequence | MTVDLAPFRS HFPALTQTHA GKPLVFFDNP GGTQVPQQVI GQMTDYLRRS VANTHGAFIT SQRTDAVIDE CHAGLAALLG GEPDEIVLGA NMTSLTFALS RSLAREWQAG DEIILTTLDH DANVTPWLLA AEERGVIVHF VDINPVDCTL VMSDFERYLS PRTKLVAVGW ASNAFGTIND VQTIVKQAHA VGALCFVDAV QSVPHIPCDV KALDADFVAC SAYKFFGPHV GVLWAKREHL ERLFAYKVRP APETLPSRWE TGTQNFEGQA GINGALEYLG GLGVGYMERY DQLLGETVGQ RAVLLSAMHA IAEAEQSLGQ YLIQALQTLK GVQLYGILEP ERGHLRVPTV AFRKAGVTPQ AIAKTFGNEG ICVWDGHYYA LRAVERLGLL DQGGMVRVGL AHYNTRTEID RMLNVLESI
|
| |