Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0409 |
Symbol | |
ID | 5732305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 479199 |
End bp | 480437 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277532 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_001543188 |
Protein GI | 159896941 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTAC AAACAACCCT TGATATTCAG GCCATTCGCG AGCAATTCCC GCTCTTGGAT CAATCGATTA ACGGCCATCG CCTAGCCTAT TTGGATAGCA CCGCAACTGC CCAAAAGCCG CTAGCAGTGC TTGATGCGAT GGATCGCTAT TATCGCACGA TTAATGCGAA TGTTCATCGA GGTGTGTATC AGATTAGCGA AGCCGCCACC GAAGCCTATG AAGGCACGCG CCGCACGATT GGCCGCTTTA TCGGCGCAAA ATCGACCAAA GAAATTATTT TTACTCGCAA CGCGACCGAA GCGATTAACT TGGTTGCCCA AAGCTGGGGC CGTGCTAATT TGCAAGCGGG CGATCGAATT TTGCTCACAG TCAGCGAGCA TCATTCAAAT TTAGTGCCAT GGCAATTGCT AGCAGCCCAA ACTGGTGTAG AGCTTGATTT TATCGAGCTT GATGATCAAG GCCGACTTGA TCTCAGCCAC CTTGATCAAC TATTGACTGA ACGCACCAAA TTGGTCGCCA TGACCCACAT GTCGAATGTG TTGGGCACGA TCAATCCAGT TGAACGGGTG ATTGCGGCGG CCAAACAGGT TGGAGCCTTG GTGCTGCTGG ATGGGGCGCA AAGTGTGCCA CATATTCCCG TCAATGTTCA AGCACTTGGC TGCGATTTCT TGGCCTTTTC GGGGCATAAA ATGTGCGGTC CAACTGGGAT TGGGGTGCTG TGGGCGCGGC GCGAATTGCT TGAAGCCATG CCGCCGTTTA TGGGTGGCGG CGATATGATC AAACGGGTCG GGCTACGCGA AAGCTCATGG AACGATCTCC CATGGAAATT CGAGGCAGGC ACGCCAGCGA TTGCCGAGGC GATTGGCCTT GGCGCGGCGA TTGACTTCTT GAATGAACTT GGGATGCAGG CGATTCACGA GCGCGAACGC CAATTGACCC ACTACGCTTG GGATAAACTC AGCGCCATCG ATGGGTTGAC CATTTTTGGT CCACCTGCTG CCGAGCGCGG TGGCTTGTTG AGCTTTACCC TTGCAGGTGT GCATGCCCAC GATGTGGCAG CGATTCTCGA TACCCAAGGG ATTGCAGTGC GGGCTGGGCA TCATTGCACC CATCCATTGC ACGATATTTT TGGCGTGCCA GCAACGGTAC GCGCATCATT CTACCTATAC ACGCTTGAGG AAGAAATTGA TCGTTTGGCC GAAGCCTTGG TTTTGGCTCG CGATACCTTC CAACTGTGA
|
Protein sequence | MSVQTTLDIQ AIREQFPLLD QSINGHRLAY LDSTATAQKP LAVLDAMDRY YRTINANVHR GVYQISEAAT EAYEGTRRTI GRFIGAKSTK EIIFTRNATE AINLVAQSWG RANLQAGDRI LLTVSEHHSN LVPWQLLAAQ TGVELDFIEL DDQGRLDLSH LDQLLTERTK LVAMTHMSNV LGTINPVERV IAAAKQVGAL VLLDGAQSVP HIPVNVQALG CDFLAFSGHK MCGPTGIGVL WARRELLEAM PPFMGGGDMI KRVGLRESSW NDLPWKFEAG TPAIAEAIGL GAAIDFLNEL GMQAIHERER QLTHYAWDKL SAIDGLTIFG PPAAERGGLL SFTLAGVHAH DVAAILDTQG IAVRAGHHCT HPLHDIFGVP ATVRASFYLY TLEEEIDRLA EALVLARDTF QL
|
| |