Gene Information |
Locus tag | Hneap_1195 |
Symbol | |
ID | 8534348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 1302962 |
End bp | 1304191 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 646383585 |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_003263078 |
Protein GI | 261855795 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01976] cysteine desulfurase family protein, VC1184 subfamily; [TIGR01979] cysteine desulfurases, SufS subfamily |
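The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS is complete and ends in a stop codon, so the protein has one residue fewer than the number of codons. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # the stop codon encodes no residue


# Values taken from the record for Hneap_1195.
length = gene_length_bp(1302962, 1304191)
residues = protein_length_aa(length)
print(length, residues)
```

With the values in this record, the result agrees with the listed Gene Length (1230 bp) and Protein Length (409 aa).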
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.542201 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCGC TCAAACTCGA CGTTGCCGCC ATTCGAAATC AGTTCCCCGC GCTGGATCAA ACCATTCAGG GCAAGCCGCT GGTGTACCTC GACAATGCCG CCACCACGCA AAAACCCGAA TGCGTTATTG AAGCGGTAAG CGCGTTTTAT CGTCACGATA ACGCCAATAT TCACCGCGGC GTACATACCT TGTCGGCGCG TGCGACCGAT CAGTATGAAG CCGCGCGTGA ACAGGTGCGT CAAAGCCTGA ACGCCAGCGC CACAGAAGAG ATCGTGTTTG TTCGCGGCAC CACCGAAGCC ATTAATCTGG TGGCATCGAG TTTCGGGCAA ACGCTCAAAG CAGGCGATGA AATCATCATT TCTGCTTTGG AGCATCACGC CAACATCGTG CCGTGGCAGT TGTTGAAACA GCGCATTAGC ATCACGCTAA AAATTATACC AGTCACGCCG GAAGGGGAAC TCGATCTTGG CGCACTGCCC GCGCTGATCA CCGAGCGCAG CCGCCTGATT GCCGTCAATC ATGTTTCGAA TGCCTTGGGC ACCATTAATC CGGTAGCTCA AATCGCGGCC ATTGCCAAAG CCCATGGCGT ACCGATTTTG ATTGATGGCG CTCAGGGTCT ACCGCACGGC CCGGTGGATG TTCGGGCCAT TGATTGCGAT TTTTACACCC TTTCCGGGCA TAAAACATTC GCCCCCAGTG GCGTCGGCGT TCTGTATGCG CGTCGCCCCT GGTTGGATGA GTTGCCACCC TATCAAGGGG GCGGCGACAT GATCGAGACG GTCAGTTTCG AGGCCGTGCG CTACGCGCCG CCGCCTGCAA AATTTGAGGC AGGCACCCCC AATATTGAAG GCGCCATCGG CTTGAACGCG GCACTGCAAT GGCATGCCGC ACTCGATTGG CCCGCCATTA AAGCGCATGA AGCTGCCCTG CTTGCACACG CCACCACGGC ACTTGAGGCA ATCGCTGGCT TGCGCATTGT CGGTACGGCA AAAAACAAGG TGCCTGTGCT TTCGTTCATT ATCGAAGGCG CACACCCGCA CGACATCGGC ATGCTGCTTG ATGCCCAAGG GATTGCGGTG CGTACCGGAC AACATTGTGC CATGCCGGTT TTGCAGTTTT TAGGTGCGCC ACAAGGCACC GTGCGCGCTT CATTTGCTTT TTATAACACG CTGGATGAGG TCGATGCGCT GGTTCGCGCT GTGCATCGCG CCCAGCATAT GCTGAGCTGA
Protein sequence | MSSLKLDVAA IRNQFPALDQ TIQGKPLVYL DNAATTQKPE CVIEAVSAFY RHDNANIHRG VHTLSARATD QYEAAREQVR QSLNASATEE IVFVRGTTEA INLVASSFGQ TLKAGDEIII SALEHHANIV PWQLLKQRIS ITLKIIPVTP EGELDLGALP ALITERSRLI AVNHVSNALG TINPVAQIAA IAKAHGVPIL IDGAQGLPHG PVDVRAIDCD FYTLSGHKTF APSGVGVLYA RRPWLDELPP YQGGGDMIET VSFEAVRYAP PPAKFEAGTP NIEGAIGLNA ALQWHAALDW PAIKAHEAAL LAHATTALEA IAGLRIVGTA KNKVPVLSFI IEGAHPHDIG MLLDAQGIAV RTGQHCAMPV LQFLGAPQGT VRASFAFYNT LDEVDALVRA VHRAQHMLS
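The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It builds the codon table from the standard NCBI ordering; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, which this sketch does not model. The helper names (clean, translate, gc_percent) are hypothetical, and whitespace is stripped because the record prints sequences in blocks of ten.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between ten-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three ten-base blocks of the gene sequence in this record.
print(translate("ATGTCTTCGC TCAAACTCGA CGTTGCCGCC"))
```

Applied to the first thirty bases of the gene sequence, this yields the first ten residues of the listed protein sequence. Passing the full gene sequence to gc_percent is one way to compare against the GC content field.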