Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_0967 |
Symbol | csd |
ID | 4240460 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1068065 |
End bp | 1069264 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638104523 |
Product | cysteine desulfurase, aminotransferase, class V |
Protein accession | YP_719178 |
Protein GI | 113461110 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTTTG ATCCAACATT ATTTCGCCAG CAATTTCCTT TTTTTCACCA AAAAAATGGT ATCACTTATC TAGACAGTGC GGCGACCACC TTAAAACCGC AAGTTCTCCT TGACGCAACG CAGCATTTTT ATGCTTCTGC CGGCTCCGTA CATCGAAGCC AATATGATGC AAAACAGACC GCACTTTTTG AACAAGCGAG AAAGCAGGTG CAAAAACTGA TCAACGCTGA AAGTGAAGAA ACCGTTATTT GGACATCCGG CACCACACAA AGCATCAACA TTGTCGCCTA CGGCTTAATG GCACAACTTT CCCCAAACGA TGAAATTATC ATTAGCGAAG CGGAACATCA TGCAAATTTT GTTACTTGGC AACAAATTGC TAAAAAGTGC GGTGCAACAT TAATTATTTT ACCGTTGCAA GATAATTTAC TCATTGATCA ACAAATTCTA CAAAAATCCT TAAACAAAAA AACGAAATTA GTCGCCTTAA ATGTGATCTC CAATGTAACA GGAACACAAC AACCACTGAC ACAGCTTATC CCCATCATAC GAGAAAAAAG CAGTGCCTTG ATTTTATTAG ATTGTGCACA AGCCATTAAC CATCAAACCA TTGATCTTCA AAGGCTTGAT GCAGATTTTA TCGTTTTTTC CGCACATAAA ATTTACGGAC CGACAGGGTT GGGTGTATTA AGCGGAAAAC GCTCTGCTTT AGAACGATTA CAACCCAGTT TTTTCGGCGG AAAAATGGTT GAACAGGTTT CAAAACAAGA AACTGTTTTT GCCTCTTTAC CTTATCGCCT CGAAAGCGGC ACGCCTAATA TTGCTGGCGT AATTGGTTTT GGTGCGGTAT TAAATTGGCT GGAACAATGG AATATCACAC AAGGGGAACA ATTTGCCGTG CAACTGGCGG AAAAAACAAA AGAGCGGTTA AAAAATTATC CTCTTTGTCG CCTATTTAAT TCGCCACACC CAAGCACATT TGTTTGCTTT ACATTCACCA CAATCGCCAC ATCGGATATT GCGACCCTAC TTGCAGAACA ACATATTGCC TTGCGTAGTG GCGAACATTG TGCAACCCCC TATTTACAGC GTCTAGGACA AAAAAGCACC TTACGCCTCT CTTTCGCCCC TTACAATAAC CAAGAAGATA TTGAACTGTT TTTTAGTGCA TTGGATAAAA GTTTGGAGTT ATTGGCATGA
|
Protein sequence | MIFDPTLFRQ QFPFFHQKNG ITYLDSAATT LKPQVLLDAT QHFYASAGSV HRSQYDAKQT ALFEQARKQV QKLINAESEE TVIWTSGTTQ SINIVAYGLM AQLSPNDEII ISEAEHHANF VTWQQIAKKC GATLIILPLQ DNLLIDQQIL QKSLNKKTKL VALNVISNVT GTQQPLTQLI PIIREKSSAL ILLDCAQAIN HQTIDLQRLD ADFIVFSAHK IYGPTGLGVL SGKRSALERL QPSFFGGKMV EQVSKQETVF ASLPYRLESG TPNIAGVIGF GAVLNWLEQW NITQGEQFAV QLAEKTKERL KNYPLCRLFN SPHPSTFVCF TFTTIATSDI ATLLAEQHIA LRSGEHCATP YLQRLGQKST LRLSFAPYNN QEDIELFFSA LDKSLELLA
|
| |