Gene Information |
Locus tag | Tgr7_1487 |
Symbol | |
ID | 7317973 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thioalkalivibrio sp. HL-EbGR7 |
Kingdom | Bacteria |
Replicon accession | NC_011901 |
Strand | + |
Start bp | 1591235 |
End bp | 1592455 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643616378 |
Product | cysteine desulfurase IscS |
Protein accession | YP_002513558 |
Protein GI | 220934659 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR02006] cysteine desulfurase IscS; [TIGR03402] cysteine desulfurase NifS |
|
|
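The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1591235
END_BP = 1592455
GENE_LENGTH_BP = 1221
PROTEIN_LENGTH_AA = 406


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 1592455 - 1591235 + 1 = 1221 bp, and 1221 / 3 = 407 codons, one of which is the stop codon, leaving 406 residues.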
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000124631 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGACA AGACCCCCAT CTACCTGGAC TATTCCGCGA CCACGCCGGT GGACGAGCGC GTGGCCGAGG AGATGGCCAA GTACCTGACC CGCGGCGGCA TCTTCGGCAA CCCGGCGTCG CGTTCCCACG TGTTCGGCTG GGACGCGGAG AAGGCGGTGG AGCGCGCCCG TGAGCAGGTG GCCGAGCTGG TGGGAGCCGA CCCCCGGGAG ATCGTCTGGA CCAGCGGCGC CACCGAGGCC AACAACCTGG CCATCAAGGG CGCGGCGCAC TTCTACCAGA AGAAGGGCCG TCACCTGATC ACCGTCAAGA CCGAGCACAA GGCGGTACTG GACACCTGCC GGCAGCTGGA GCGGGAGGGT TTCGAGGTCA CCTACCTGGA CGTGCAGTCC AACGGCCTGG TGGACCTGGA GGTGCTGAAA GCCGCCCTGC GCGAGGACAC GGTGCTGGTG TCGGTGATGC ACGTGAACAA CGAGATCGGC GTGATCCAGG ACATCGAGGC GATCGGCAAC CTGACCCGGG AGCGTGGCGT ACTGCTGCAC GTGGACGCGG CGCAGTCCAC CGGCAAGGTG GCCATCGACC TGTCGAGGCT GCCGGTGGAC CTGATGAGCT TCTCGGCCCA CAAGACCTAC GGCCCCAAGG GCATCGGCGC GCTGTACGTG CGCAGGAAGC CGCGGGTGCG CATCGAGGCG CAGATGCACG GCGGCGGTCA CGAGCGCGGC ATGCGTTCCG GGACCCTGGC CCCGCACCAG ATCGTGGGCA TGGGCGAGGC GTTTCGCATC GCCCGCGAGG AGATGGGTGC CGAGGTGGAG CGCATCCGCA TGCTGCGCGA CCGCCTGTGG ACGGGCCTCT CGGACATGGA CGAGGTGTAC CTGAACGGCG ACCTGGAGCG GCGCGTGGCG CACAACCTCA ACGTCTCGTT CAACTTCGTG GAGGGCGAGA GCCTGATCAT GGCGCTCAAG GACATCGCGG TGAGTTCGGG TTCGGCCTGC ACCAGCGCGA GCCTGGAACC CAGCTACGTG CTGCGTGCCC TGGGCCGGGA TGACGAGCTG GCGCACAGCT CGATCCGCTT CTCCATGGGC CGCTACACCA CGAGTGAGGA GGTGGACTAC ACCATCGACC TGGTGAAGAA CGCGGTGGCG AAGCTGCGCG AGCTCTCGCC CCTGTGGGAG ATGTACCAGG AAGGGGTGGA CCTGAAGAGT GTGCAGTGGG TGGCGCATTA G
|
Protein sequence | MADKTPIYLD YSATTPVDER VAEEMAKYLT RGGIFGNPAS RSHVFGWDAE KAVERAREQV AELVGADPRE IVWTSGATEA NNLAIKGAAH FYQKKGRHLI TVKTEHKAVL DTCRQLEREG FEVTYLDVQS NGLVDLEVLK AALREDTVLV SVMHVNNEIG VIQDIEAIGN LTRERGVLLH VDAAQSTGKV AIDLSRLPVD LMSFSAHKTY GPKGIGALYV RRKPRVRIEA QMHGGGHERG MRSGTLAPHQ IVGMGEAFRI AREEMGAEVE RIRMLRDRLW TGLSDMDEVY LNGDLERRVA HNLNVSFNFV EGESLIMALK DIAVSSGSAC TSASLEPSYV LRALGRDDEL AHSSIRFSMG RYTTSEEVDY TIDLVKNAVA KLRELSPLWE MYQEGVDLKS VQWVAH
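The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a minimal sketch, not the method used to build this record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table in the usual TCAG ordering (same amino-acid assignments
# for translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that split the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First three 10-base blocks of the gene sequence above.
    print(translate("ATGGCCGACA AGACCCCCAT CTACCTGGAC"))  # MADKTPIYLD
```

Passing the full gene sequence to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields of this record.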
|
| |