Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Xcel_1361 |
Symbol | |
ID | 8648882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Xylanimonas cellulosilytica DSM 15894 |
Kingdom | Bacteria |
Replicon accession | NC_013530 |
Strand | + |
Start bp | 1485587 |
End bp | 1486927 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | cysteine desulfurase, SufS subfamily |
Protein accession | YP_003325950 |
Protein GI | 269956161 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.938144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCATGA ACCCGGTGGT CGAGCCTGTC GAAGCCACGT CCGGTATCGG TGCTGTGGTC TCGACAGGCT CGGCCACCGG GACACCGGTC GACTGGAGCG CCGTGCGCTC CGACTTCCCG CTGCTCGGGC GCACGGTGCG CGGCGGTCGA CCTCTCGTCT ACCTCGACTC CGCCGCCACC TCGCAGAAGC CGAACGTGGT CCTCGAGGCC GAGGTCGACT TCTACGAGCA GCGGAACGCC GCCGTGCACC GTGGCGCGCA CTTCCTCGCC GAGGAGGCGA CGTCGGCCTT CGAGGAGGCC CGCACCGCGG TCGCCGCGTT CGTCGGGGCC GACGACGACG AGATCGTCTG GACCTCCGGC GCGACGGCGG CGATCAACCT CGTCGCCTAC GCGTTCTCCA ACGCCACCCT CGGCCGGGGC GGCGCCGCCG CCTCCCGGTT CGCGCTGCGC CCCGGTGACG AGATCGTCGT CACCCGGGCT GAGCACCACG CCAACCTCGT GCCGTGGCAG GAGCTCGCCG CCCGCACCGG CGCCGTGCTG CGCTGGTTCG AGGTGTCCGA CGACGGCCGG CTCTCGCTCG ACGACGGCGT GATCACGGAA CGCACCCGGA TCGTCGCGTT CGCGCACGCC TCCAACGTGA CCGGCGCCGT GGCACCCGTG GCCGCCCTTG TCGCGGCCGC GAAGGCCGTC GGCGCGTACA CCGTGCTCGA CGCCTGCCAG TCCGTACCGC ACCTGCCCGT CGACCTGCAC GCCCTCGATG TCGACTTCGC CGCCTTCTCC GGGCACAAGA TGCTCGGCCC CACCGGCGTC GGCGCCCTGT ACGGACGACG CGAGCTGCTC GCCGACCTGC CACCCGTGAC CACCGGCGGG TCCATGGTCG AGGTCGTCAC CATGGAGTCG ACCACCTACG CACCCCCGCC GCAGCGGTTC GAGGCCGGCA CGCAGATGGT CGCGCAGGCC GTCGGCCTCG GCGTCGCCGC GCAATGGCTC GGCGAGCTCG GGATGCCCGC GGTCGCCGAG CACGAGCGTG CGCTCGCCGC CGAGCTGCTT CGCATTGCCG ACATCCCCGG GGTGCGGGTG ATCGGGCCGC TCGACACGAC CGACCGGCTC GCCGTGGTGT CCTTCGTCGT CGACGGCGTG CACGCCCACG ACGTCGGGCA GGTGCTCGAC GACCGCGGCA TCGCGGTGCG CGTCGGGCAC CACTGCGCGC AGCCGCTGCA CCGCCGGTTC GGGGTGGCCG CCACCGCGCG CGCCTCGGCG TCGGTCTACA CGACCCTCGA CGACGTCGTG GCGTTCCGGG AAGCCCTGGC GGGGGTCCGG GCGTTCTTCG GAGCGGCGTA G
|
Protein sequence | MVMNPVVEPV EATSGIGAVV STGSATGTPV DWSAVRSDFP LLGRTVRGGR PLVYLDSAAT SQKPNVVLEA EVDFYEQRNA AVHRGAHFLA EEATSAFEEA RTAVAAFVGA DDDEIVWTSG ATAAINLVAY AFSNATLGRG GAAASRFALR PGDEIVVTRA EHHANLVPWQ ELAARTGAVL RWFEVSDDGR LSLDDGVITE RTRIVAFAHA SNVTGAVAPV AALVAAAKAV GAYTVLDACQ SVPHLPVDLH ALDVDFAAFS GHKMLGPTGV GALYGRRELL ADLPPVTTGG SMVEVVTMES TTYAPPPQRF EAGTQMVAQA VGLGVAAQWL GELGMPAVAE HERALAAELL RIADIPGVRV IGPLDTTDRL AVVSFVVDGV HAHDVGQVLD DRGIAVRVGH HCAQPLHRRF GVAATARASA SVYTTLDDVV AFREALAGVR AFFGAA
|
| |