Gene Information |
Locus tag | Franean1_2089 |
Symbol | |
ID | 5670490 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 2514482 |
End bp | 2515807 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641241011 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_001506432 |
Protein GI | 158313924 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
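The coordinate and length fields above are related by simple arithmetic. A minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the gene length includes one stop codon (both assumptions agree with the values listed in this record, but the record does not state them). The function names are illustrative only:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Franean1_2089.
length_bp = gene_length(2514482, 2515807)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1326 441
```

The printed values match the Gene Length (1326 bp) and Protein Length (441 aa) fields.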
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.819759 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCGTCG TCGAGACACG CGCGGCGGGG GCCGGCGCGC ACGCCGGCGC CGGGTTCGAC GTCGAGCGGG TCCGTCGCGA CTTCCCGATC CTCGCCCGCA CGGTGCACGA CGGCCTGCCG CTGGTCTACC TCGACAGCGC CGCGACGTCG CAGAAGCCGC TGGCCGTCCT CGACGCCGAG CGGACCTACT ACGAGCGGCA CAACGCCAAC GTGCACCGCG GCATCCACGT GCTCGCCGAG GAGGCCACGG CCCTCTACGA GGAGTCCCGG GACAAGATCG CGGCGTTCGT CGGCGCGCCC GACCGGCGCG AGATCGTGTT CACGAAGAAC TCCTCGGAGG CGCTGAACCT CGTCGCCTAC GCGATGAGCA ACGCCGTCAC GGGCGGCCCC GAGGCGGAGC GGTTCCGGCT CGGGCCGGGC GACGAGGTCG TCATCACCGA GATGGAGCAC CACTCCAACC TGGTGCCCTG GCAGATGCTG TGCGCGCGCA CCGGCGCCAC CCTGCGCTGG ATCGGTCTCA CCGAGGACGG CCGGCTCGAC CTGGCGCACC TCGACGAGGT CATCACCGAC CGGGCGAAGA TCGTCTCCTT CGTGCACCAG TCGAACATCC TCGGCACGGT CAACCCGGTC GCGACGATCG TCGCCCGGGC TCGTGAGGTC GGGGCGCTCA CCGTGCTCGA CGGCTCCCAG TCGGTGCCGC ACATGCCGAT CGACGTCGTC GACCTCGGGG TGGACTTCCT CGCCTTCACC GGGCACAAGA TGTGCGGGCC CACCGGCATC GGCGTGCTGT GGGGGCGGCG CGAGCTGCTC GAGGTGATGC CTCCGTTCCT CGGTGGTGGC GAGATGATCG AGGTCGTCAC CATGGAGGCC TCGACCTACG CGGCCCCGCC GCACCGCTTC GAGGCGGGCA CCCCGATGAT CTCCCAGGCG ATCGGCCTGG GTGCCGCGGT GGACTACCTC ACCGGCCTCG GCATGGACGC GGTCGCCGCG CACGAGCACG AGATCACCGC GTACGCGCTC GACGCGCTCG CGGGCGTCCC GGGCCTGCGG GTCATCGGCC CGCCGACCGC CGAGGGCCGC GGCGGGGCGA TCTCGTTCGC CCTGCGTGAC TCCGAGGACC GCCCGCTGCA CCCGCACGAC GTCGGCCAGA TCCTGGACGA GCAGGGCGTC GCGGTGCGGG TCGGCCACCA CTGCGCCCGC CCGGTCTGCC TGCGCTACGG CGTGCCGGCG ACCACCCGGG CCTCGTTCCA CCTGTACACG AACACCGCCG ACGTCGACGC GTTGGTGGAA GGTCTCGGGC AGGTCAGGAG GTTCTTCCTG AAGTGA
Protein sequence | MTVVETRAAG AGAHAGAGFD VERVRRDFPI LARTVHDGLP LVYLDSAATS QKPLAVLDAE RTYYERHNAN VHRGIHVLAE EATALYEESR DKIAAFVGAP DRREIVFTKN SSEALNLVAY AMSNAVTGGP EAERFRLGPG DEVVITEMEH HSNLVPWQML CARTGATLRW IGLTEDGRLD LAHLDEVITD RAKIVSFVHQ SNILGTVNPV ATIVARAREV GALTVLDGSQ SVPHMPIDVV DLGVDFLAFT GHKMCGPTGI GVLWGRRELL EVMPPFLGGG EMIEVVTMEA STYAAPPHRF EAGTPMISQA IGLGAAVDYL TGLGMDAVAA HEHEITAYAL DALAGVPGLR VIGPPTAEGR GGAISFALRD SEDRPLHPHD VGQILDEQGV AVRVGHHCAR PVCLRYGVPA TTRASFHLYT NTADVDALVE GLGQVRRFFL K
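The sequences above are displayed in blocks of 10 characters separated by spaces, so they need cleaning before any computation. A minimal sketch, assuming GC content means the percentage of G and C bases among all bases of the gene sequence (the record does not state how its 72% figure was derived). The helper names are hypothetical:

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between 10-character blocks and uppercase the result."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (assumed definition)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two display blocks of the gene sequence in this record.
first_blocks = "ATGACCGTCG TCGAGACACG"
print(len(clean_sequence(first_blocks)))  # 20
```

Passing the full Gene sequence value to `gc_percent` would give a figure to compare against the GC content field.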