Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1664 |
Symbol | |
ID | 3903051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1998298 |
End bp | 1999626 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637879002 |
Product | SufS subfamily cysteine desulfurase |
Protein accession | YP_480769 |
Protein GI | 86740369 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0520] Selenocysteine lyase |
TIGRFAM ID | [TIGR01979] cysteine desulfurases, SufS subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCTGA CCGACGCGGC CGTGCGCCCG GGGTCCGGAC CCCGGCTCGT CGGCGGGTTC GACGTCGAGC AGGTCCGCCG GGACTTCCCC GTCCTGGCCC GCACGGTGCA CGGCGACCGG CCGCTGGTCT ACCTCGACAG CGCCGCCACC TCGCAGAAGC CGCTGGCGGT GCTCGACGCC GAGCGGACCT ACTACGAGCT GCACAACGCC AACGTGCACC GCGGCATCCA CGTGCTGGCC GAGGAGGCCA CCGGGCTGTA CGAGGCCTCC CGGGACAAGG TCGCCGCCTT CATCGGGGCG GGCGACCGCC GGGAGGTGGT GTTCACCAAG AACTCCTCGG AGGCGCTGAA CCTCGTCGCC TACGCGATGA GCAACGCCGG GGCCGCCGGC GCCGAGGCGG AGCGGTTCCG GGTCGGCCCG GGTGACGAGA TCGTCGTCAC CGAGATGGAG CATCACTCCA ATCTCGTGCC GTGGCAGATG CTGGCCCGGC GCACCGGGGC GACGCTGCGC TGGATCGGGC TGACCGACGA CGGCCGGCTC GACCTGAGCA ACCTCGATTC GATCATCACC GAGCGGGCGA AGGTCGTCGC CTTCGTCCAC CAGTCGAACA TTCTCGGCAC GATCAACCCG GTCGCGCAGG TGGTGGCGCG GGCCCGCGAG GTCGGCGCGC TGACCGTGCT CGACGGCTCC CAGTCGGTAC CGCACAACCC GGTCGACGTC ACCGAGCTCG GGGTGGACTT CCTGGCCTTC ACCGGGCACA AGATGTGCGC GCCCACCGGG GTCGGGGTGC TCTGGGGACG CTACGAGCTG CTTGGGGTCA TGCCGCCCTT CCTCGGCGGC GGCGAGATGA TCGAGCTTGT CACCATGGAG GGCTCGACCT ACGCGGCGCC GCCGCACCGG TTCGAGGCCG GCACCCCGAT GATTGCCCAG GTGGTCGGGC TCGGCGCGGC CGTCGACTAC CTGACCGCGC TCGGCATGCC GGCCGTCGCC GAGCACGAGC ACGCGGTGAC GGCCTACGCG CTGGACGCCT TCGCCGAGGT GCCGGGGTTG CGGATCATCG GTCCGCCGGC GGCGGACGCC CGCGGCGGGG CGATCTCCTT TGTGCTGCAC GAGGACGACG GCCGGCCGAT CCATCCCCAC GACGTCGGAC AGATCCTCGA CGAGCGCGGC ATTGCCGTGC GGGTCGGGCA CCACTGCGCC CGGCCGGTCT GCCTGCGGTT CGGGGTACCT GCGACGACCC GCGCGTCGTT CCACCTCTAC ACGACGACCG GCGAGGTCGA CGCCCTGGTG GAAGGACTGC ACGGGGTCCG GAGGTTCTTT CTGAGGTGA
|
Protein sequence | MTLTDAAVRP GSGPRLVGGF DVEQVRRDFP VLARTVHGDR PLVYLDSAAT SQKPLAVLDA ERTYYELHNA NVHRGIHVLA EEATGLYEAS RDKVAAFIGA GDRREVVFTK NSSEALNLVA YAMSNAGAAG AEAERFRVGP GDEIVVTEME HHSNLVPWQM LARRTGATLR WIGLTDDGRL DLSNLDSIIT ERAKVVAFVH QSNILGTINP VAQVVARARE VGALTVLDGS QSVPHNPVDV TELGVDFLAF TGHKMCAPTG VGVLWGRYEL LGVMPPFLGG GEMIELVTME GSTYAAPPHR FEAGTPMIAQ VVGLGAAVDY LTALGMPAVA EHEHAVTAYA LDAFAEVPGL RIIGPPAADA RGGAISFVLH EDDGRPIHPH DVGQILDERG IAVRVGHHCA RPVCLRFGVP ATTRASFHLY TTTGEVDALV EGLHGVRRFF LR
|
| |