Gene Francci3_1664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1664 
Symbol 
ID3903051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1998298 
End bp1999626 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content72% 
IMG OID637879002 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_480769 
Protein GI86740369 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTGA CCGACGCGGC CGTGCGCCCG GGGTCCGGAC CCCGGCTCGT CGGCGGGTTC 
GACGTCGAGC AGGTCCGCCG GGACTTCCCC GTCCTGGCCC GCACGGTGCA CGGCGACCGG
CCGCTGGTCT ACCTCGACAG CGCCGCCACC TCGCAGAAGC CGCTGGCGGT GCTCGACGCC
GAGCGGACCT ACTACGAGCT GCACAACGCC AACGTGCACC GCGGCATCCA CGTGCTGGCC
GAGGAGGCCA CCGGGCTGTA CGAGGCCTCC CGGGACAAGG TCGCCGCCTT CATCGGGGCG
GGCGACCGCC GGGAGGTGGT GTTCACCAAG AACTCCTCGG AGGCGCTGAA CCTCGTCGCC
TACGCGATGA GCAACGCCGG GGCCGCCGGC GCCGAGGCGG AGCGGTTCCG GGTCGGCCCG
GGTGACGAGA TCGTCGTCAC CGAGATGGAG CATCACTCCA ATCTCGTGCC GTGGCAGATG
CTGGCCCGGC GCACCGGGGC GACGCTGCGC TGGATCGGGC TGACCGACGA CGGCCGGCTC
GACCTGAGCA ACCTCGATTC GATCATCACC GAGCGGGCGA AGGTCGTCGC CTTCGTCCAC
CAGTCGAACA TTCTCGGCAC GATCAACCCG GTCGCGCAGG TGGTGGCGCG GGCCCGCGAG
GTCGGCGCGC TGACCGTGCT CGACGGCTCC CAGTCGGTAC CGCACAACCC GGTCGACGTC
ACCGAGCTCG GGGTGGACTT CCTGGCCTTC ACCGGGCACA AGATGTGCGC GCCCACCGGG
GTCGGGGTGC TCTGGGGACG CTACGAGCTG CTTGGGGTCA TGCCGCCCTT CCTCGGCGGC
GGCGAGATGA TCGAGCTTGT CACCATGGAG GGCTCGACCT ACGCGGCGCC GCCGCACCGG
TTCGAGGCCG GCACCCCGAT GATTGCCCAG GTGGTCGGGC TCGGCGCGGC CGTCGACTAC
CTGACCGCGC TCGGCATGCC GGCCGTCGCC GAGCACGAGC ACGCGGTGAC GGCCTACGCG
CTGGACGCCT TCGCCGAGGT GCCGGGGTTG CGGATCATCG GTCCGCCGGC GGCGGACGCC
CGCGGCGGGG CGATCTCCTT TGTGCTGCAC GAGGACGACG GCCGGCCGAT CCATCCCCAC
GACGTCGGAC AGATCCTCGA CGAGCGCGGC ATTGCCGTGC GGGTCGGGCA CCACTGCGCC
CGGCCGGTCT GCCTGCGGTT CGGGGTACCT GCGACGACCC GCGCGTCGTT CCACCTCTAC
ACGACGACCG GCGAGGTCGA CGCCCTGGTG GAAGGACTGC ACGGGGTCCG GAGGTTCTTT
CTGAGGTGA
 
Protein sequence
MTLTDAAVRP GSGPRLVGGF DVEQVRRDFP VLARTVHGDR PLVYLDSAAT SQKPLAVLDA 
ERTYYELHNA NVHRGIHVLA EEATGLYEAS RDKVAAFIGA GDRREVVFTK NSSEALNLVA
YAMSNAGAAG AEAERFRVGP GDEIVVTEME HHSNLVPWQM LARRTGATLR WIGLTDDGRL
DLSNLDSIIT ERAKVVAFVH QSNILGTINP VAQVVARARE VGALTVLDGS QSVPHNPVDV
TELGVDFLAF TGHKMCAPTG VGVLWGRYEL LGVMPPFLGG GEMIELVTME GSTYAAPPHR
FEAGTPMIAQ VVGLGAAVDY LTALGMPAVA EHEHAVTAYA LDAFAEVPGL RIIGPPAADA
RGGAISFVLH EDDGRPIHPH DVGQILDERG IAVRVGHHCA RPVCLRFGVP ATTRASFHLY
TTTGEVDALV EGLHGVRRFF LR