Gene Franean1_2089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2089 
Symbol 
ID5670490 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2514482 
End bp2515807 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content72% 
IMG OID641241011 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001506432 
Protein GI158313924 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.819759 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCG TCGAGACACG CGCGGCGGGG GCCGGCGCGC ACGCCGGCGC CGGGTTCGAC 
GTCGAGCGGG TCCGTCGCGA CTTCCCGATC CTCGCCCGCA CGGTGCACGA CGGCCTGCCG
CTGGTCTACC TCGACAGCGC CGCGACGTCG CAGAAGCCGC TGGCCGTCCT CGACGCCGAG
CGGACCTACT ACGAGCGGCA CAACGCCAAC GTGCACCGCG GCATCCACGT GCTCGCCGAG
GAGGCCACGG CCCTCTACGA GGAGTCCCGG GACAAGATCG CGGCGTTCGT CGGCGCGCCC
GACCGGCGCG AGATCGTGTT CACGAAGAAC TCCTCGGAGG CGCTGAACCT CGTCGCCTAC
GCGATGAGCA ACGCCGTCAC GGGCGGCCCC GAGGCGGAGC GGTTCCGGCT CGGGCCGGGC
GACGAGGTCG TCATCACCGA GATGGAGCAC CACTCCAACC TGGTGCCCTG GCAGATGCTG
TGCGCGCGCA CCGGCGCCAC CCTGCGCTGG ATCGGTCTCA CCGAGGACGG CCGGCTCGAC
CTGGCGCACC TCGACGAGGT CATCACCGAC CGGGCGAAGA TCGTCTCCTT CGTGCACCAG
TCGAACATCC TCGGCACGGT CAACCCGGTC GCGACGATCG TCGCCCGGGC TCGTGAGGTC
GGGGCGCTCA CCGTGCTCGA CGGCTCCCAG TCGGTGCCGC ACATGCCGAT CGACGTCGTC
GACCTCGGGG TGGACTTCCT CGCCTTCACC GGGCACAAGA TGTGCGGGCC CACCGGCATC
GGCGTGCTGT GGGGGCGGCG CGAGCTGCTC GAGGTGATGC CTCCGTTCCT CGGTGGTGGC
GAGATGATCG AGGTCGTCAC CATGGAGGCC TCGACCTACG CGGCCCCGCC GCACCGCTTC
GAGGCGGGCA CCCCGATGAT CTCCCAGGCG ATCGGCCTGG GTGCCGCGGT GGACTACCTC
ACCGGCCTCG GCATGGACGC GGTCGCCGCG CACGAGCACG AGATCACCGC GTACGCGCTC
GACGCGCTCG CGGGCGTCCC GGGCCTGCGG GTCATCGGCC CGCCGACCGC CGAGGGCCGC
GGCGGGGCGA TCTCGTTCGC CCTGCGTGAC TCCGAGGACC GCCCGCTGCA CCCGCACGAC
GTCGGCCAGA TCCTGGACGA GCAGGGCGTC GCGGTGCGGG TCGGCCACCA CTGCGCCCGC
CCGGTCTGCC TGCGCTACGG CGTGCCGGCG ACCACCCGGG CCTCGTTCCA CCTGTACACG
AACACCGCCG ACGTCGACGC GTTGGTGGAA GGTCTCGGGC AGGTCAGGAG GTTCTTCCTG
AAGTGA
 
Protein sequence
MTVVETRAAG AGAHAGAGFD VERVRRDFPI LARTVHDGLP LVYLDSAATS QKPLAVLDAE 
RTYYERHNAN VHRGIHVLAE EATALYEESR DKIAAFVGAP DRREIVFTKN SSEALNLVAY
AMSNAVTGGP EAERFRLGPG DEVVITEMEH HSNLVPWQML CARTGATLRW IGLTEDGRLD
LAHLDEVITD RAKIVSFVHQ SNILGTVNPV ATIVARAREV GALTVLDGSQ SVPHMPIDVV
DLGVDFLAFT GHKMCGPTGI GVLWGRRELL EVMPPFLGGG EMIEVVTMEA STYAAPPHRF
EAGTPMISQA IGLGAAVDYL TGLGMDAVAA HEHEITAYAL DALAGVPGLR VIGPPTAEGR
GGAISFALRD SEDRPLHPHD VGQILDEQGV AVRVGHHCAR PVCLRYGVPA TTRASFHLYT
NTADVDALVE GLGQVRRFFL K