Gene Franean1_4922 details

Gene Information

Locus tag: Franean1_4922
Symbol:
ID: 5673262
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5908905
End bp: 5910461
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 77%
IMG OID: 641243777
Product: cysteine desulfurase
Protein accession: YP_001509193
Protein GI: 158316685
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID:
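
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 5908905
END_BP = 5910461
GENE_LENGTH_BP = 1557
PROTEIN_LENGTH_AA = 518


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record: 1557 bp and 518 aa.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the record is internally consistent: 5910461 - 5908905 + 1 = 1557 bp, and 1557 / 3 - 1 = 518 aa.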


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00686408
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGTCCCTTT CGTATGCCAC CACGTCTCCC GACGTTGATC GGGAAGGGCC CGCCGTCGGC 
GCGCTGCTCG ACGTGGTGGG GGCCGGCATC CCCGTTCCGC TCGCCGACGG ACGCGAGGTC
CCGTACGCCA ATCTCGACCA GGCCGCCAGC GCCCCCTGCC TGCGCGGGGT CGCCGAGCAC
GTCGAGCGCG TCCTGCCGTA CTCGGCGAGC GTGCACCGCG GAACCGGCTA CTCCTCCGCG
GTCTGCACCG CGCTCTACGA GGGGGCCCGC GCCGCCGTGC GCACGTTCGT CGGCGGCCGC
CCGGACGACG TCGTGATCTT CACCCGGAAC ACCACCGACT CGGTGAACCT GCTCGCCCGC
TGCCTCCCGC CGCAGCCGCC CGACCCGCCG CAGCCGCCCG ACCCGGCGCG GACGTCCGAC
CCGGGCCACG CCGAGCCCGG CGGGGTTGTC GTGTTCGACC TGGAGCATCA CGCGAACCTT
CTCCCGTGGC GGTCCCGGCC GGGCTGCCGG TGGGTGCCCG CCGCGCCCAC CCGCGCCGAC
ACGCTGCGCG CGCTCGCCAC CGCCCTGGAC ACGGCACCGA CCTCGCTCGT CGCGGTGACC
GGCGCGTCCA ACGTCACCGG CGAGGTGCCG CCGCTCGCCG AGATCGTCCG GCTGGCGCGC
GCGGCCGGCG CGCGGGTCTT CGTGGACGGC GCCCAGCTCG TCCCGCACCG TCGGGTCGAC
ATGGCCGCGC TCGGCATCGA CTACCTCGCG TTCTCCGGGC ACAAGCTGTA CGCGCCGTTC
GGCGCCGGCG TGCTCGTCGG ACGTCCGGAC TGGCTGGCGG CGGCCCCGCC CTATCTCGCC
GGCGGGGGCG CCGTGCGTGA GGTGACCAAC TCCGCCGTGG CCTGGGCGGA CGGCCCGGCC
CGGCACGAGG CCGGCAGCCC CAACCTCCTC GGCGCGACGG CGATCGCGGC CGCCTGCCGG
CTGCTGGGCG CGCTCGCCCC CCGCGACCTG CACCAGCACG AGGACCTGCT GCGCCGCCGC
CTGGTGGACG GGCTGCGCGC GATCGAGGGA GTCACCATCC ACTCCCTGTG GGCCGACGGA
GACGATCCCG CCGCTGGCGA GGCCGCCGGA GGCGAGGGCC CGGACGGGAT CCTGGCCACG
GGGCCCGTCG GGGTCGTCAC CTTCTCGGTC GCCGGGCGCG ATCCCGGGTT CGTCGCCGCC
GTCCTCTCCG CCGAGCACGG CGTCGGGGTA CGCGCCGGAC GCTTCTGCGC GCATCCGCTG
CTGGGGCGGC TCGGAGCCGA GGGCGGCGCG ATCCGCGCGA GTGTCGGCAT CAGCTCGACG
AGTGCCGACG TCGACAGGCT GCTGGCAGGG CTGGCGGAGC TCGTCGGCCG CGGGCCGCGC
CAGCAGTACC GCGACCTGGG CGACGGAGGC TGGGCTCCGG CATCGGACGG TCGCGCGCTG
CCGCCCTGGG TCGCCGAGCA CATGGCGGCC GGTCACGTCC GCGGTCGTGT GCCCGCGCCG
GGCCACGCGC ACGACAGCGC CGGCCTGCCC GCGTACTCCA GCCCCTGCGG GACCTGA
 
Protein sequence
MSLSYATTSP DVDREGPAVG ALLDVVGAGI PVPLADGREV PYANLDQAAS APCLRGVAEH 
VERVLPYSAS VHRGTGYSSA VCTALYEGAR AAVRTFVGGR PDDVVIFTRN TTDSVNLLAR
CLPPQPPDPP QPPDPARTSD PGHAEPGGVV VFDLEHHANL LPWRSRPGCR WVPAAPTRAD
TLRALATALD TAPTSLVAVT GASNVTGEVP PLAEIVRLAR AAGARVFVDG AQLVPHRRVD
MAALGIDYLA FSGHKLYAPF GAGVLVGRPD WLAAAPPYLA GGGAVREVTN SAVAWADGPA
RHEAGSPNLL GATAIAAACR LLGALAPRDL HQHEDLLRRR LVDGLRAIEG VTIHSLWADG
DDPAAGEAAG GEGPDGILAT GPVGVVTFSV AGRDPGFVAA VLSAEHGVGV RAGRFCAHPL
LGRLGAEGGA IRASVGISST SADVDRLLAG LAELVGRGPR QQYRDLGDGG WAPASDGRAL
PPWVAEHMAA GHVRGRVPAP GHAHDSAGLP AYSSPCGT
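
The gene begins with GTG while the protein begins with M, which is consistent with translation table 11, where GTG can act as an initiator codon read as methionine. The sketch below illustrates that relationship and a GC-content helper. It is an illustration, not the database's own pipeline: the function names are hypothetical, the start-codon set is an assumed subset of the table 11 initiators, and the amino-acid assignments are those of the standard code, which table 11 shares.

```python
# Codon table built in TCAG order (standard code assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Assumption: a subset of the initiator codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and upper-case the result."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            amino_acid = "M"  # initiator codon is read as methionine
        else:
            amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("GTGTCCCTTT CGTATGCCAC CACGTCTCCC"))  # MSLSYATTSP
```

Passing the full gene sequence to `translate_cds` and `gc_percent` would allow a comparison against the listed protein sequence and the reported GC content.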