Gene Franean1_6111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6111 
Symbol 
ID5674432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7438201 
End bp7439379 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content67% 
IMG OID641244963 
Productradical SAM domain-containing protein 
Protein accessionYP_001510361 
Protein GI158317853 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0611343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.194203 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCGGTCC GCGGAGCTAG GCTGGATCGC ATGGATGCCG GGCTCAAGCG CGAGATCGAA 
GCCAAGGTCC ACGACGGTGC CCGGCTCAGC CGGGCCGACG GTGAGGCCCT CTACGCGAGC
GACGACCTCG CCTGGCTGGG CGGTCTCGCG CACGAGGTGC GTACCCGAAA GAACGGCGAC
CAGACCTTCT TCAACGTGAA CCGGCACCTC AACCTGACGA ACGTCTGCTC GGCCTCGTGC
GCCTACTGCT CGTTCCAACG CAAGCCCGGC GAGTCGGACG CCTACACTAT GCGCATCGAG
GAGGCCGTCC GGCTGGCGAA GGATATGGAG CCGGCCGGGA TCACCGAGCT GCACATCGTC
AACGGCCTGC ACCCGACGTT GCCGTGGCGT TACTATCCGC GGTCGCTGCG CGAGCTGGGG
AAGGCGCTGC CCGGCGTCGC GCTCAAGGCG TTCACCGCCA CCGAGATCCA CTGGTTCGAG
AAGATCAGCG GCCTCTCCGC GGACGAGATC CTGGACGAGC TCATCGACGC GGGCCTGGAG
TCGTTGACGG GCGGCGGCGC GGAGATCTTC GACTGGGAGG TGCGGCAGAA GATCGTCGGC
CACGAGACCC ACTGGGAGGA CTGGTCGCGG ATCCACCGTC TCGCGCACTC CAAGGGCCTG
CGCACGCCGT GCACGATGCT GTACGGCCAT GTCGAGGAGC CGCGGCACCG GGTCGACCAC
GTGCTGCGCC TGCGTGAGCT GCAGGACGAG ACGGGCGGTT TCGCGGTGTT CATCCCGCTG
CGCTTCCAGC ACGACTCGGT CGGCGATCCC CGCAACCGCC TGATGAACCA GCCGATGGCG
ACCGGCGCGG AGGCTCTCAA GACGTTCGCG GTGTCGCGGC TGCTGTTCGA CAACGTCGAT
CACGTCAAGT GCTTCTGGGT GATGCACGGG CTGACCACCG CCCAGCTGTC CCTGAACTTC
GGCGTCGACG ACCTCGACGG CTCGGTCGTC GAGTACAAGA TCACTCACGA CGCGGACGGC
TTCGGAACGC CGAACACGAT GACCCGGGAG GATCTTCTAT CCGTGATCCG TGACGCGGGC
TTCCGGCCGG TCGAGCGGGA CACCCGCTAC CGGGTCGTGC GCAGGTACGA CGGTCCGGAC
ACCACCCGGC GGGACAACCC CGTCTCGATC GACGCCTGA
 
Protein sequence
MSVRGARLDR MDAGLKREIE AKVHDGARLS RADGEALYAS DDLAWLGGLA HEVRTRKNGD 
QTFFNVNRHL NLTNVCSASC AYCSFQRKPG ESDAYTMRIE EAVRLAKDME PAGITELHIV
NGLHPTLPWR YYPRSLRELG KALPGVALKA FTATEIHWFE KISGLSADEI LDELIDAGLE
SLTGGGAEIF DWEVRQKIVG HETHWEDWSR IHRLAHSKGL RTPCTMLYGH VEEPRHRVDH
VLRLRELQDE TGGFAVFIPL RFQHDSVGDP RNRLMNQPMA TGAEALKTFA VSRLLFDNVD
HVKCFWVMHG LTTAQLSLNF GVDDLDGSVV EYKITHDADG FGTPNTMTRE DLLSVIRDAG
FRPVERDTRY RVVRRYDGPD TTRRDNPVSI DA