Gene Francci3_0519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0519 
Symbol 
ID3905177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp605148 
End bp606296 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content66% 
IMG OID637877848 
Producthypothetical protein 
Protein accessionYP_479632 
Protein GI86739232 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.520088 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGCTG GGCTCAAGCG GGAGATCGAG GCCAAAGTTT CCGCCGGCGA GAGACTGAGC 
CGTGCCGACG GGGAGGCGCT GTACGCCAGC GACGACCTCG TCTGGTTGGG CGGTCTCGCC
CACGAGGTTC GCACCAGAAA GAACGGCGAC AAGACCTTCT TCAACGTCAA CCGGCATCTG
AACCTGACGA ACGTCTGCTC GGCGTCCTGC GCGTACTGCT CGTTCCAGCG CAAGCCCGGT
GAGTCCGATG CCTACACCAT GCGCATCGAG GAGGCTGTCC GGCTGGCGAA GGAGATGGAG
CCGGCCGGGA TCACCGAGCT GCATATCGTC AACGGTCTGC ATCCGACGCT GCCCTGGCGT
TACTATCCGC GCTCGCTGCG GGAGCTCGGC AAGGCGCTGC CCGGCGTCGC CTTGAAGGCG
TTCACCGCTA CCGAGATCCA CTGGTTTGAG AAGATCAGTG GCCTGTCCGC CGATGAGATC
CTCGACGAGC TCATCGACGC GGGACTCGAG TCGCTCACCG GCGGCGGCGC GGAGATCTTC
GACTGGGAGG TCCGCCAGAA GATCGTCGGT CACGAGACGC ACTGGGAGGA CTGGTCGCGC
ATCCACCGGC TCGCCCACGC CAAGGGCCTG CGCACTCCGT GCACGATGCT GTACGGGCAC
GTCGAGGACC CCCGGCACCG GGTGGACCAC GTGCTGCGGC TGCGTGAGCT GCAGGATTCC
ACGGGCGGGT TCACGGTCTT CATCCCGCTG CGCTTCCAGC ACGACGCCGC CGGCGACCCG
CGCAACCGGT TGATGAACCA GCCGATGGCG ACGGGGGCCG AGGCGTTGAA GACGTTCGCC
GTCTCCCGGC TGCTGTTCGA CAACGTGGAC CACATCAAGT GCTTCTGGGT GATGCATGGT
CTCACCACGG CGCAGCTCGC GTTGAACTTC GGCGCCGACG ACCTCGACGG TTCCGTTGTC
GAGTACAAGA TCACGCACGA TGCGGACCGG TTCGGGACGC CGCACACCAT GACCCGCGAG
GATCTGCTCG CAATCATCCG CGACGCCGGC TTCCGCCCGG TCGAGCGGGA CACCCGCTAC
CGGGAGATCC GTGTCTACGA CGGTCCCGAC CCGGCTCGGC GTGACGTCCC GACCTCGATC
GACGCCTGA
 
Protein sequence
MDAGLKREIE AKVSAGERLS RADGEALYAS DDLVWLGGLA HEVRTRKNGD KTFFNVNRHL 
NLTNVCSASC AYCSFQRKPG ESDAYTMRIE EAVRLAKEME PAGITELHIV NGLHPTLPWR
YYPRSLRELG KALPGVALKA FTATEIHWFE KISGLSADEI LDELIDAGLE SLTGGGAEIF
DWEVRQKIVG HETHWEDWSR IHRLAHAKGL RTPCTMLYGH VEDPRHRVDH VLRLRELQDS
TGGFTVFIPL RFQHDAAGDP RNRLMNQPMA TGAEALKTFA VSRLLFDNVD HIKCFWVMHG
LTTAQLALNF GADDLDGSVV EYKITHDADR FGTPHTMTRE DLLAIIRDAG FRPVERDTRY
REIRVYDGPD PARRDVPTSI DA