Gene Francci3_0533 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0533 
Symbol 
ID3905444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp619171 
End bp620358 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content68% 
IMG OID637877862 
Producthypothetical protein 
Protein accessionYP_479646 
Protein GI86739246 
COG category[H] Coenzyme transport and metabolism
[R] General function prediction only 
COG ID[COG1060] Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes 
TIGRFAM ID[TIGR00423] radical SAM domain protein, CofH subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.271829 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.131592 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGTCG ATGGTGAGAT TCCGGTGGGT CGGGGTGGTG GCGAGATCCG GTCTGTGCTG 
GACCGGGCCG CAGCCGGCGG GCGTATCTCC GCGGAGGAGG CGCTGCTCCT CTATACGAGG
GCGCCGCTGC ACGCGCTCGG CGGCGCGGCG GACACGGTCC GTCGGCGCCG TTTCCCCGAT
GGCATCGCGA CGTACATCAT CGACCGGAAC ATCAACTACA CGAATGTCTG CGTGACCGCC
TGCCGGTTTT GCGCCTTCTA CCGCGCTCCG AAGCATGCCG AGGGCTGGGT CCGCGACGTC
GAGGACATCG TCGCCAAGTG CGGGGAGGCG GTCGAGCTCG GCGCCACGCA GATCATGCTG
CAGGGCGGGC ACCATCCCGA CTTCGGCATC GAGTGGTATG AGCGTACCTT TGCCGCCATC
AAGAAGGCAT ATCCCCAGTT GGCGCTGCAC TCGCTGGGCG CCAGCGAGGT TGTGCACATC
GCCCGGACGT CCGATCTGAC TTTTCCCGAG GTCATTACCC GGCTGCGGGA CGCGGGCCTG
GACAGCTTCG CGGGCGCGGG AGCGGAGATT CTCACCGAAC GGCCCCGGCA GGCGATCGCT
CCGCTGAAGG AGCCCGGTCA CGTCTGGCTG TCCGTGATGG AGACCGCCCA CAACCTCGGC
CTGGAATCCA CCGCCACCTT CATGATGGGC ACGGGGGAGA CGAACGCCGA GCGCATCGAA
CACCTGACGA TGATCCGGGA CGTCCAGGAC CGGACCGGCG GGTTCCGTTC GTTCATCCCC
TGGACCTACC AGCCGGAGAA CAATCATCTC GGCGGGCGCA CCCAGGCGAC GACCCTGGAG
TACCTCCGCC TCGTCGCGGT CGCGCGGCTG TTCTTCGACA ACATCACGCA CCTGCAGGGC
TCCTGGCTGA CCACCGGCAA GGAGATCGGC CAGCTCACCC TGCACATGGG CGCCGACGAC
CTCGGCTCGG TGATGCTGGA GGAGAACGTC GTCTCCTCCG CCGGGGCGCG CCACCGCACC
AACCGGTCGG AGCTGATCTC CCTGATCCGT GCTGCCGGCC GCATCCCCGC TCAGCGCGAC
ACCCGCTACC AGCACCTCGT CGTGCACCGC GACCCGGCGC AGGACCCGGT TGACGACCGG
GTGGCCTCGC ACTTCTCCTC CACCGCGCTA CCGCTCATCT CCGCGTAG
 
Protein sequence
MDVDGEIPVG RGGGEIRSVL DRAAAGGRIS AEEALLLYTR APLHALGGAA DTVRRRRFPD 
GIATYIIDRN INYTNVCVTA CRFCAFYRAP KHAEGWVRDV EDIVAKCGEA VELGATQIML
QGGHHPDFGI EWYERTFAAI KKAYPQLALH SLGASEVVHI ARTSDLTFPE VITRLRDAGL
DSFAGAGAEI LTERPRQAIA PLKEPGHVWL SVMETAHNLG LESTATFMMG TGETNAERIE
HLTMIRDVQD RTGGFRSFIP WTYQPENNHL GGRTQATTLE YLRLVAVARL FFDNITHLQG
SWLTTGKEIG QLTLHMGADD LGSVMLEENV VSSAGARHRT NRSELISLIR AAGRIPAQRD
TRYQHLVVHR DPAQDPVDDR VASHFSSTAL PLISA