Gene Francci3_1389 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1389 
Symbol 
ID3903370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1670567 
End bp1671463 
Gene Length897 bp 
Protein Length298 aa 
Translation table11 
GC content70% 
IMG OID637878726 
Product5,10-methylenetetrahydrofolate reductase 
Protein accessionYP_480495 
Protein GI86740095 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0685] 5,10-methylenetetrahydrofolate reductase 
TIGRFAM ID[TIGR00676] 5,10-methylenetetrahydrofolate reductase, prokaryotic form 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.695264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000418344 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGCGG TCCGGAGTCT GAGCACGCAA CCCACGACGA TGGGCGAGTA CCTCGCCACC 
GGTCGTGCGT CCTTTTCCTT CGAGTTCTTT CCGCCGAAGA CGCCGGACGG CGAGCGGCAG
TTGTGGACGG CCCTGCGCCA GATCGAGGCG CTGCATCCCT CGTTCGTCTC GGTGACGTAC
GGGGCGGGCG GGTCGACGCG CGAGGGCACG ATCCGGGTGA CCGAGCGGAT CGCCACCGAC
ACCACCCTCA CGCCGATCGC CCATCTCACC GCGGTCAACC ACTCCGTCGC CGAGCTGCGC
CGGGTCATCG GCAACTACGC CGGCGTCGGG GTGCGCAACG TCCTGGCCCT GCGCGGCGAC
CCGCCAGGGG ATCCGCAGGG CACCTGGACC GCCCATCCCG CGGGCCTGCG GCACGCGGAC
GAACTCGTGC GCCTGGTGAG GTCGGTCGGC GGCTTCTGCG TCGGCGTGGC AGCCTTCCCC
GACAAGCACC CACGATCGGT CGACTTCGAC AGCGACGCGC GCTACCTGGT GGGCAAGTTC
GACGCCGGCG CGGACTACGC CATCACCCAG TTCTTCTTCG GCGCTGACGA CTATTTCCGT
CTCGTCGACC GGGTCCGCCG GCTGGGCTGC GACAAGCCGA TCATTCCTGG GATCATGCCG
GTGACGAACG TCGCGCAGAT CGTCCGGATG GCGCAGCTGT CCGGCGCGGC CTTCCCGCCG
GCGCTGGCCT CCCGGTTGCA GGCCGTCGCC GACGACCCGG CGGCCGTGCG GGCGATCGGG
GTCGAGGTGG CGACCGAGTT GTCCCGTCGC CTGCTGGACG GCGGTGCACC GGGCCTGCAC
TTCATCACCC TCAACCGCTC CAGCGCCACC CGAGAGATCT ACCAGGCGGT CCGATAG
 
Protein sequence
MAAVRSLSTQ PTTMGEYLAT GRASFSFEFF PPKTPDGERQ LWTALRQIEA LHPSFVSVTY 
GAGGSTREGT IRVTERIATD TTLTPIAHLT AVNHSVAELR RVIGNYAGVG VRNVLALRGD
PPGDPQGTWT AHPAGLRHAD ELVRLVRSVG GFCVGVAAFP DKHPRSVDFD SDARYLVGKF
DAGADYAITQ FFFGADDYFR LVDRVRRLGC DKPIIPGIMP VTNVAQIVRM AQLSGAAFPP
ALASRLQAVA DDPAAVRAIG VEVATELSRR LLDGGAPGLH FITLNRSSAT REIYQAVR