Gene Franean1_5271 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5271 
Symbol 
ID5673605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6339986 
End bp6341392 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content74% 
IMG OID641244126 
ProductFolC bifunctional protein 
Protein accessionYP_001509535 
Protein GI158317027 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.73776 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0925623 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGCGC CGGCCGCGCC CTCGGAACGG GCGTCCCGAT GCCATAACCT CGGCTCCGTG 
ACAGACGCAC GCACGGGGCG CACGCCGGAC GACGCGGCCG AGCTCGCCCG GGTGGAGAAG
GCACTGGCCG GGCGGTTCCC GCACCGGATG CACCCGGACC TGGACCGGAT GCGGGATCTC
GTCGACCTGC TGGGGCACCC GGAGCGGTCC TTCCCGTCCA TCCACATCAC CGGGACGAAC
GGCAAGACGT CCACGGCTCG GATGATCGAC GCGCTGCTTC GGGGCTTCGG GCTGCGGCCG
GGCCGGTACA CCTCCCCGCA CATGGAAAGC GTCACGGAGC GGATCAGCAT CAACGGCGTG
CCCGCCGACG CTGAGGTGTT CGCCAGGGCC TACGACGACG TGATCCCCTA CGCGGAGCTC
GTCGACAGCC GCCATCCCGA GCGGGTCACG TTCTTCGAGC TGCTCACCGC GATGGCGTTC
TCCGCCTTCT CCGACTCGCC CGTGGACGCC GGGGTCGTCG AGGTCGGCAT GGGCGGCACC
TGGGACGCGA CGAACGTGGT CGACGCGGGC GTCGCGGTGA TCACGCCGGT CTCCCTCGAC
CATCCGGAGC TGGGTGACAC CGTGGAGGCG GTGGCGGCGG AGAAGGCGGG GATCATCCGG
CCGGACGCCC TGGTGGTGCT GGCGCAGCAG TCGCTGCCCG CCGCCGAGGT CCTGCTGCGG
CGCAGCGCCG AGGTCGGCGC CACCGTCGCG CGGGAGGGCC TGGAGTTCGG CGTCGTCGGC
CGGCGGGTGG CGGTGGGGGG CCAGGTGCTG ACCCTGCGCG GTCTGGGCGG GACGTACGAG
GAGGTCTTCC TGCCGCTGCA CGGCATCCAC CAGGCGCACA ACGCGGCCGT CGCGCTGGCG
GCCGTCGAGG CGTTCCTCGG CGGCGGGCAG GGTCAGCTGG ACGCCGACGC CGTCCTCGCC
GGGTTCGCCC AGGCCGACTC GCCTGGCCGG CTGGAGGTCG TGCGGCGCTC GCCGACCATC
GTCCTGGACG GCGCGCACAA CGTCGCCGGC GCGCAGGCCC TCGCGGCCGC GGTCGACGAG
GCCTTCGGCT TCGACAATCT CGTCGGGGTC GTCGGTGTGC TCGGAGACAA GGACGCCGAG
GGCATCCTCG CCGCCCTGGA ACCGGTGCTC AGCAGCGTCG TCGTGACCCA GAGCACGTCA
CCGCGGGCGA TGGGCGCGGA CGACCTGGCC GCGATCGCGG TGGACGTCTT CGATGCCGAC
CGGGTCGAGG TCGCGCCACG GCTGGACGAC GCCCTCGAGG CCGCGGTGCG GCTCGCGGAG
GAGGAGGGCG AGCTGGGCGG CACGGGTGTG CTGGTCACCG GCTCGCTGGT GACCGTGGGT
GAGGCGCGGC ACCTGCTTCG TCGCTGA
 
Protein sequence
MPAPAAPSER ASRCHNLGSV TDARTGRTPD DAAELARVEK ALAGRFPHRM HPDLDRMRDL 
VDLLGHPERS FPSIHITGTN GKTSTARMID ALLRGFGLRP GRYTSPHMES VTERISINGV
PADAEVFARA YDDVIPYAEL VDSRHPERVT FFELLTAMAF SAFSDSPVDA GVVEVGMGGT
WDATNVVDAG VAVITPVSLD HPELGDTVEA VAAEKAGIIR PDALVVLAQQ SLPAAEVLLR
RSAEVGATVA REGLEFGVVG RRVAVGGQVL TLRGLGGTYE EVFLPLHGIH QAHNAAVALA
AVEAFLGGGQ GQLDADAVLA GFAQADSPGR LEVVRRSPTI VLDGAHNVAG AQALAAAVDE
AFGFDNLVGV VGVLGDKDAE GILAALEPVL SSVVVTQSTS PRAMGADDLA AIAVDVFDAD
RVEVAPRLDD ALEAAVRLAE EEGELGGTGV LVTGSLVTVG EARHLLRR