Gene Francci3_1208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1208 
Symbol 
ID3903562 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1441752 
End bp1443089 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content72% 
IMG OID637878541 
ProductFolC bifunctional protein 
Protein accessionYP_480315 
Protein GI86739915 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID[TIGR01499] folylpolyglutamate synthase/dihydrofolate synthase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.482446 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGTGGA ACCAGCCTGT GAAGTGGACG TACGAGTCGG CGTGGGCCGC CCTGAACAAC 
ACGGTGGATC TCGAAAAGCA GACGATGCCG GCGGGTCGGC CGGTGCCGAG CCTTGACCGG
ATGCGGGAGC TGGCGGGCCG GCTCGGGGAT CCGCACCGGG CGGTTCCGCT GATCCACCTG
ACCGGCACGA ACGGCAAGAC CTCGACGGCG CGGATCATCT CGGCGCTGCT GGGCGCGGCG
GGGCTGCGGG TGGGGCTGTA CACCAGTCCG CATCTGGAGC GGGTGAACGA GCGTCTGGTC
GTCGACGGCC GACCGATCGG CGACGAGGAG TTCGGACGCT GCGTGGGCAC CGTCCTGGAC
GCGGCCGCCC CGATGAGTGG ACGGCCCACC TTCTTCGAGC TGCTGACGGC CACCGCCTTC
CGATGGTTCG CCGACCTGGC CGTCGACGTC GCCGTGGTCG AAGTGGGCCT GCTCGGTCGC
TGGGACGCCA CGAACATCGC CGACGGGCGG GTCGCGGTCG TCACCAGCAT CGGCGCTGAT
CATCTGGACT ACGCCGGGAG TATGGCGGGC GTGGCCCGCG AGAAGGCCGG GATCGTCAAG
CCCGGCAGTC ACCTGGTGCT CGGCGAGGTG GACCCCCGCT TCGACGACAT CTTCGCCCGG
ACCCCGGCCA CGGACGTCCT GCGGCTGGGT CGGGACTTCG CCGCGGTCGC CGGCCACCCG
GACGCGGCGG GTCGACGGGG CGGTTTCCGC ACCCCGCAGG CGCGCTACGA CGACGTGCGA
CTGTCCCTGC ACGGTTCCTA TCAGGATGCC AACGCGGCGT GCGCCCTGGC CGCCGTGGAG
ACCTTCGTCG GTCATTCCCT CCCCGACGTC GTGGTCCGGA CCGGTCTCGG GGGCGTCCGG
GCGCCGGGAC GCCTGGAAAT CGTGCGCGCA CAGCCGCTGT GCGTGCTTGA CGGGGCGCAC
AACCCGGCAG CGGGCGCGGC GCTCGCCCGG TCCCTGCGTG AGGAGTTCCC CGGGCGAGAG
TGGACCGTCG TCTACGGGGC GCTGCGCGGT CACGACTACG AGGGGACTCT CGCGGAACTG
CGCCACCAAC CGATCAGCTC GCTGATCGCC TGCGAACCGG CGTCACCGCG GGCCATCCGC
GCCGAGCACC TGGTCTGCGC GGCGCGGGCG CGGGCGATGC CGGCCCACGC CGCATCCGAC
GTCGGCTGCG CCGTCCGCCA CGCCCTGCGG GACGCGGCGG GGCCGGGTGG GGCGATCCTG
GTGACCGGGT CCTTCTACCA TCTGGCTGAG GCCCGCCGGA CCCTGACCGC GCTGGACCCG
ACGCACGGAA TGCACTGA
 
Protein sequence
MKWNQPVKWT YESAWAALNN TVDLEKQTMP AGRPVPSLDR MRELAGRLGD PHRAVPLIHL 
TGTNGKTSTA RIISALLGAA GLRVGLYTSP HLERVNERLV VDGRPIGDEE FGRCVGTVLD
AAAPMSGRPT FFELLTATAF RWFADLAVDV AVVEVGLLGR WDATNIADGR VAVVTSIGAD
HLDYAGSMAG VAREKAGIVK PGSHLVLGEV DPRFDDIFAR TPATDVLRLG RDFAAVAGHP
DAAGRRGGFR TPQARYDDVR LSLHGSYQDA NAACALAAVE TFVGHSLPDV VVRTGLGGVR
APGRLEIVRA QPLCVLDGAH NPAAGAALAR SLREEFPGRE WTVVYGALRG HDYEGTLAEL
RHQPISSLIA CEPASPRAIR AEHLVCAARA RAMPAHAASD VGCAVRHALR DAAGPGGAIL
VTGSFYHLAE ARRTLTALDP THGMH