Gene Francci3_1388 details

Gene Information

Locus tag: Francci3_1388
Symbol:
ID: 3903369
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1668334
End bp: 1670238
Gene Length: 1905 bp
Protein Length: 634 aa
Translation table: 11
GC content: 74%
IMG OID: 637878725
Product: polyprenyl synthetase
Protein accession: YP_480494
Protein GI: 86740094
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0142] Geranylgeranyl pyrophosphate synthase
TIGRFAM ID:
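
The reported gene and protein lengths can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: codons minus one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1668334, 1670238)
print(length)                  # 1905
print(protein_length(length))  # 634
```

Both values agree with the Gene Length and Protein Length fields of this record.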


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.196896
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00569212
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCGCCGAC GGGGTCAGCT CCACGTCGGC CGCCCCGGGA TCGGCGGTCT CCACCTCCAC 
CCCGTACTCG CCGTCGGGCA GCATGACGGT GACCATGCCT GTCTCGCTCT GCGCCCGCAC
GCCGTCCGGC GGTTCGCTGA AGTGCAGGTT GACGTCGCCG TCCCGGCTGC GCGCGGCGAC
CGGACCGCGC GTGAGCTCCC GCCCGGCGAT GGTGCCATCG CGGGTGACCA GGGACAGTTC
ACCGCCTACC CCCCACAGCG TGATTTCGCC GGCGAGCACC TCGGCCCGCA CCCGGACCCG
TGGCGGCACG GTCAGCCTGA CCCGCACGTC GCTGGCCCGG CCGTCGATCC GCAGCGTGCT
CCCGGCCCTC GTGAGCACGG GTACGGACCG ACCGGGACGA CGCCGCACGG TCAGCGCGAC
GTGCACGTCC TCGCGGTCCC GCCCGGAGAT CTCGACGCGG CCGGCGGCCA CCAGAACCTC
CACGGCGTCG ATACCGTCGA CAGTGTGAGT CAGCTCCGCG GGTGGCAGCC GCAACAGGGT
CAATCCGCCC AGTGCGGCGG CGACCAGCAG AACGGCGACC ACGACGACGG CGGTGTCGAA
CACGGGCACA CCGCACGGTA CCCCGGGCGC CGGCCAATCC CCGGCCACAC CACGGTGCCG
GACCACACCA CGGTGCCGGA CCACACCACG GTGCCGGACC ACACCACGGT GCCGGACCAC
ACCACGGTGC CGGACCACAC CACGGTGGAT GTGCGCCGGC GGGTCTCCGC CGTGTTGCGG
CGCTTCGCCG GTTCGCGCGG AGCGCTCCTG CGCGGAATCG ACGACGATCT GATCCCGTTC
GTCCGTATCG CCACCGAATT TCTGCTGGCG GAGGGCAAGC GGTTGCGTCC CGCCTTCTGC
TACTGGGGGT GGCGGGGCGC GGGCGGCCCG GACTGCGACG AGATTGTGAC CGCCGCCGCC
GCCATCGAGC TGCTGCACGC CTGCGCGCTG ATCCACGACG ACGTCATGGA CGCCTCCGAC
ACACGGCGCG GCAGGCCGGC CGCGCACCGG CGCTTCTCCC GGGTCCACCG GACCGCCGGC
TGGCGGGGTG ATCCCGCCGA CTTCGGCCGC TCGGCCGCGA TCCTGCTCGG CGATCTGTTC
CTGGCCTGGG CCGACGAACT GCTCGCGGCC AGCAGGATGC CACCCGAGGC GTTGGTCCGG
GCCTGGCCCA CCTACGGGCG GATGCGCAGC GAGCTGATGG CGGGGCAGTA CCTCGACCTC
GTCGGCCAGG CCGAGGCCGG TCCGCACGGC GGCCTCGATC CCGGGCGGGC GGTCCGCATC
GCCCGGTACA AGACCGCCGG TTACACGGTG GTCCGTCCCC TGCAGCTCGG CGGTCTGCTC
GCGGGCGCGC CGCCGGACCT GTTGGCGGCC TATGCGGCGT TCGGCCTGCC GCTCGGCGAG
GCGTTCCAAC TCCGCGATGA CCTCCTGGGC GTGTTCGGCG ATCCGGCGGT GACGGGAAAA
CCCACCGGGG AGGATCTGCG TGACGGGCGG CCCACTGGCC TGCTGGCGCT CGCGCTGACC
CGTGCGCAGC CGGCCGCGGC GGCCCGGCTG CGCACGCTGA TCTCCCCGCC GGTCCGGCGC
GCCGGGCACC CTGAGGATGC GGGACAGCCC GAGAATCCCG GGTATTCCGG GGGACCCGGA
TGTTCCCCGG GATTCGGGCC CACCGAGAAC CCCGCGGCCC GCGCCGCCCG GGTCGCCGAG
GCCCGCGACA TCATCGCCGC AAGCGGAGCG GTGGCCGCCG CCGAGGAACG GATCGCCGCC
CGGACCGCCA CGGCGGTCGA GGCGGCCCGA CGGGCGGATC TGGACGTCAC AACCCTCGCC
GCCCTGACGG AACTCGCCAT GGCAGCGACT TCGCGATCAC ATTGA
 
Protein sequence
MRRRGQLHVG RPGIGGLHLH PVLAVGQHDG DHACLALRPH AVRRFAEVQV DVAVPAARGD 
RTARELPPGD GAIAGDQGQF TAYPPQRDFA GEHLGPHPDP WRHGQPDPHV AGPAVDPQRA
PGPREHGYGP TGTTPHGQRD VHVLAVPPGD LDAAGGHQNL HGVDTVDSVS QLRGWQPQQG
QSAQCGGDQQ NGDHDDGGVE HGHTARYPGR RPIPGHTTVP DHTTVPDHTT VPDHTTVPDH
TTVPDHTTVD VRRRVSAVLR RFAGSRGALL RGIDDDLIPF VRIATEFLLA EGKRLRPAFC
YWGWRGAGGP DCDEIVTAAA AIELLHACAL IHDDVMDASD TRRGRPAAHR RFSRVHRTAG
WRGDPADFGR SAAILLGDLF LAWADELLAA SRMPPEALVR AWPTYGRMRS ELMAGQYLDL
VGQAEAGPHG GLDPGRAVRI ARYKTAGYTV VRPLQLGGLL AGAPPDLLAA YAAFGLPLGE
AFQLRDDLLG VFGDPAVTGK PTGEDLRDGR PTGLLALALT RAQPAAAARL RTLISPPVRR
AGHPEDAGQP ENPGYSGGPG CSPGFGPTEN PAARAARVAE ARDIIAASGA VAAAEERIAA
RTATAVEAAR RADLDVTTLA ALTELAMAAT SRSH
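
The protein sequence corresponds to the gene sequence translated with translation table 11. As a minimal sketch of that relationship (not the database's own pipeline), the Python below builds the codon table from the standard NCBI amino-acid ordering, which table 11 shares with table 1, and treats the first codon as an initiator methionine, since the gene starts with GTG. The names `CODON_TABLE` and `translate_cds` are hypothetical, and the input is assumed to be a complete CDS fragment beginning at the start codon.

```python
BASES = "TCAG"
# Amino acids in NCBI order (codons enumerated TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS that begins at its start codon (assumption).

    Whitespace, such as the 10-base block spacing used in this record,
    is ignored. One trailing stop codon, if present, is dropped.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if residues:
        residues[0] = "M"  # alternative start codons such as GTG still encode Met
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGCGCCGAC GGGGTCAGCT CCACGTCGGC"))  # MRRRGQLHVG
```

The output matches the first block of the protein sequence in this record.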