Gene Francci3_0516 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0516 
SymbolubiA 
ID3905174 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp602790 
End bp603749 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content70% 
IMG OID637877845 
Productprenyltransferase 
Protein accessionYP_479629 
Protein GI86739229 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0382] 4-hydroxybenzoate polyprenyltransferase and related prenyltransferases 
TIGRFAM ID[TIGR01475] putative 4-hydroxybenzoate polyprenyltransferase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.494925 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGATCG CGGTGAGTGA CGCCGCGTGG GTGCCTGGCG CGTCGGAGCC GGTCGGGCGA 
GGGTCGGGAG CCGGCCCGGG ACACGCCACC GGCCGGGTAC GGGCGTTTCT GCGGCTCGTC
GTCATCGAGC ACTCGGTCTT CGCGCTGCCG TTCGCCTACA TCGCGGCGCT CGCCGCCTCG
TTCGCGGATT CCCGGTCGGT GCACTGGTCG GACCTCGCCC TCGTCACCGT GGCGATGGTC
GCGGCCCGAA CTTTCGCCAT GGCCGCAAAC CGGGTCATCG ATCGCGCGAT CGACGCCCGT
AATCCGAGGA CGGCGAACCG AGAGCTGGTC ACCGGCGTGG TCTCGGTGCG TACGGCGGTC
GTCGGCGCCG CGGTCGCGCT CGCCGTCTTC CTCACCGCCG CCGCGGCCTT GTCCTGGCTC
TGCCTGCTGC TCGCGCCGGT CGCCGTCGCG CCGCTCGTCG TCTACCCCTA CGCGAAACGG
TTCACCGACT TCCCGCACGC CGTGCTCGGC ATCGCCCAGG CGGTGGCCCC GGTCGGCGCG
TGGATCGCGG TCACCGGTGA GTGGTCCTGG GCCGCACTGG TGCTCGGCCT TGCGGTCGGT
AGCTGGATCG GCGGCTTCGA CGTGATCTAC GCCTGCCAGG ATGCCGAGGT CGACCGGCGC
ATCGGCGTGC GTGCGGTGCC GGCCCGGTTC GGGGTGCGGG CCGCGCTCAT TGGTTCCACG
GTTACGCATA TGATCACCTT TGCGTTGTTC ATCGTCTACG GGCTGATGGA CAACGCCGGT
CCGTGGTGGT GGGCTGGGCT CGTGCTGACG GCGGCGGCAT TCTGCTATGA GCACGCCATC
GTGTCCCCGA ACGACCTGTC GCGGGTCAAC CGGGCCTTCC TCACCGCGAA CGGATTCGTC
GGCATCGTCT TGTTCCTCTT CGCCGTTGTC GATCTCGCCA GCCGTGGTCT GGCCGTCTGA
 
Protein sequence
MVIAVSDAAW VPGASEPVGR GSGAGPGHAT GRVRAFLRLV VIEHSVFALP FAYIAALAAS 
FADSRSVHWS DLALVTVAMV AARTFAMAAN RVIDRAIDAR NPRTANRELV TGVVSVRTAV
VGAAVALAVF LTAAAALSWL CLLLAPVAVA PLVVYPYAKR FTDFPHAVLG IAQAVAPVGA
WIAVTGEWSW AALVLGLAVG SWIGGFDVIY ACQDAEVDRR IGVRAVPARF GVRAALIGST
VTHMITFALF IVYGLMDNAG PWWWAGLVLT AAAFCYEHAI VSPNDLSRVN RAFLTANGFV
GIVLFLFAVV DLASRGLAV