Gene Francci3_1303 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1303 
Symbol 
ID3904352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1558262 
End bp1560121 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content72% 
IMG OID637878636 
Productmethyltransferase FkbM 
Protein accessionYP_480409 
Protein GI86740009 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID[TIGR01444] methyltransferase, FkbM family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTCC TCATCACCGG TGGCACCGGT TTCCTCGGAA GCCGGGTGGT CGACCGGGCG 
CTCGCGGACG GACACCGGGT CGTCGGGCTG GCGCGCAGCG ACGCGGCGGC GACGAAGCTG
CGCCGACACG GCGCGGGGAC TGTGCGCGGT GACCTCGACG ATCCGGCGAC GTTGCTCCCG
GCGTTCCGCG AGGCGAACTG CGAGGCGCTG ATCAACATCG CCTCACTCGG GTTCGGCCAT
GCGGAGACGA TCGTGACCGC CGCCCGTGCG GCCGGCATTC GCCGGGCCGT CTTCCTTTCC
ACCACCGGGA TCTTCACCAC GCTCGATCCA CCCTCGAAGC GGATCCGGGT AGCGGCCGAG
GGGACCATCG CGGCGAGCGG GCTGGACTGG ACGATCATCC GACCGACGAT GATCTACGGG
GGTCCCGACG ACCGGAACAT GGCCCGGCTG CTGGCGCTGC TCCGGCGCGT TCCGGTGCTG
CCGGTCCCCG GCGGGGGGCA TCATCTGCAG CAGCCGGTGC ACGTCGAGGA CCTGGCCCGC
ACCGTGCTGC GCGCAACGAC GACCGCGGCG GCGATCGGAC GCGCCTATGA CGTCGCCGGG
CCGGAGGCAT TGACGTTCCG GCAGGTCGTG ATCACGGCCG GCGCCGCCGT CGGCCGGCGG
GTGATCTGCG TGCCCGTGCC GGTCCGCCCC GTCATCGCCG TGACCCGCGC CTACGAGCGG
CGGGTGTCCT CGCCAAGGTT GAAGGCCGAA CAGATCGCCC GGCTCATCGA GGACAAGGCG
TTCGCGATCG ACGCGGCCCG CCGCGACCTG GACCATCGTC CGCGGCCGTT CGCGGCCGGT
ATCGCGGCGC AGGCCACCTC ATCACCCTAT CGGGTGACCC CAAGCCGCTC TCCCGACCGA
ACGGAGACCC GGATGTTCCG GATCGACCAG GCCACCCAGC TGCTCCAGGA CCTCCAACGG
ATCGCCAGGG TGGCTGGTCC CCCGACGGCG CTGCGCTTCG GTGCCGCCGT GGCCCGGCAT
GCTCCGACCA TCCGCCGGGC GGGCAACCTG GATGCCGCCG ACCGGGCGAT GGCCACCGGC
GGCCACACCT ACCGGCCGCT GCCCGGGGTG ACGGTCATGT TGCCCGGCAC CGCGTTCGGC
GGCGCGCGCG AGATGTACTG CCGCGGGGTC TACCACGCCC TCCCCGGGTA CGCGCCGGCG
GCCGGCGAGG TCGTCGTCGA CCTCGGCGCA AACCAGGGCC TGTTCTCGGT GCTCGCGGCG
CGAGCGGGCG CCGATGTGAT TGCCGTCGAG GCGCAGCGCG GGTTCGCCCC CGCCTTCATC
AACCATGCCG CCGGCAACGG TGTGTCGAAC CACATCCAGC TGCTCCACGC CCTGGTCGGT
CCGACCGCCG GCGTGTTCGC CGATCCGCGG GCCCGCCGGA ACGCGACACA CTGGGACGGG
GACGTCGACG TCCTCACCAT GGCAGAGGTG TTCGAGGCTG GCGGGGTCGA CCAGGTCGAC
CTCGTCAAGC TGGACATCGA AGGCTCCGAG TTCGCCCTGT TCGACGAGCC TGGATGGCTG
GACGCGGTTG GTCGGATCGT GATGGAGGTG CACACCGGGT TCGGCGATCC CCGTACGCTC
GACGACCTGC TGGTCCGGCA CGGCTTCGAG GTCACTCTGC TCGGTAACGA CCTGGTGCCG
ACTGCCCATC TGGGCGACGC GGCGTCGGGC TACCTCTACG CCCGCCGGAC CCGATCGACG
ACCCGGGCCA TCCGCGGGTC CGCGCAGCCC GACGGTCGCC CGGTGTCGGT ACCCGGCAGC
CGCCCGTCAT CCGAGCCCGG TTTCGTCGAC GACCCCGCAC CGCGGGGTGT CTCGGCGTGA
 
Protein sequence
MRFLITGGTG FLGSRVVDRA LADGHRVVGL ARSDAAATKL RRHGAGTVRG DLDDPATLLP 
AFREANCEAL INIASLGFGH AETIVTAARA AGIRRAVFLS TTGIFTTLDP PSKRIRVAAE
GTIAASGLDW TIIRPTMIYG GPDDRNMARL LALLRRVPVL PVPGGGHHLQ QPVHVEDLAR
TVLRATTTAA AIGRAYDVAG PEALTFRQVV ITAGAAVGRR VICVPVPVRP VIAVTRAYER
RVSSPRLKAE QIARLIEDKA FAIDAARRDL DHRPRPFAAG IAAQATSSPY RVTPSRSPDR
TETRMFRIDQ ATQLLQDLQR IARVAGPPTA LRFGAAVARH APTIRRAGNL DAADRAMATG
GHTYRPLPGV TVMLPGTAFG GAREMYCRGV YHALPGYAPA AGEVVVDLGA NQGLFSVLAA
RAGADVIAVE AQRGFAPAFI NHAAGNGVSN HIQLLHALVG PTAGVFADPR ARRNATHWDG
DVDVLTMAEV FEAGGVDQVD LVKLDIEGSE FALFDEPGWL DAVGRIVMEV HTGFGDPRTL
DDLLVRHGFE VTLLGNDLVP TAHLGDAASG YLYARRTRST TRAIRGSAQP DGRPVSVPGS
RPSSEPGFVD DPAPRGVSA