Gene Francci3_0318 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0318 
Symbol 
ID3903350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp369431 
End bp370813 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content71% 
IMG OID637877647 
Productglycosyl transferase family protein 
Protein accessionYP_479434 
Protein GI86739034 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.163788 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGGGA CCTTCGAGAC GACCGGGAGC ACCGGAGAGA GCATTCCCGT TCAACATCAT 
CAGCCCGGGT GGCTCGGGCC GGACGGCGGC CGTCCGCGCC CGGTGCTGGA CGTCGTCATC
CCCGTGTACA ACGAGGAGAA CGACCTCGCC CCCTGCGTCC GACGGCTGTA CGCCCACCTG
ACCGGGACGT TCCCGTACCC CTTTCAGATC ACGATCGCCG ACAACGCCAG CACCGACGGC
ACCCTGGCCA TCGCCCAGGC GTTGGAGAAG GAGCTCCCCG AGGTCGCCGC GATCCACCTC
GAGGCCAAGG GCCGCGGCCG GGCACTGCGG GCGGCCTGGG GCCTCTCGCC CGCACCGGTG
CTCGCCTACA TGGACGTCGA TCTGTCGACC GACCTCGCCG CGCTGCTGCC CCTGGTGGCT
CCGCTCATCA GCGGCCACTC GGATCTCGCT ATCGGCACCC GGCTCTCCCC CGCCTCCCGG
GTCGTGCGGG GACCGCGCCG GGAGGTGATC TCCCGCTGCT ACAACCTGAT CCTCCGCAGG
ACCCTGGCGG CCCGGTTCTC CGACGCGCAG TGCGGCTTCA AGGCGATCCG CGCCGACGCC
GCAGCGGGCC TGCTACCCCT GGTGGAGGAT AGCGGCTGGT TCTTCGACAC CGAACTGCTC
GTCCTGGCCG AACGGGCCGG GATGCGCATC CACGAGGTCC CGGTCGACTG GATCGATGAT
CCGGACAGCC GCGTCGACGT CCTCGCCACG GCCATCGCCG ACCTGAAGGG TGTGGTCCGC
CTCTTGCGGG CGTTCGGCAG CGGAAAGCTG CCGCTCGCCA AGCTGCACCA GGAGTTCGGC
CGAGGTCCGC TCACCGCCGG CCACGCCGAG GAGGGCAAGG TCGTCGAGGT CCCGGGGGTA
CCGAAGGGAC TCGCCGGTCA GCTCCTGCGA TTCGCCGCGA TCGGGGTCGC CAGCACGCTG
TCCTATCTGG TGCTCTTCGT CCTGCTGCGG ACAGTCACCG GGGCGCAGAT CGCGAACCTG
CTGTCGCTGC TTCTCACGGC GGTCGCGAAC ACCGCGGCGA ACCGGCGGCT GACCTTCGGT
CTCACCGGTC CGCGGCGCGC CGGTCGCCAC CATCTGCAGG GCCTGGTGGT GTTCGCCGTC
GGCCTCGGCC TGACCAGCGG TTCGCTCGCG CTCCTGCACG CGGCGAGCAC GAACCCCGGC
CGCGGCCTCG AACTCTCCGT GCTGGTGCTG GCGAACCTGG CCTCCACGGT CATCCGGTTC
CTTCTGCTAC GCGCCTGGGT TTTCCGCCCG GACCGGGAGG CGAGGAACGT GGCCGGGATG
CCCCCGGCCA CGACACCCCC GCGGAGGCGG GCCCCGACCG GCGAGATCAG GAACGCAGAG
TAA
 
Protein sequence
MTGTFETTGS TGESIPVQHH QPGWLGPDGG RPRPVLDVVI PVYNEENDLA PCVRRLYAHL 
TGTFPYPFQI TIADNASTDG TLAIAQALEK ELPEVAAIHL EAKGRGRALR AAWGLSPAPV
LAYMDVDLST DLAALLPLVA PLISGHSDLA IGTRLSPASR VVRGPRREVI SRCYNLILRR
TLAARFSDAQ CGFKAIRADA AAGLLPLVED SGWFFDTELL VLAERAGMRI HEVPVDWIDD
PDSRVDVLAT AIADLKGVVR LLRAFGSGKL PLAKLHQEFG RGPLTAGHAE EGKVVEVPGV
PKGLAGQLLR FAAIGVASTL SYLVLFVLLR TVTGAQIANL LSLLLTAVAN TAANRRLTFG
LTGPRRAGRH HLQGLVVFAV GLGLTSGSLA LLHAASTNPG RGLELSVLVL ANLASTVIRF
LLLRAWVFRP DREARNVAGM PPATTPPRRR APTGEIRNAE