Gene Francci3_1581 details

Gene Information

Locus tag: Francci3_1581
Symbol:
ID: 3903716
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1896273
End bp: 1897673
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 70%
IMG OID: 637878918
Product: glycosyl transferase family protein
Protein accession: YP_480686
Protein GI: 86740286
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
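
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (which matches the listed 1401 bp) and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1896273, 1897673)
print(length)                  # 1401
print(protein_length(length))  # 466
```

Both results agree with the Gene Length and Protein Length fields of the record.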


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.452346
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCACCA CGCCGCCGGC CATGATGCTG CCGGGCGACG GTCCGATCGC GCTGCACGAG 
GTAGAACTGT CCGCGCCGCT TCCGGTGCTC ACCGGTGACG GGCCGGCACA GGTGCTGGTC
CGGCTGCACG GCCGTCCCCT GGGGGTGGTC GCTGCGACGC TGTCCCCGGC CGGCCTGCGC
CCGGACCAGC TCGCCGGCCG GGTCATGGAC AGACTCGGGC CGGACGTGGC CGCCCACCTG
ACCGCGGACG GGCTCGCGGT TCCCGAAACG GTCCCGATCG ACGGAGTTGT GACGTCAGAC
GTCTGCCGGC CGGTGCTCCC CGCCGAGCTC GCCGGGACCG TCGTCGTGAC AAGCTGCGTC
GCCTCCCCCT CGCTGGCGGC CACGCTCGCG GGAATCCTGG CGCAGTCCGT GCCAGCGCAG
GAAATCATCG TGGTCGACAA CCGGCCGCGC ACGAGCGGCA TCCGCGAACA GCTCAGTCGG
ATGGCCGGCG AGGTGCGCTA CGTCGCCGAG CCAGAGAGGG GGCTTTCCCG GGCCCGCAAC
GCCGGACTGG CCGCCGCGAC CACACCGGTG GTCGTCTTCA CCGACGACGA CGTCGAGGTC
GACCCGCGCT GGTTGGAGTT CCTCCTGTCC GGGTTCGCCG CCGGCTCGGG TGTCGTCGAC
GAGACGGTGG GGTGCGTCAC CGGGCTGATC CGGCCACTGG AGCTCTCAAC CCCGGCGCAG
GTGTGGTTCG AGCAGTTCGG CGGCTTCGGC AAGGGCTTCG TCGGACGCCG GTTCGACCGC
ACCGAAAATC GATCAGGCGA CCTGCTCTAT CCCTATACCG CGGGCGTGTT CGGCAGCGGC
GCGAACAGCG CTTTCCGGAC TGATACGCTG CGTCAGCTTG GTGGTTTCGA CGAATTTCTC
GGCACGGGCA CCGCGGCCCG CGGTGGCGAG GATCTCGACA TCTTCCTGTC CGTGGTGCGC
AGTGGCCACG TCCTGGTCTA CGAGCCGGCC GCGCTGATCC GCCACCTGCA CAAGCGGACC
ACTGTCGAAC TGCGGCGCCA GATGTACGAC TACGGGGCCG GCCTCGGGGC GATGGTCACC
AAACGCATCG CGACTCAACC CGCCGAACGC CTCCAGATCG CCCGTCTCGT CCCCGGCGGC
CTGCATCATC TTCTCCACCC ACGCAGCAGC AAGAACGCCG GCAAATCCCG TGACTATCCC
CGGTCCCTGA CCTTGATCGA GTTGGTCGGT GTGGCCCGAG GGCCGGTCGG CTATGCCGCC
AGCCGGGCCC TCGCCCGGCG CCGGCAGCGC GATCTTCCTC ATGACCACGC ACCGACTCCC
GGTGCTCCCC CCGCCCCCTT TCCCAGGGCG CCCATCCCCG AACCGCGGAT CTCCACCGTG
ACCACTGGAG CGCAACCATG A
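
GC content is conventionally the fraction of G and C bases among all bases in a sequence. The minimal sketch below shows one way to compute it from a sequence printed in space-separated blocks like the one above. Whether the 70% figure in this record was derived in exactly this way is an assumption, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C among A, C, G, T; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    return sum(b in "GC" for b in bases) / len(bases)


# First two 10-base blocks of the gene sequence: 16 of 20 bases are G or C.
print(gc_content("GTGACCACCA CGCCGCCGGC"))  # 0.8
```

Passing the full gene sequence to the same function gives the value for the whole gene.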
 
Protein sequence
MTTTPPAMML PGDGPIALHE VELSAPLPVL TGDGPAQVLV RLHGRPLGVV AATLSPAGLR 
PDQLAGRVMD RLGPDVAAHL TADGLAVPET VPIDGVVTSD VCRPVLPAEL AGTVVVTSCV
ASPSLAATLA GILAQSVPAQ EIIVVDNRPR TSGIREQLSR MAGEVRYVAE PERGLSRARN
AGLAAATTPV VVFTDDDVEV DPRWLEFLLS GFAAGSGVVD ETVGCVTGLI RPLELSTPAQ
VWFEQFGGFG KGFVGRRFDR TENRSGDLLY PYTAGVFGSG ANSAFRTDTL RQLGGFDEFL
GTGTAARGGE DLDIFLSVVR SGHVLVYEPA ALIRHLHKRT TVELRRQMYD YGAGLGAMVT
KRIATQPAER LQIARLVPGG LHHLLHPRSS KNAGKSRDYP RSLTLIELVG VARGPVGYAA
SRALARRRQR DLPHDHAPTP GAPPAPFPRA PIPEPRISTV TTGAQP