Gene Francci3_0964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0964 
Symbol 
ID3903871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1139233 
End bp1140699 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content71% 
IMG OID637878298 
Productglycosyl transferase family protein 
Protein accessionYP_480077 
Protein GI86739677 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.824604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCTCGA CCGATCAGGG GGGCAGGACC GATCAAGCGA GTAGGACCGA TCAAGCGAGT 
AGGACCGAGC AGGGGAGCCG GACCGGGCCG GCGGGCAGAC CGGACGACCG CACGGGCCTG
CTCCGTCTGC CCGTTCTGCC GCATCCCCGC CACCCGCTGG AGCGTCCCGA GCGGACGGGC
GGTTCGGCCG CGGAACCGGT CGGTTCCCGG CTGGACGTCG GGCCACAGGT GTCGGTCGTG
ATACCCACCC GCAACGAGGC GCGCAACGTC GAACCGCTGC TGCGGCGCCT CGACGAGGCG
CTGCACGGCC TGTCCGGAGA AGTAATCTTC GTTGACGACT CCGACGACGG AACGCCCGAG
GTCATCGCGC GCGTCCGCCC CTCGGTGCGG TTACCGGTAC GGGTGCACCA CCGGACCCCG
GCCCAGCGGG TCGGCGGGCT GGGGGGCGCG GTCAGCGAGG GCTTCGCGCT CTGTGCCGCG
CCCTATGCGG TGATCATCGA TGGGGACCTC CAGCATCCGC CGGAGACCAT CCCGGCGCTG
CTCGGCACCG CCGTGGAACA TGCCGCCGAT GTGGTGATCG GCAGCCGGTA CGTGTCCGGC
GGCAGCGCAT CGGGGCTCGC TGGAAGCATG CGGCACCTGG TCTCGACGGG GTCGAACCGG
TTGTGTCGGT GGGTCTTCCC CCGTCGGCTG CGCGGCGTCT CGGACGTGAT GAGCGGCTTC
TTCCTGGTAC GGGTCGCCGT CGTGGACCGG GCCGGCCTGC GACCGGACGG CTACAAGATC
CTGCTGGAGC TGCTCGTCGC GTCCGGACGG CTGCGCGTCC GCGAGATCGG GTACGCCTTC
GCCGAGCGGC ACGCCGGGAC CTCCAACGCC TCGCTGACCG AAGGCGCCCG CTTCGCCCGG
CGGCTGTTCG CGTTGCGGGT TCCGAAGCCC GCGCGGTTCG CCCTGGTCGG GGCGTCCGGG
ACGGTGCCGA ACCTGCTTGG CACCGCCGTG CTTCACCACG TCGGCTTGCA CTACCTGGTC
GCGGCGATCG TCGCGACCCA GATCGCCGTC GGCTGGAACT TCCTCGGCTG CGAGCTCCTG
GTCTGGGATC GGGAGACGGG TTCCCGGCTG CGTCGCTATC CGGCGTTCGC GCTCATCAAC
AATCTCGATC TGGTCATTCG GCTGCCACTG CTCGCGGTGC TGGTCGGGCG ATGGCATCTC
GGCGTCGGCA TCTCGACCCT GATGTCCCTG GCCGCCGCGG TGATCGTCCG ATACCTGGTG
GTGGATCGGC TGGTGTACCG GCGACGGGCG GTGTCTGAGC GGGCGGTGTC TGAGCGGGCG
GTGTCTGAGC GGGCGGTGTC TGAGCGGGCG GTGTCTGAGC GGGCGGTGTC TGAGCGGGCG
GTGTCTGAGC GGGCGGTGTC TGAGCGGGCG GTGTCGCCGT CCCACGGAAG GCCGTCGGAG
GACGGGGTGT CCGGTGCGGT TTCGTAG
 
Protein sequence
MTSTDQGGRT DQASRTDQAS RTEQGSRTGP AGRPDDRTGL LRLPVLPHPR HPLERPERTG 
GSAAEPVGSR LDVGPQVSVV IPTRNEARNV EPLLRRLDEA LHGLSGEVIF VDDSDDGTPE
VIARVRPSVR LPVRVHHRTP AQRVGGLGGA VSEGFALCAA PYAVIIDGDL QHPPETIPAL
LGTAVEHAAD VVIGSRYVSG GSASGLAGSM RHLVSTGSNR LCRWVFPRRL RGVSDVMSGF
FLVRVAVVDR AGLRPDGYKI LLELLVASGR LRVREIGYAF AERHAGTSNA SLTEGARFAR
RLFALRVPKP ARFALVGASG TVPNLLGTAV LHHVGLHYLV AAIVATQIAV GWNFLGCELL
VWDRETGSRL RRYPAFALIN NLDLVIRLPL LAVLVGRWHL GVGISTLMSL AAAVIVRYLV
VDRLVYRRRA VSERAVSERA VSERAVSERA VSERAVSERA VSERAVSERA VSPSHGRPSE
DGVSGAVS