Gene Franean1_5123 details

Gene Information

Locus tag: Franean1_5123
Symbol:
ID: 5673457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6137155
End bp: 6138480
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 77%
IMG OID: 641243973
Product: glycosyl transferase family protein
Protein accession: YP_001509387
Protein GI: 158316879
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
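
The coordinate and length fields in this record are consistent with one another, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes one terminal stop codon; the record states neither assumption, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 6137155
end_bp = 6138480
gene_length_bp = 1326
protein_length_aa = 441

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence ends with a single stop codon,
# which is counted in the gene length but not in the protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # span matches the Gene Length field
print(span_bp % 3 == 0)                          # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches the Protein Length field
```

Under these assumptions, all three checks print `True`.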


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.354278
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0746579
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGGGG ACAGACTGGC AACGGCGCTG GTGCGTTCCG GGGCGGCCGG AGCCGTCGCC 
GTCGCGGCGC ACACGGCGGT GAACGCGGCA CTGCTGCGCG TCCCGGCGCC GGCCGTCCCG
GTGCGGGAGC GGGTCAGCGT GATCCTGCCC GTGCGGGACG AGGCGGCGCG GGTGCGCACC
TGCCTGACTG CTCTGCTCGG CTCCCGCGAC GTCGCCGACC TCGAGGTGAT CGTCTACGAC
GACGGCTCGA CGGACGGCAC GGGCATGATC CTGCGCCGGC TCGCCGAGCG GGACCGGCGC
CTGCGGGTGC TGACCGGGCC GGAGCCGCCG GACGGCTGGC TGGGCAAGCC CCACGCCTGC
GCCCGGGCGA CGCGCCAGGC CACCGGCACC GTGCTGGTCT TCGTCGACGC CGACGTCGCC
CTCGCCCCGG ACGGCCTCGC CCGGGCCGTG GGCCTGCTGC GCGGATCGGG CCTGGACCTC
GTCTCGCCCT ACCCGCGGCA GGTCGCCGTC GGGGCGGCCG AACGGCTCGT GCAGCCGCTG
TTGCAATGGT CGTGGCTGGC GCTGCTCCCG CTGCGCGCCG CCGAGAGCTC CGCCCGCCCG
TCGTTGGCCG CGGCCAACGG CCAGTTCCTC TGCGTGGACG CGGCGGCCTA CCGCCGGGCC
GGCGGGCACG GCGCGGTCGG CGGCGCCGTA CTGGACGACA TCGAGCTGCT GCGCGCGGTC
AAGCGCTCCG GCGGGCGCGG TGTGGTCGCC GACGGCACCG AGCTGGCGGT CACCTGGATG
TACGACGGCT GGCAGCCACT GCGGGACGGC TACGCCAAGT CGCTGTGGGC CGCGGGCGGC
ACCCCGGCGG CCAGCGTGGG TCAGCTGGCC GTGCTCGGAT GGCTCTTCGT CGGCCCGGCG
GTCGCCGCGG CGCGCGGGTC ACGCGCGGGG CTCGTCGGCC TGCTGGCCGG CACGGTGAGC
CGGCTGATCG CGGCCCGGCG CACCGGCGGC CGCGCCTGGC CGGACGCGGC GGCCCATCCG
GTCTCTGTCT GCCTGCTCGG CTACCTGACG GTGCTGTCCT GGTGGCGGCA CCGGCACGGC
ACCATCCGTT GGAAAGGCCG CGCGCTGAAC GGGCCACCGA CCGGGAGCGG ACGACCGCGC
ATAGGCTCGG AGCCGTGGCG ACGGTCGTCG TGGTCGGGGC GGGCGTCGGC GGGCTCGCCG
CCGCCGCTCG GCTCGCCGCC GCCGGGCACC GGGTCACCGT CTGCGAGGCG GCGGAGCGGA
TCGGCGGCAA GCTCGGCTGG TACGAACGCG ACGGCTACGG GTTCGACACC GGCCCGTCCC
TGCTGA
 
Protein sequence
MNGDRLATAL VRSGAAGAVA VAAHTAVNAA LLRVPAPAVP VRERVSVILP VRDEAARVRT 
CLTALLGSRD VADLEVIVYD DGSTDGTGMI LRRLAERDRR LRVLTGPEPP DGWLGKPHAC
ARATRQATGT VLVFVDADVA LAPDGLARAV GLLRGSGLDL VSPYPRQVAV GAAERLVQPL
LQWSWLALLP LRAAESSARP SLAAANGQFL CVDAAAYRRA GGHGAVGGAV LDDIELLRAV
KRSGGRGVVA DGTELAVTWM YDGWQPLRDG YAKSLWAAGG TPAASVGQLA VLGWLFVGPA
VAAARGSRAG LVGLLAGTVS RLIAARRTGG RAWPDAAAHP VSVCLLGYLT VLSWWRHRHG
TIRWKGRALN GPPTGSGRPR IGSEPWRRSS WSGRASAGSP PPLGSPPPGT GSPSARRRSG
SAASSAGTNA TATGSTPARP C
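
The protein sequence can be related back to the gene sequence through the translation table named in the record (table 11). The sketch below is a minimal plain-Python illustration, not the method used to produce this record. It assumes the NCBI TCAG codon ordering for the amino-acid string and the table 11 initiation codons, and it reads an initiation codon at the first position as methionine, which is why the protein begins with M although the gene begins with GTG. The names `translate`, `CODON_TABLE`, and `START_CODONS` are hypothetical, and only the first 60 bases of the gene are used to keep the example short.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spacing used in the record
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_60_bases = (
    "GTGAACGGGG ACAGACTGGC AACGGCGCTG GTGCGTTCCG GGGCGGCCGG AGCCGTCGCC"
)
print(translate(first_60_bases))  # MNGDRLATALVRSGAAGAVA
```

The output matches the first 20 residues of the protein sequence listed above.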