Gene Franean1_5703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5703 
Symbol 
ID5674029 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6920571 
End bp6921935 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content73% 
IMG OID641244556 
Productglycosyl transferase family protein 
Protein accessionYP_001509959 
Protein GI158317451 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID[TIGR03469] hopene-associated glycosyltransferase HpnB 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0459973 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTTCCAC TGCTGGTGAT TGCGCTGATC TCGCTCGCGT CCTGGATTTT CCTGGCGCTG 
TTCCGAGGAT TCTTCTGGCG GACCGATCAA AGACTGCCCG CCGGCGACGG ACCGCTACCG
GAACAGTGGC CGGCGGTAGT CGCCGTGGTC CCCGCGCGTG ACGAGGCGGA CGTCCTTCCC
GACACGCTTC CCTCACTGCT CGCGCAGGAC TATCCCGGGC GCCTGAGCAT CATTCTGGTG
GATGACGCCA GTACCGACGG GACGGGCGAG CTGGCCCGGG AGCTGGCGGC GCGGGCGGCG
GCGGCCCGTC CCGAGGCGGC GGTGGCGCTC ACCGTCATCG GGTCGAGCGA GCCGCCGGCC
GGCTGGACGG GCAAGCTCTG GGCGTTGCGG CACGGCATCG CCGCGGCCGG CGCGCCCGAG
TTCCTGCTGC TGACCGACGC CGACATCGCG CACGATCCGA GCTCGGTGCG CGAGCTCGTC
CGGGCGGCGA CGGCCCGCCG GCTTGATCTC GTCTCGCAGA TGGCGCGGCT GCGGGTCAAC
ACCGGATGGG AACGCCTCAT CGTGCCGGCC TTCGTCTACT TCTTCGCGAT GCTCTACCCG
TTCCGGTGGT CCAACGACCC GGACTCGCGG ATCGCCGCCG CCGCCGGGGG ATGCTCGCTC
GTCCGCCGCC GGGCGCTCGC CGACGCCGGG GGGCTGGACG CCATCCGGGA CGCGGTGATC
GACGACGTCG CACTGGCCCG CGTGATCAAG AAGTCCGGCG GGCGGACCTG GCTCGGGCTC
GCCGACCACG TCTCCAGTCG CCGGCCGTAC CCGCGGCTGG CGGACCTGTG GCACATGGTG
GCGCGCACCG CCTACGCGCA GTTGTTCTGG TCGCCGCTGC TACTCGTGGG CACGGTTCTC
GGGCTCGGTT TGGTCTTCGT CGCCCCGGTC GTCGCGACCA TCGCCGGCAT CGCCGCCGGA
AATGTGGCAG TGGCCGCCGC CGGGCTGCTC GCCTGGTCGG TCATGATCAC GACGTTCGGA
CCGATGCTGC GGTACTACGA CCAGCCCGTG CTCTCCTCGC TCGCGCTGCC GTTCACCGCG
GCCCTCTACC TGGCCATGAC CATGGACTCG GCCCGGCGGC ACCGTGCCGG CCGGGGCGCG
GCCTGGAAGG GGCGCACCTA CTCAGCTCCC GACGGGAAGC GGGTCGGCCA GGGAGCCGGT
CAGGACGCCG GCCAGGGAGT CGAGACGGCG GCGCAGGGCG AGGGCGTCGT CCGCCAGCGC
GCAGACGAGG ACCCCCGGTC CGGCCAGGGG CATCAGGGCG ACACTGTCCC CTACCGCGGC
GGTGACCGGC TCGAGACCGG TCGCCGGCCC GGCGACCAGG ACTGA
 
Protein sequence
MLPLLVIALI SLASWIFLAL FRGFFWRTDQ RLPAGDGPLP EQWPAVVAVV PARDEADVLP 
DTLPSLLAQD YPGRLSIILV DDASTDGTGE LARELAARAA AARPEAAVAL TVIGSSEPPA
GWTGKLWALR HGIAAAGAPE FLLLTDADIA HDPSSVRELV RAATARRLDL VSQMARLRVN
TGWERLIVPA FVYFFAMLYP FRWSNDPDSR IAAAAGGCSL VRRRALADAG GLDAIRDAVI
DDVALARVIK KSGGRTWLGL ADHVSSRRPY PRLADLWHMV ARTAYAQLFW SPLLLVGTVL
GLGLVFVAPV VATIAGIAAG NVAVAAAGLL AWSVMITTFG PMLRYYDQPV LSSLALPFTA
ALYLAMTMDS ARRHRAGRGA AWKGRTYSAP DGKRVGQGAG QDAGQGVETA AQGEGVVRQR
ADEDPRSGQG HQGDTVPYRG GDRLETGRRP GDQD