Gene Franean1_5500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5500 
Symbol 
ID5673831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6659634 
End bp6661001 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content75% 
IMG OID641244355 
Productglycosyl transferase group 1 
Protein accessionYP_001509761 
Protein GI158317253 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0983279 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCACGC TGCCGGAGGG CTCACTGCGC CCCACGACCA TTCCGCCGCT GCCGGCACCG 
CCGTCCGCCG CCGGGCAGGT GCCCCTCGTC GGCACCGGGA TGCCCGGGCT GACCGAGCAG
TTCGTCCCGT CGGCGACGTC CGAGCGGCTG CTGGCCGGCC TCGCCGACGC GCTGCCCGCG
CAGCCCACAC TGGCGGAGCT CGTCGAGACC TCGGGGATGC GCCGCATCCA CATGCTGGCC
TGGCGGGATC TGGACGACCC CGAAGCGGGT GGCTCCGAAC TGCACGCCGA CAAGGTCGCC
GAGCGGTGGG CCGCCGCCGG CGTCGACGTC AGCCTGCGCA CCGCCGAAGC ACCCGGCCAC
CCCGAGACCA CGCGGCGCAA CGGCTACCAG ATCGTCCGCA AGGCCGGCCG GTACTCGGTG
TTCCCGCGGA CGGCGACGTC CGGCGCGCTG GGCCGCACCG GCCCGTGGGA CGGCCTGGTC
GAGATCTGGA ACGGGATGCC GTTCTTCTCC CCCGTCTGGG CGCGCTGCCC GCGGGTGGTG
TTCCTGCACC ACGTCCACGG CGCGATGTGG CGGATGGTGC TCTCCCCCAA GCTGGCCCAG
GTCGGCGAGA CCATCGAGTT CAAGGTGGCG CCGCCGCTGT ACCGGCGCAC CCGCATCCTC
ACCCTCTCCC AGTCGTCCCG GGACGAGATC ATCGAGCTGC TCGGCCTGCC CGCGGGGAAC
ATCTCGGTGA TTCCCCCGGG CATCGACTCC TCGTTCAGCC CCGCCGGGGA GCGCTCCGCA
CGCCCGCTGG TGCTCGCCGT CGGCCGGCTG GTGCCGGTGA AGCGGTTCGA CGTGCTGATC
GACTCGCTGA TCCGGGCGCA CGACGAGCAC CCCGCGATGG AGGCCGTGAT CGTCGGCGAG
GGCTACGAGC GCCCGGCGCT CGAGGCGCGC ATCGCCGCGG CGGGCGCGGG CGACTGGCTG
CGGCTGGTCG GCCGGGTGGA CGACGCGGGT CTTCTCGACC TCTACCGGCG TGCCTGGGTG
CTCACCTCGG CCTCCGCCAG AGAGGGTTGG GGCATGACGA TCACCGAGGC GGCCGCCTGC
GGGACGCCGT CCGTCGCGAC GAAGATCGCC GGGCACACCG ACGCCGTCGC GGACGGCGTG
TCCGGCCTGC TGGTCGAGGA CCCGAACGAC CTGGGCAAGA CCCTGGCCGG CGTGCTGTCC
GACCCCGAGC TGCGGGCCCG GCTCTCCGCC GGCGCGCTCG CGCACGCGGC GACGTTCACC
TGGGAGCACA CCGCCCGCTC GACCTACCTC GCGCTGGTCA ACGAGGCCGC CCGCCGCCGG
CTCGTCCGCC GCCCCGCCCC GAGCTCGCGC TCGGGCGCGC CCCGGTGA
 
Protein sequence
MSTLPEGSLR PTTIPPLPAP PSAAGQVPLV GTGMPGLTEQ FVPSATSERL LAGLADALPA 
QPTLAELVET SGMRRIHMLA WRDLDDPEAG GSELHADKVA ERWAAAGVDV SLRTAEAPGH
PETTRRNGYQ IVRKAGRYSV FPRTATSGAL GRTGPWDGLV EIWNGMPFFS PVWARCPRVV
FLHHVHGAMW RMVLSPKLAQ VGETIEFKVA PPLYRRTRIL TLSQSSRDEI IELLGLPAGN
ISVIPPGIDS SFSPAGERSA RPLVLAVGRL VPVKRFDVLI DSLIRAHDEH PAMEAVIVGE
GYERPALEAR IAAAGAGDWL RLVGRVDDAG LLDLYRRAWV LTSASAREGW GMTITEAAAC
GTPSVATKIA GHTDAVADGV SGLLVEDPND LGKTLAGVLS DPELRARLSA GALAHAATFT
WEHTARSTYL ALVNEAARRR LVRRPAPSSR SGAPR