Gene Franean1_6558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6558 
Symbol 
ID5674873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7976592 
End bp7977968 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content63% 
IMG OID641245407 
Productglycosyl transferase group 1 
Protein accessionYP_001510801 
Protein GI158318293 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGAAAGA AACGTGACAT AAGTACTGCA CTGATAGTGC GGCGGAATCG CGAGGAGATG 
CCTACTTCGG CAGCCGTCGG TACGGGCCGG TCCGGTCCCA TCGACATGGG ACGAAGACGG
CGAGTCCTAA TTGTCGTCCA GAATCTTCCC GTCCGAATCG ATCGGAGGGT ATGGCAGGAA
TGCGTGGCTC TCATCGCCTG CGGTTACCAG GTCTCGGTCA TCTGCCCACG CGGTGACGGG
GAGCGGCGCC ATCAAATCAT TGAAGGTGTC AGCGTGTGGA CATACCGGGC CGCACCCGCG
GCCAGCGGAG TGCTGAGCTA CATTTTCGAG TTCGTCTACT GCTGGTTTCG CACCTTCATC
CTGACCCTGG CCGTGGTCCG TAAGGAGGGC TTCGATGTAA TTCAGGCATG CAACCCTCCC
GACACGTACT GGTTGCTGGC GGTGCTTTAT AAGCCGTTTG GTAGGAAGTT CGTCTTCGAC
CATCACGATC TATGTCCGGA GCTGTACCGC TCGCGGTTCG ACCGGGATTC CCCGATTTTG
CTCCGCGCGC TGCTGCTGCT CGAACGGGCA AACCAGGCCA TGGCCGACCA TGTGATAGTC
ACGAATGACT CCTACCGACA GCTCGCCATG ACCAGGGGTC GAAAGCGACC GGACCGGGTG
ACCGTGGTCC GCAGCGGACC GGACCCTGAC CTCATGAAGC CAGCGTCGCA GCGCCCGGAG
CTACGGCGTG GCCGCCGACA CCTCGCCTGC TACCTGGGTG TCATGGGCCC GCAGGATGGC
GTCGACCAGT TGCTCGACGC CATCGAGCAC TATGTCCACG GTCTGCGCCG TACCGACTGC
TTCTTCGCGT TGCTCGGCTT CGGTGACTGC CTGGATGAGT TGCGGGTGAG ATCCAGCAGG
CTCGCCCTCG ATGACTGGGT CGAGTTCACC GGATTGGCCG ACGACGTGAT GATCCGCGAC
TATCTCTCCA CTGCAGCCGT CGGTTTGTCT CCCGACCCGC GCAGTCCCCT GAACGAGATC
TCGACCATGA ACAAGACCCT GGAATACATG GCCTATGGGC TACCGGTCGT GGCCTACGAC
CTGGTGGAGA CGCGGGTCAG TGCGGCCGAC GCGGCGGTCT ACGCGGCCTC GGACACAGCG
GAGGACTTCG CCCGCACGCT CGCCGGCCTG CTGGACGACC CGGAAGGTTG CCGCGTCCTC
GGAGCTCGCG GCAGGGAGCG GATCGTCAAC GAGCTGTCCT GGCAGCATTC CGCGCGCAGA
TATGTGGAGA TCTACGATCA CCTCCTCGGT GCCGGCGCCC GGCCCGTCAT TCCGGTACCC
CGACAGAGCG AAGCGCCGGT GGGGCAGGAC CGGAACGATC AGCGGGCCGT CCGGTGA
 
Protein sequence
MRKKRDISTA LIVRRNREEM PTSAAVGTGR SGPIDMGRRR RVLIVVQNLP VRIDRRVWQE 
CVALIACGYQ VSVICPRGDG ERRHQIIEGV SVWTYRAAPA ASGVLSYIFE FVYCWFRTFI
LTLAVVRKEG FDVIQACNPP DTYWLLAVLY KPFGRKFVFD HHDLCPELYR SRFDRDSPIL
LRALLLLERA NQAMADHVIV TNDSYRQLAM TRGRKRPDRV TVVRSGPDPD LMKPASQRPE
LRRGRRHLAC YLGVMGPQDG VDQLLDAIEH YVHGLRRTDC FFALLGFGDC LDELRVRSSR
LALDDWVEFT GLADDVMIRD YLSTAAVGLS PDPRSPLNEI STMNKTLEYM AYGLPVVAYD
LVETRVSAAD AAVYAASDTA EDFARTLAGL LDDPEGCRVL GARGRERIVN ELSWQHSARR
YVEIYDHLLG AGARPVIPVP RQSEAPVGQD RNDQRAVR