Gene Franean1_0542 details


Gene Information

Locus tag: Franean1_0542
Symbol:
ID: 5668959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 628130
End bp: 629374
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 11
GC content: 73%
IMG OID: 641239469
Product: glycosyl transferase group 1
Protein accession: YP_001504907
Protein GI: 158312399
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
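
The start and end coordinates, the gene length, and the protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(628130, 629374)
print(length, protein_length(length))  # prints "1245 414"
```

Under these assumptions the computed values match the 1245 bp gene length and 414 aa protein length listed in the record.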


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCCTGC GTTATGGCTT CCTCAGCACC TATCCGCCCA CCCAGTGCGG TCTCGCGACC 
TTCACGGCCG CCCTGTTCGA CGAGCTGAAC AGTCCAGCGC CCGGTGCGTC CAGTGGGGTG
GTGCGCCTCC TGGACGCCAC CGACCGAGCC GGTGTCGTGC GAGCTGGTGC CACGCGAGCC
GGTGCCACGG CCGAGCCGGC CGAGCCGGTG GCCAGACCAC CGGGTCGATC CGGTGGGGCC
GGCCGGCCGG TTCTGGTCGG AGATCTCGTC GCCGGCACGC CGGGCGGTCC GCGTGCGGCG
GCACGGCTGC TGAACGGCTT TGACGTCGTC GTGGTGCAGC ACGAGTACGG CGTGTATGGA
GGTCCGGACG GCGACGAGGT GCTCGCGGTG CTCGACGCCC TCGACGTTCC GGTGATCGTC
GTCCTGCACA CCGTGCTGGT GAAGCCGACC TCGCACCAGC GGCATGTCCT GGAGTCCGTG
GTCGCGTCGG CGGACGCGGT GGTCGTGATG ACCGAGACCG CGCGGATCCG GCTCGTCGAG
GGTTTCCAGG TCCATCCACG ACGGGTGGTG GTCATCCCGC ACGGCGCGGC GGACAACCGC
CGGGCGCCAG CGGAGCACAG CGGCGGGCCG ACCATCCTCA CCTGGGGGCT GATCGGGCCC
GGGAAGGGCA TCGAGTGGGG CATCGCCGCG ATGGCGGACC TTGCCGACCT CGATCCCGCC
CCGCACTACG TCATCGCGGG CCAGACCCAT CCGAAGGTCC TCGCGAGGGA GGGCGAGGCC
TACCGGGAGG GGCTGGCCGC CCGGGTCCGC GACCTCGGCC TGACCGGCTC GGTCAGCTTC
GACGATCGTT ACCTGGACCC GGTGTCCCTC ACGGAGCTCG TGCGCCAGGC CGACGTCGTC
CTGTTGCCGT ACGACTCGGT CGACCAGGTG ACCTCCGGTG TCCTCATCGA GGCGGTCGCC
GCGCTCCGGC CGATCGTCGC CACCCGGTTC CCGCACGCGG TCGAGCTCCT CGGCGACGGC
AGCGGACTGC TCGTGCCGCA CCGGGACCCA GCGGCCATCG CCGCCGCGGT GCGCCGCATA
ACGACAGACG AGACGGTGAG CGCCGGCCTG GCCAGCGCCG CGGCCGTCCA GGCCCCCGAC
CTGCTGTGGC CGGCGGTCGC CGGCCGCTAC CGGCGGCTGG CCGCCGGACT GGTCGCCCGC
ACCGGGAGCC GGCCGTCGAC CGCGCCCGTG CCGGTGGCCC GGTGA
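
The gene sequence is printed in space-separated blocks of ten bases, so whitespace has to be stripped before any analysis. The following is a minimal sketch of how the GC content and the start and stop codons could be checked. It assumes the sequence has been pasted into a string; the helper names are hypothetical. In translation table 11, GTG is one of the accepted alternative start codons and TGA is a stop codon.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases among the A/C/G/T bases of the sequence."""
    seq = [base for base in clean_sequence(raw) if base in "ACGT"]
    if not seq:
        raise ValueError("sequence contains no A/C/G/T bases")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def start_and_stop(raw: str) -> tuple[str, str]:
    """Return the first and last codon of the cleaned sequence."""
    seq = clean_sequence(raw)
    return seq[:3], seq[-3:]


# Toy input built from the first and last bases of the record, for illustration only.
toy = "GTGCCCCTGC GGTGA"
print(start_and_stop(toy))
```

Running `gc_percent` on the full 1245 bp sequence should give a value that can be compared with the GC content listed in the record.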
 
Protein sequence
MPLRYGFLST YPPTQCGLAT FTAALFDELN SPAPGASSGV VRLLDATDRA GVVRAGATRA 
GATAEPAEPV ARPPGRSGGA GRPVLVGDLV AGTPGGPRAA ARLLNGFDVV VVQHEYGVYG
GPDGDEVLAV LDALDVPVIV VLHTVLVKPT SHQRHVLESV VASADAVVVM TETARIRLVE
GFQVHPRRVV VIPHGAADNR RAPAEHSGGP TILTWGLIGP GKGIEWGIAA MADLADLDPA
PHYVIAGQTH PKVLAREGEA YREGLAARVR DLGLTGSVSF DDRYLDPVSL TELVRQADVV
LLPYDSVDQV TSGVLIEAVA ALRPIVATRF PHAVELLGDG SGLLVPHRDP AAIAAAVRRI
TTDETVSAGL ASAAAVQAPD LLWPAVAGRY RRLAAGLVAR TGSRPSTAPV PVAR