Gene Franean1_0784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0784 
Symbol 
ID5669200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp909433 
End bp910659 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content74% 
IMG OID641239712 
Producthypothetical protein 
Protein accessionYP_001505148 
Protein GI158312640 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.246481 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACT CCGAGCGCTG GATCGTCGTC GTCGAGGAGC CCTTCCTCCC CGCGGATGCC 
GGCGGGCGGG TGGAGACCTT CAGCTTCCTC ACGGCGGCCT CGGCGGCCGG TATCCGGATG
CAGGTCCTGG TGCCGTCCCG CACCGACCTG GACATCGCGG CCTACGAGGA CGCCGTCCCC
GGCGCGGCGG TCATCCGCCT ACCCCGGGAC GACAGCCCGC TGGCGCACCT CTCGCCGCGG
CCGTTCACGC ACGCGTCCCG CCCGGTCGGC CCGCTGCGCC GGGCGCTGGA GAACACCCCG
CCGCGCGCCG ACTCCGTCAT CAGCTACAGC TGCCGGACGT CGCATCTCGG CGAGGAGATC
GCGCGGATCT GGCGGCTTCC CCACCTGGTG CGGGCGCACA ACATCGACTC GGAGTTCTTC
CGCGTCCTGG CCCGGAACTC CACGGGCCCC CGCGCGGTCG CCTACGAGCT CGAGTACCAC
CGCCTGCGGC TCGCCGAGCA GGCCATGCAC CACTCACCGC TGGTGAACGC CATCGCGGAC
ATCTCCGTGG AGGACCACGA GTGGCGCCGC GGGCGGGCGA GCGTCCCGAC GTTCCACCTG
CCGCCGTTCC TGCCCGCCAG CACGGTCGCC GAGGCCCGCG CGGCCGGCGG CGTCGCGGAC
ACGGAGCGGG CCGGCGAGCG CCTGGTCTTC GTCGGCTCGC TGGACACGCC CACCAACATC
GAGGCGCTGC GCTGGTTCCT GGGCGGCTGC TGGCCCACGA TCCGGGTGCG CCACCCCGCG
GCCGTCCTGC AGGTCGTCGG CCGCCGTCCG GAGGACGGCC TGGCCGAGTG GCTGGCCGGC
TTCGACAGCG TTGAGCTGCA CACCGACGTG CCGAGCGTGC TCGGCTACGT GGCCGGGGCG
ACCGTGTCGG TGAACCCGAT GCGCTCCGGA TCCGGGGTCA ACATCAAGGC GATCGAGGCG
ATGTCCGCCG GGACGCCTGT CGTCAGCACC CCGACCGGCA GCCGCGGCCT GGGCTGGCGC
CCGGGCGAGC ACCTGCTGGT CGCCGACGAT CCGGGCGCGT TCGCGGACGC CGTCTGCGGG
CTGCTGGACA ACCCCTGGCT CGCCGCCGAG GTCGGGACGG CCGGGCGCGA GTTCGTCCTG
CGCGAGCTCG ATCACGCGAC GCTCATCGAC CGGGTCCGGG GCATGCTGGC CGGGCGCACC
GAGGAGACCA CAGCTCAGAC CGCTTGA
 
Protein sequence
MSDSERWIVV VEEPFLPADA GGRVETFSFL TAASAAGIRM QVLVPSRTDL DIAAYEDAVP 
GAAVIRLPRD DSPLAHLSPR PFTHASRPVG PLRRALENTP PRADSVISYS CRTSHLGEEI
ARIWRLPHLV RAHNIDSEFF RVLARNSTGP RAVAYELEYH RLRLAEQAMH HSPLVNAIAD
ISVEDHEWRR GRASVPTFHL PPFLPASTVA EARAAGGVAD TERAGERLVF VGSLDTPTNI
EALRWFLGGC WPTIRVRHPA AVLQVVGRRP EDGLAEWLAG FDSVELHTDV PSVLGYVAGA
TVSVNPMRSG SGVNIKAIEA MSAGTPVVST PTGSRGLGWR PGEHLLVADD PGAFADAVCG
LLDNPWLAAE VGTAGREFVL RELDHATLID RVRGMLAGRT EETTAQTA