Gene Franean1_5878 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5878 
Symbol 
ID5674201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7135435 
End bp7136565 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content73% 
IMG OID641244728 
Productglycosyl transferase group 1 
Protein accessionYP_001510130 
Protein GI158317622 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0821211 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.729107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGACCCC GGGTGCTCGT TGATGCGACC TCGGTTCCGG CCGACCGCGG TGGTGTCGGG 
CGGTATGTCG ACGGGCTCGT CGCTGCTCTG GGCGCGGCCG GCGCCGACAT GGCGCTGGTG
TGCCAGCGAT CGGACGAGGA ACGTTACAGC CGGATGGCGC CGCGGGCGAC CGTCCTGTCG
GGCCCGGCGG CCATCGCGCA CCGGCCGGCT CGGCTGGCGT GGGAGCAGAC AGGTCTTCCG
CTCGTCGCCG AACAGGTCAA TGCGGACGTC ATCCACTCGC CGCACTACAC GATGCCACTG
CGCGCGCAGC GGCCGGTATG CGTGACGATC CATGACGTCA CCTTCTTCAC CGAGCCGGAG
ATGCACACGG CGGTGAAGGG CACGTTCTTC CGGTCGGCGA TGCGGACGGC GGTGCGCCGG
GCGAGCCGCA TCATCGTCCC GTCGAAGGCC ACGCGCGACG AGCTCGTCCG CGTCCTCGAG
GGCGAGTCGA CGACGACCGA CGTCGCCTAT CACGGGGTGG ACACGACCAC GTTCCACCCG
CCGACGGAGG AGGACCGGCG CCGGGTGCGG CTGCGCCTCG GCCTCGGTGA CACCCGTTAC
GTGGCCTTCC TCGGAATGCT CGAGCCGCGC AAGAACGTCC CGAACCTGAT TCGCGGCTGG
GCGGAGGCGG TGCACTGGCG GGACGAGCCC CCGGCGCTCG TGCTGGCCGG TGGTTCCGGC
TGGGATGACG ACGTCGACGC GGCCGTCGCC TCGGTGCCGA GCCATCTGCG GGTGATCCGG
CCCGGCTACC TGCGCTTCTC CGACCTCCCG GGCTACCTGG GCGGTTCGGA GCTGGTCGCC
TATCCGTCGC ACGGTGAGGG CTTCGGCCTA CCGGTGCTGG AGGCGATGGC CTGCGGCGCC
CCCGTGCTGA CGACCCCGCG CCTCTCGCTG CCCGAGGTGG GCGGCGACGC GGTCGCCTAC
ACCCAGCCCG ACCCGGACTC GATCGCCCGC GAGATGAGCG CGCTGCTCGA CGACGCCGAG
CGTCGCGCCC AGCTCGCCGC GGCCGGGCTC GCCCGGTCCC ACGAGTTCAC CTGGGCGGCC
TCCGCGGAGG CCCACCTGGC GAGCTACGCC CGCGCGGTGG CCGACGCCTG A
 
Protein sequence
MGPRVLVDAT SVPADRGGVG RYVDGLVAAL GAAGADMALV CQRSDEERYS RMAPRATVLS 
GPAAIAHRPA RLAWEQTGLP LVAEQVNADV IHSPHYTMPL RAQRPVCVTI HDVTFFTEPE
MHTAVKGTFF RSAMRTAVRR ASRIIVPSKA TRDELVRVLE GESTTTDVAY HGVDTTTFHP
PTEEDRRRVR LRLGLGDTRY VAFLGMLEPR KNVPNLIRGW AEAVHWRDEP PALVLAGGSG
WDDDVDAAVA SVPSHLRVIR PGYLRFSDLP GYLGGSELVA YPSHGEGFGL PVLEAMACGA
PVLTTPRLSL PEVGGDAVAY TQPDPDSIAR EMSALLDDAE RRAQLAAAGL ARSHEFTWAA
SAEAHLASYA RAVADA