Gene Franean1_2166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2166 
Symbol 
ID5670566 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2600408 
End bp2601814 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content72% 
IMG OID641241087 
Productglycosyl transferase group 1 
Protein accessionYP_001506508 
Protein GI158314000 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.485753 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCAGG AACCCGCACC GCGAGGTGTG GGAATCCCCG GTCTTCAGAC CGGGGAGGAG 
GTCAACACGG AGTCTCGGAT CCGTCGGAGT GCCGACGCGG CGGCGTGCGC CCGTTCCCCG
GCGGACAAGA GCGCGGGACC CACTGCGATC GACAGACCCG CCGCGGTCGA CGGGACCACC
GGTCACGGCG AGCGGCGCGG CTCGGGAAAC GACCGCGAGC GCACGTCGGT CGGGAGGCGG
GTGGACCCCG CGCCCGCCCG CCCTGACCGC GACAGTCCTG TGACGGTGCT GGAGGTGCTC
CCGAGGATGG ACCGGGCGGG CGAGGTGATC CGCGCGGTCA ACCTCCTGCG GCGCCTGGAC
CCGCAGGAGT ACCGGCTGCT GTTCTGCGTC ACCTCGGGTG CCCCCGGATC GCTGGACGAC
GAGATCCGGG CACTGGGTGG CGAGGTCTAT TACTGCCGCG CCGACCTGAG GTTCCCGCTC
GCCTTCTACC GGCTGCTGCG TTCGGTCCGG CCGGACATCG TCCACTCGGG TGTGGCGACC
TTCTCGGGCG TGGTGCTCGC GGTGGCCCGG GTGGCCGGTG TGTCGCGCCG CGTCGCGCAC
TTCTTCAGCA GCGCGGACCA GAGCGGCGAC AGCCTCCGCG GCCGCCTCCA GCGGATGGTG
GGGCGGGTGT TGCTGGACGC GTTCGCCACC GACCTGCTCG CGGTCAGCGA GGCGGCGATG
CGCGGACGGT GGCGGGAGAC CTGGCGGCTC GACCCCCGGT GCCGGGTCAT CTACAACGGG
GTCGAGCTCG AGCCCTTCGG AGTGGCCATC GCGGGCCAGC GGCCCATGCC GGACCTCCCC
GAACTCGACG AGTTCGGGGA GGCCATGGCA CCGCAGCTGA CCGTCCTGCA CGTCGCCCGC
CCGGACCCGG TCAAGAACCG GGCCCGGGCC ATCGAGATCG TCGCGGCGAT GTGCGCGCGG
GGGCTCGACG TCCGCCTGCG GATCGTCGGG CGCCAGACCG AGGAGGAGAC CGAGCGGCTG
ATGACCCTGG CCCGGGGTCT GGGTGTGTCC GACCGGGTCG AGTTCATCGG CGAGCGGCTC
GACATCCCGA AGCTGTTGGT GACCTCGTCG CTGCTGCTGG TGACTTCGCT GCGCGAGGGG
CTGCCGAGTG TGGTGCTCGA GGCCTGCGCG GTCGGGACCC CGGTGCTGTC GTCCGACCTG
CCGGGAGTGG GGGAGATCGC CCGGGTGCTG CCCGGGATCA CCATGCTGCC GCTGGGCACC
CCCAACGAGA TCTGGGCCAA CACCGCGGCT GATCTGGCGG TCGTCCCGCC CACGATGGAC
GAGCGCCGTG AGGCGATGCG GCGGCTGCGG CGGTCCCCGT TCACGATGGA GAACTGGCAG
CGCGACATCA CGGCCGTCTG GTCGTAG
 
Protein sequence
MKQEPAPRGV GIPGLQTGEE VNTESRIRRS ADAAACARSP ADKSAGPTAI DRPAAVDGTT 
GHGERRGSGN DRERTSVGRR VDPAPARPDR DSPVTVLEVL PRMDRAGEVI RAVNLLRRLD
PQEYRLLFCV TSGAPGSLDD EIRALGGEVY YCRADLRFPL AFYRLLRSVR PDIVHSGVAT
FSGVVLAVAR VAGVSRRVAH FFSSADQSGD SLRGRLQRMV GRVLLDAFAT DLLAVSEAAM
RGRWRETWRL DPRCRVIYNG VELEPFGVAI AGQRPMPDLP ELDEFGEAMA PQLTVLHVAR
PDPVKNRARA IEIVAAMCAR GLDVRLRIVG RQTEEETERL MTLARGLGVS DRVEFIGERL
DIPKLLVTSS LLLVTSLREG LPSVVLEACA VGTPVLSSDL PGVGEIARVL PGITMLPLGT
PNEIWANTAA DLAVVPPTMD ERREAMRRLR RSPFTMENWQ RDITAVWS