Gene Franean1_0851 details

Gene Information

Locus tag: Franean1_0851
Symbol:
ID: 5669267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 997386
End bp: 998636
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 77%
IMG OID: 641239780
Product: glycosyl transferase group 1
Protein accession: YP_001505215
Protein GI: 158312707
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
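
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal sketch of the arithmetic follows, assuming 1-based inclusive coordinates and a terminal stop codon that is not counted as a residue; neither assumption is stated in the record, and `gene_length` and `protein_length` are hypothetical helper names.

```python
# Coordinates copied from the record above.
START_BP = 997386
END_BP = 998636


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1251 416
```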


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0874
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.426804
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAGGAAC AGCCGGCCGC ACGGGTCCTC ATCGTCGAGC AGGGTGAGGG GCTGTGGGGC 
GCCCAGCGCT TCCTGCTGAG GCTCGCCCCG CTGCTGGAGC GACGCGGGAT CGAGCAGATT
CTCGCCGCCC CGGAGGACAG CGCGACGGGC GCGGCCTGGC GGGCGTCCGG GCGGCACCAC
GCCGTCCTGC CAGTGCCGGC GGACCGGAGG TTGCGCCGCC CGGACGGCCG CCCGAGCCCC
GCGCTGGTAC TGCGCGAATC CGGCCGGACG GCGGTCATGG CGGCCCGGAC GGCCCGGCTC
GCCCGGCGGT TCGGCGTCGA CGTCCTGCAG GCGAACAGCC GCTGGTCGCA TCTGGAGGCC
GTCGGGGCCT CGGCGCTGTG CCGGCGGCCC GCGTTGCTGC TGCTGCACGA GGAGAACGAG
CCGGACCTGG TCGGCCGGCT GCGCGGGCTG GCCGTCCGGG GGGCCGCGCG GTCCGTGGCG
GTGAGCGGCG CGGTCGCGGC GTCGCTGCCG GGGTGGGCCG CCCGACGCGC GGTGGTGATC
CGCAACGGGG TCGACACCGA CGCGCTGCGC CCCGGCCCGG CGGACCCGGC CGTGCGGGCC
AGCCTGTCGA CGGACCCGGC GGCGCCGCTG GTCCTCGCGA TGTCCCGCCT GGACCCCCGC
AAAGGCGTCG ACAAGGTGAT CCGTGCGGTG GCCGCGCTGC CGGACCACCT GAAGTCCACG
CGGCTGGCGG TCGCGGGCGC GCCCAGCCTC GACCCGGCGT CCGGGGAGTC GCTGCGCCGG
CTCGGCGCCG AACTGCTCGG TGACCGGGTG CTGTTCCTGG GGCCGCGCTC GGACATCGGC
GACCTGTTGC GCGCCACCGA TGTCCTGGTC CTCGCGTCGA GCCTGGAGGG GCTGCCGCTG
AACGTGCTGG AGGCGCAGGC GTGCGGGCGG CCGGTGGTGG CGTTCCCGAC CGCGGGCATC
CCGGAGATCG TGACCGACGG AGCGACCGGC CTGATCGCCC GCCAGGACGA CGTGGCCGAC
CTCAGCGCGA AGCTCGCCCG GGTGCTCGAC GACCAGACGC TGGCCGCTCT GCTCGGCGCC
CGCGCGCGGG CGAGCGTCGT CGCCCACCAC ACACTGGACG CGCAGGCGGA CGCGCTGGGC
GGCCTGCTGA TCAGCCTCGC CGGGCAGGCC CGCGCGCGGA GGCACCGGTC AGCCGGCCAC
GACACCGCAC ACCATGAGGT GCATCACACC ACCGCTGGAC GCAGGTCGTA G
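
The gene sequence is printed in space-separated groups of ten bases. A GC content figure like the one in the Gene Information section could be recomputed from such text roughly as follows. This is a sketch that assumes the whitespace is purely cosmetic; `gc_content` is a hypothetical helper, and the example only covers the first two groups of the sequence rather than re-deriving the record's value.

```python
def gc_content(seq_text: str) -> float:
    """Fraction of G or C bases in a sequence, ignoring all whitespace."""
    bases = "".join(seq_text.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two ten-base groups of the gene sequence.
print(f"{gc_content('GTGGAGGAAC AGCCGGCCGC'):.0%}")  # 75%
```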
 
Protein sequence
MEEQPAARVL IVEQGEGLWG AQRFLLRLAP LLERRGIEQI LAAPEDSATG AAWRASGRHH 
AVLPVPADRR LRRPDGRPSP ALVLRESGRT AVMAARTARL ARRFGVDVLQ ANSRWSHLEA
VGASALCRRP ALLLLHEENE PDLVGRLRGL AVRGAARSVA VSGAVAASLP GWAARRAVVI
RNGVDTDALR PGPADPAVRA SLSTDPAAPL VLAMSRLDPR KGVDKVIRAV AALPDHLKST
RLAVAGAPSL DPASGESLRR LGAELLGDRV LFLGPRSDIG DLLRATDVLV LASSLEGLPL
NVLEAQACGR PVVAFPTAGI PEIVTDGATG LIARQDDVAD LSAKLARVLD DQTLAALLGA
RARASVVAHH TLDAQADALG GLLISLAGQA RARRHRSAGH DTAHHEVHHT TAGRRS
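
The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 allows for an alternative start codon. A minimal sketch of that translation follows. It assumes the codon-to-amino-acid assignments of table 11 match the standard code, and it simplifies by reading the first codon as methionine without checking that it is a valid initiator. `translate_cds` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (first base varies slowest).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(seq_text: str) -> str:
    """Translate a coding sequence, ignoring whitespace in the input."""
    dna = "".join(seq_text.split()).upper()
    if not dna or len(dna) % 3 != 0:
        raise ValueError("sequence length must be a non-zero multiple of 3")
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]
    # Simplification: the first codon is always read as Met (e.g. GTG here).
    protein[0] = "M"
    # Drop a terminal stop codon if one is present.
    if protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First six codons of the gene sequence.
print(translate_cds("GTGGAGGAAC AGCCGGCC"))  # MEEQPA
```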