Gene Franean1_2149 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2149 
Symbol 
ID5670549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2578176 
End bp2579357 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content61% 
IMG OID641241070 
Productglycosyl transferase group 1 
Protein accessionYP_001506491 
Protein GI158313983 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCACT TGGGAAAGCT CTCCGCCGGT ATATCTGATC TCCGAGTCGG AGTGGTTGCC 
GAGTGCTTTC CCGTCTATCG CGCCGCAGTC CTCGCCGAGC TCCTGCGAAT TCCAGACATC
AAGTATTATT TCCTGGGTGG CACCGAACCA ATCCTGCCTG GGTACCGGAC CCACATCCCG
GGCCGTCCGG AGGACTTTAT TCGGCTCCGT ACGAGGAGGA TCGGCCCCAT CCGATGGCAG
CAGGGGTTGC TCCGAGAGAT CGTGGCGAGA CGGTTCGACG TACTCATCAT TACTGGCGAC
TGGGCGTTCA TCTCGACCTG GCTCGGTGCG ATAGTTGCTC GTCTGCTGGG ACTGCCGGTG
CTGTTCTGGA CCCATGGTTG GGCTCGTCCT GAGCGAGGGC TCCGGCTTCT CGTCCGGCGC
TGCTTCTATC GGTGCGCAAC CGGCCTTCTG CTGTACAGCG AGTACGGCCG TCGCTTGGCG
GAGTCCTACG GGCTTCCCGC CAATCGGCTA TTCGTCGTGC ATAACAGCCT GGACCTACCA
GCCCAAGATG CTGCCGCACA GTCTATCGAG CCCTCGTCGG TCAAAGCAGT GCTCGAAAGG
TTTCCGGACC CAAGTCTTCC ACTTGTTGTT TCGAGCTTTC GGCTTGTCTC TGATCGAGCA
GTGGACGAAT GTATCTCTGC TGTTGCGTGG CTGGGCCGCA CCGGGTTCCC GGTCAACTAC
CTTATCGTCG GAGATGGTCC AGATCTTCCC CGGCTGCAGG CTGTGGCGGT CGAGTCAGGG
GCTGCAGTTT CCTTCTTCGG ACCCTGCTAT GACGAAGCTA CGCTGGCCAA GGTTTATGCC
GCTGCGGACG TGTCGGTAGC GCCCCGAATG GTTGGGCTGT CCGCTCTCCA GAGTCTGGCG
TATGGGACCA TGATGGTGAC CTGTGATGAC ATCACCCTAC AGACTCCTGA ATGGGAGGTC
CTGGAGGACG GTGTCACTGC GGTGCTCTAT ACCGCCGGCG ATGTATCGGC ACTGGCGAGG
GCGATGCGAA AAGTGATCGC TCTGTCGCGG TCCGGCGAGA TTGACGAGAA TCGGCTACGG
AAACGCCTGG CTGAGTCATA CAACCCGGCC GAACATGCAC GTCGAATAAA CGCGGCTGTA
CTGGCTGCGG CCCAGGGTCG GGCTGAGGCC GGTGGTAGCT GA
 
Protein sequence
MPHLGKLSAG ISDLRVGVVA ECFPVYRAAV LAELLRIPDI KYYFLGGTEP ILPGYRTHIP 
GRPEDFIRLR TRRIGPIRWQ QGLLREIVAR RFDVLIITGD WAFISTWLGA IVARLLGLPV
LFWTHGWARP ERGLRLLVRR CFYRCATGLL LYSEYGRRLA ESYGLPANRL FVVHNSLDLP
AQDAAAQSIE PSSVKAVLER FPDPSLPLVV SSFRLVSDRA VDECISAVAW LGRTGFPVNY
LIVGDGPDLP RLQAVAVESG AAVSFFGPCY DEATLAKVYA AADVSVAPRM VGLSALQSLA
YGTMMVTCDD ITLQTPEWEV LEDGVTAVLY TAGDVSALAR AMRKVIALSR SGEIDENRLR
KRLAESYNPA EHARRINAAV LAAAQGRAEA GGS