Gene Franean1_1051 details


Gene Information

Locus tag: Franean1_1051
Symbol:
ID: 5669465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1233736
End bp: 1235454
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 67%
IMG OID: 641239980
Product: trehalose synthase
Protein accession: YP_001505413
Protein GI: 158312905
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02456] trehalose synthase
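
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1233736, 1235454)
print(length)                     # 1719
print(protein_length_aa(length))  # 572
```

Both values agree with the Gene Length and Protein Length fields listed in the record.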


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.828904
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.512672
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGAGT CGGAACCGTT CGCGGAGCAG CCGCTGCGCG AGACCGCGGC CGAGGGCGGC 
GACACGGGGA CGTCCCAGGA CCCGCTGTGG TTCAAGCGGG CCGTGTTCTA CGAGGTCCTC
GTCCGCGGCT TCGCCGACTC CAACGGCGAC GGCACCGGTG ACCTGGCCGG GCTGGTCTCG
AAACTGGACT ACCTCGAGTG GCTCGGCGTG GACTGCCTGT GGCTGCTGCC CATCTACTCC
TCGCCGCTGC GCGACGGCGG GTACGACATC AGCGACTACT TCCAGATCCT CCCCGAGTTC
GGGGACCTGG GCGACTTCAT CAACCTCGTC GACGAGGCGC ACCGGCGCGG GCTACGGATC
ATCGCCGACC TGGTGATGAA CCACACCTCC GACGAGCACC CGTGGTTCCA GGCGTCGCGC
TCGGACCCGG ACGGGCCGTA CGGCGACTTC TACGTCTGGT CCGACACCGA CGAGAAGTAC
CCCGACGCCC GGATCATCTT CGTCGACACC GAGAAGTCGA ACTGGACCTG GGACCCGGTG
CGCGGGCAGT ACTACTGGCA CCGGTTCTTC TCCCACCAGC CCGACCTCAA CTACGACAAC
CCGGACGTCC AGGAGGCGAT GCTGGAGGTC CTGCGCTTCT GGCTCGACCT CGGCCTCGAC
GGGTTCCGCC TGGACGCCGT TCCCTACCTG TACGTGCGCG AGGGCACGAA CGGGGAGAAC
CTGCCGGAGA CGCACGAGTA CCTGCGCCGG GTCCGCAAGG AGATCGACGC CAAGTACGCC
GACAGGGTCA TGCTGGCCGA GGCGAACCAG TGGCCCTCGG ACGTCGTCCA GTACTTCGGC
AATGACGACG AGTGCCACAT GGCCTTCCAC TTCCCGCTGA TGCCGCGCAT CTTCATGGCG
GTGCGGCGGG AGTCGCGCTA CCCGATCTCG GAGATCCTGG CGCAGACCCC GGAGATCCCG
CCGAACTGCC AGTGGGGCAT CTTCCTGCGC AACCACGACG AGCTGACCCT GGAGATGGTC
ACCGACGAGG AGCGGGACTA CATGTACGCC GAGTACGCGA AGGACCCGCG TATGAAGGCG
AACATCGGGA TCCGCCGACG CCTCGCCCCG CTGCTGGACA ACAGCCGCGA CCAGATGGAG
CTGTTTACCG CCCTGCTGCT CTCCCTGCCC GGCAGTCCCG TGCTCTACTA CGGCGACGAG
ATCGGTATGG GCGACAACAT CTATCTCGGT GACCGCGACG GCGTGCGCAC CCCGATGCAG
TGGTCCCCGG ACCGCAACGC CGGGTTCTCG ACAACCGACC CGGCCCGGCT GTACCTGCCG
GTGATCATGG ACCCGGTGTA CGGCTACCAG GCGCTGAACG TCGAGGCCGA GCAGCGGATG
CCGACGTCGT TCCTGTCTTG GACCAGGCGG ATGATCGAGG TCCGCAAGCG GCATCCCGTC
TTCGGGCTCG GCACCTACGA GGAGCTCGGC GCGTCGAATC CGTCGGTCTT CGCGTATGTC
CGGGAGTTCG GTGACGACAG GGTGCTCTGC GTCGCGAACC TCTCCCGGTT CGCCCAGCCC
GTCGAGCTTG ACCTGCGGAG ATTCGCCGGC CTGGTGCCGG TGGAGCTGCT CGGCCGGGTC
CATTTCCCAC CGGTCGGCGA GCTTCCGTAC CTGCTGACAC TGCCCGGTCA CGGACACTAC
TGGTTCGCTC TGTCCAATCC GGGGGAATTC ACTCAGTAG
 
Protein sequence
MNESEPFAEQ PLRETAAEGG DTGTSQDPLW FKRAVFYEVL VRGFADSNGD GTGDLAGLVS 
KLDYLEWLGV DCLWLLPIYS SPLRDGGYDI SDYFQILPEF GDLGDFINLV DEAHRRGLRI
IADLVMNHTS DEHPWFQASR SDPDGPYGDF YVWSDTDEKY PDARIIFVDT EKSNWTWDPV
RGQYYWHRFF SHQPDLNYDN PDVQEAMLEV LRFWLDLGLD GFRLDAVPYL YVREGTNGEN
LPETHEYLRR VRKEIDAKYA DRVMLAEANQ WPSDVVQYFG NDDECHMAFH FPLMPRIFMA
VRRESRYPIS EILAQTPEIP PNCQWGIFLR NHDELTLEMV TDEERDYMYA EYAKDPRMKA
NIGIRRRLAP LLDNSRDQME LFTALLLSLP GSPVLYYGDE IGMGDNIYLG DRDGVRTPMQ
WSPDRNAGFS TTDPARLYLP VIMDPVYGYQ ALNVEAEQRM PTSFLSWTRR MIEVRKRHPV
FGLGTYEELG ASNPSVFAYV REFGDDRVLC VANLSRFAQP VELDLRRFAG LVPVELLGRV
HFPPVGELPY LLTLPGHGHY WFALSNPGEF TQ
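
The protein sequence above can be re-derived from the gene sequence with the translation table named in the record (table 11). The following is a rough sketch rather than the database's own method: it assumes the bacterial code uses the standard amino-acid assignments, that the listed alternative initiator codons (including the GTG that opens this gene) are read as methionine at the first position, and that whitespace in the sequence block is only formatting. The helper names `translate_cds` and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon order follows the usual NCBI listing: T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons assumed for translation table 11 in this sketch.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiator first codon as M and stopping at a stop codon."""
    seq = clean(dna)
    if len(seq) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence block, copied from the record.
first_row = "GTGAACGAGT CGGAACCGTT CGCGGAGCAG CCGCTGCGCG AGACCGCGGC CGAGGGCGGC"
print(translate_cds(first_row))  # MNESEPFAEQPLRETAAEGG
```

Passing the full gene sequence to `translate_cds` should yield the complete protein, and `gc_fraction` on the same input gives a value to compare with the GC content field.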