Gene Franean1_5165 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5165 
Symbol 
ID5673499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6192778 
End bp6195372 
Gene Length2595 bp 
Protein Length864 aa 
Translation table11 
GC content73% 
IMG OID641244019 
Productmalto-oligosyltrehalose synthase 
Protein accessionYP_001509429 
Protein GI158316921 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3280] Maltooligosyl trehalose synthase 
TIGRFAM ID[TIGR02401] malto-oligosyltrehalose synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.501183 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCAACTG ACCGGCCCGG CGGACCCGAC CGGCACGGGC CCGCCGATCG GCACTCCGCG 
TCCGACCGGC ACTCAGCGCC CGACCGCCGC CGTGAGGGTG CCGGGGCACG GGAGAACCAG
CGCGGGGAGG GCGAGGGCCG CGTCGTCCCG ACCGGTACCT ACCGGCTGCA ACTGCACATG
GAGTTCTCGT TCACCGACGC CGCGGTGATC ATTCCTTATC TCGCCGGGCT CGGGGTCTCG
CACCTGTACC TGTCACCGGT CCTGGAGGCC GCGCCGGGGT CGGCCCACGG CTACGACGTG
GTCGACCACA GCCAGATCAG CCCCGAGCTG GGCGGGCTCG GCGGGCTGCG CCGGCTGGTC
GCCGCCGCGC GCCGCGCGGG GCTCGGCATC ATCGCGGACG TCGTGCCCAA CCACATGGCG
GTGCTCACGC CCGGCACCAC GAACACGGCG TGGTGGTCGG TGCTGCGCGA GGGCCCGGAC
TCCCCCTACG CCTCGTGGTT CGACATCGAC TGGGACTCCC CCGACAACCC CGGCCGGGTC
CTGCTGCCGC TGCTCGGCCA GTCGCTGGCG GACAGCCTCG CCGCCGACGA GATCGCCGTC
GAACAGGACG AGGACGGCGA GTGGATCGTC GTCTACTACG ACCACGTGCT GCCGGTCGCC
GCCGGCACCG CCGACCCCTC CGACGTCGCC GCCACCCTGG ACGCCCAGTA CTACCGCCTG
TGCTGGTGGC GGGTCGGCGG GACGGAGCTG AACTACCGGC GCTTCTTCGA CATCCCCTCG
CTGGCCGGCC TGCGCCAGGA GGAGCCGGAC GTCTTCGCCG CCACCCACCG GCTGCTGATC
GAGCAGGTGC GCTCGGGCAA CCTGGAGGGC CTGCGGATCG ACCACCCCGA CGGCCTGGCC
GACCCCGAGG ACTACCTGCG CCAGCTCGCC GAGGCCACCG GCGGGGTCTG GACGGTCGTC
GAGAAGATCG TCGAGCGGGA CGAGGCTCTG CCCGAGGGCT GGGCCTGCGA CGGGACCACC
GGGTACGAGA CACTCAACCG GCTCACCAGG CTGTTCCTCG ACCCGGTGTC CGCGCGTGCG
ATGTCCACGC TGTACACGGA GATCACCGGG AGGTCACCGG ACTTCGCGCC GGTGGCGCGC
GAGGCGAAGA TGGACGTGCT GACGTCCGTC CTGCGGCCGG AGGTCGACCG GCTCACCGGG
CTCGCCCTGG CCGAGGCCCG CCGCGACCGC GCCGACCTCA CCCGGGCCGG CCTGCGCGAG
GCGCTGCGCG AGGTGCTCGC CGCGTTCGAG GTGTACCGCG CCTACATCCG CCCCGACGGC
ACGCCCAGCC TGGAGGCGCG CGGCCACATC GTCCGGGCCT GTGAGCAGGC CCGGCTGGCG
CTGCCGGGCC GGGCCAGCGA GATCGACCTC ATCGAGGACC TCGCGCTCGG CGGGCCGAGC
GAGTTCGTGG TGCGCTTCCA GCAGACCTGC GGCCCGGTGA TGGCCAAGGG GATCGAGGAC
ACCGCCTTCT ACCGGTACGT CCCGATGGTC GCGCTCAACG AGGTGGGCGG GGCGCCCGGG
GATCTCCCGG CCTACCCGAC GGGGCATCCG CGCAGCGCCG TCAACGAGTT CCACGAGGCG
AACATCACCA CGCAGCGGAC GTGGCCGCTG ACGATGACCA CGCTCTCCAC GCACGACACC
AAGCGGTCCG AGGACGTCCG CGCCCGGCTC GCCGTGCTCT CGGAGGACCC CCGGGGCTGG
GCCGACATCG TGCGGCGGCT GACCCGCCTC GGCGAGCGGC ACGCCGACCC GGAGGCCGGC
TGGCCCGATC CGGTGACGAT GTACCTGCTG GTACAGACGC TCGTCGGCGC CTGGCCGATC
TCGGCCGACC GGGTCACCCA GTACATGGTC AAGGCGGTGC GGGAGGCCAA GCTGTTCACG
ACGTGGACGG ACACCGACCC GTCGTACGAG GCCGCGCTGA CGAACTACGT CGAGGCCGCC
CTCGAGGACG AGGAGTTCGT CGCCGCGCTG GACACCTACG TCTCCACCCT GGTCGAGCTC
GGCCGGCAGA ACAGCCTGGC GCAGAAGCTG CTCCAGCTCA CGATGCCCGG CATCCCGGAC
GTCTACCAGG GCCAGGAGCT GTGGGACCAC TCGTTGGTGG ACCCGGACAA CCGCCGTCCG
GTCAACTTCG GCGAGCGGAC GAAGATGCTC ACCGAGCTGG GCGTCGAACC GGGCTCGGAG
CTAGCGGCGC CGCGGCGCCC GCCCGTGCTC GACGACTCCG GGGCGGCGAA GCTGCTGGTG
GTGGCGCGCT CGCTGCGCAT CCGCCGGGAC CACCCCGAGT GGTTCGGGGA GGAGGGGACG
TACCGGCCGC TGTGGGCGTC GGGGTCCGCG GCGGAGCACG TGGTCGCGTT CAGCCGCTCC
GAGTCGGTGG TGACCGTGGT CCCCCGCCTC GTGCTCGGGC TGCGCCGCGG GGGCGGGTGG
CGCGACACCA CCGTGACCCT GCCCGAGGGC CGCTGGGCGG ACGTCCTCAC CGGCCGGCGG
CACGACGGTG GCACGGCCTA CGTGCTGCGC CTGCTGCGCG ACTTCCCGGT CAGCCTGCTG
ATCCGCACGC CCTGA
 
Protein sequence
MSTDRPGGPD RHGPADRHSA SDRHSAPDRR REGAGARENQ RGEGEGRVVP TGTYRLQLHM 
EFSFTDAAVI IPYLAGLGVS HLYLSPVLEA APGSAHGYDV VDHSQISPEL GGLGGLRRLV
AAARRAGLGI IADVVPNHMA VLTPGTTNTA WWSVLREGPD SPYASWFDID WDSPDNPGRV
LLPLLGQSLA DSLAADEIAV EQDEDGEWIV VYYDHVLPVA AGTADPSDVA ATLDAQYYRL
CWWRVGGTEL NYRRFFDIPS LAGLRQEEPD VFAATHRLLI EQVRSGNLEG LRIDHPDGLA
DPEDYLRQLA EATGGVWTVV EKIVERDEAL PEGWACDGTT GYETLNRLTR LFLDPVSARA
MSTLYTEITG RSPDFAPVAR EAKMDVLTSV LRPEVDRLTG LALAEARRDR ADLTRAGLRE
ALREVLAAFE VYRAYIRPDG TPSLEARGHI VRACEQARLA LPGRASEIDL IEDLALGGPS
EFVVRFQQTC GPVMAKGIED TAFYRYVPMV ALNEVGGAPG DLPAYPTGHP RSAVNEFHEA
NITTQRTWPL TMTTLSTHDT KRSEDVRARL AVLSEDPRGW ADIVRRLTRL GERHADPEAG
WPDPVTMYLL VQTLVGAWPI SADRVTQYMV KAVREAKLFT TWTDTDPSYE AALTNYVEAA
LEDEEFVAAL DTYVSTLVEL GRQNSLAQKL LQLTMPGIPD VYQGQELWDH SLVDPDNRRP
VNFGERTKML TELGVEPGSE LAAPRRPPVL DDSGAAKLLV VARSLRIRRD HPEWFGEEGT
YRPLWASGSA AEHVVAFSRS ESVVTVVPRL VLGLRRGGGW RDTTVTLPEG RWADVLTGRR
HDGGTAYVLR LLRDFPVSLL IRTP