Gene Franean1_1050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1050 
Symbol 
ID5669464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1232081 
End bp1233469 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content74% 
IMG OID641239979 
Producthypothetical protein 
Protein accessionYP_001505412 
Protein GI158312904 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.729107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACC ACCACTCCGA GCTCACGGCC CTGCTCTCGG ACTGGCTGCC CAGGCAGCGC 
TGGTTCGCGG GCAAGGGCCG GCCGGGCGGG AGGCTGCGCG TCGGGCAGGA CGTCCGCCTC
AGCTTCGACG CGGCGGTGAA GGCGGCGATG CACCTGCTGG TGGTGGAGGT CCGCTACGAC
GACGGCGGCC TCTCCGACCA CTACCAGGTC CCGGTGGTGA TCCGGCCGGA CGCCCCCTTC
GGCCACGAGG GGTTCCTCAT CGGCGAGTCG TCGGTGGGCC TGGTCTACGA CGGCCTGCAC
GACTCCGACG GAAGCGCCGC CCTGCTGGAC TACCTGCGCC GGGGCGCGAG CCGCGAGGGG
CTGACCGCGA CGGCGGTCGA GTCGTTGGAC GACCTGCCCG CGCACGCGGT CGGCGCCGAG
CAGTCGAACA CCTCGATCGT CTACGGCGAC GCCTACATCC TGAAGGTCTT CCGCCGCCTG
TGGCCCGGGA CGAATCCCGA TCTGGAGATC ACCCGGGTTC TGGCGCGCGC CGGCAGCGAG
CACGTGGCCC GGCCGGTGGC CTGGCTGAGC GGGCAGCTCT CCGGCGTCCC GACGACCTTC
GCGTTCATGC AGGACTTCCT GCGCACCGGG GCCGAGGGCT GGCTGCTGGC CCTGGCCAGC
GTCCGCGACC TCTACGCCGA GGGCGACCTG CACGCCGACG AGGTGGGCGG CGACTTCGCC
GCCGAGGCCG AGCGGCTGGG CGCCGCGACC GCCCAGGTGC ACCGCGACCT GGCCGCCGCG
CTGCCGACCC GGCCCGCCGA CGCCGCCGCG CTCGGCGAGG TCGCCGACTA CCTGCACAGC
CGGCTCGACG CCGCGCTGGC GGCCGTGGCG GAGCTCGCGC CGTTCGAGGC CGCACTGCGG
ACCGCCTACG ACGAGGTCCG CCGCGCCGAC CACGCGGCGC CGTTCCAGCG CATCCACGGC
GACCTGCATC TCGGCCAGGT GCTGCGGGTG GAGTCGGGCT GGGTGCTGTT CGACTTCGAG
GGTGAGCCGG CGCGGCCGGT GCCCGAGCGG ACCCTGCTCG AATCCCCGCT GCGCGACATC
GCCGGCATGC TCCGGTCGTT CGACTACGCG GCCCAGTCGA TGCTGCTCGA GCGCTCCGAC
GAGCCGTCGC TGGCCTACCG GGCGCTGGAG TGGGCCGACC GCAACCGGGA CGCCTTCTGC
CGCGGCTACG GCGCGGTGTC CGGCGCGGAT CCCCGGGACG GCGGCGCCGT CCTGCGTGGT
CTCGAGCTCG ACAAGGCTGT GTACGAAGTG CTCTACGAGG CGCGCCACCG GCCGGGCTGG
ATCAGCATCC CGCTGCGTTC GGTCGAACGG TTGACCGGCG GGCGACCCAC TGAGCTCCCC
GCGCCCTGA
 
Protein sequence
MTDHHSELTA LLSDWLPRQR WFAGKGRPGG RLRVGQDVRL SFDAAVKAAM HLLVVEVRYD 
DGGLSDHYQV PVVIRPDAPF GHEGFLIGES SVGLVYDGLH DSDGSAALLD YLRRGASREG
LTATAVESLD DLPAHAVGAE QSNTSIVYGD AYILKVFRRL WPGTNPDLEI TRVLARAGSE
HVARPVAWLS GQLSGVPTTF AFMQDFLRTG AEGWLLALAS VRDLYAEGDL HADEVGGDFA
AEAERLGAAT AQVHRDLAAA LPTRPADAAA LGEVADYLHS RLDAALAAVA ELAPFEAALR
TAYDEVRRAD HAAPFQRIHG DLHLGQVLRV ESGWVLFDFE GEPARPVPER TLLESPLRDI
AGMLRSFDYA AQSMLLERSD EPSLAYRALE WADRNRDAFC RGYGAVSGAD PRDGGAVLRG
LELDKAVYEV LYEARHRPGW ISIPLRSVER LTGGRPTELP AP