Gene Franean1_0058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0058 
Symbol 
ID5668484 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp72866 
End bp74167 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content70% 
IMG OID641238987 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_001504432 
Protein GI158311924 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCGTC TGTGGCGCCG AGAGACTGGG GCAAGACCGC CGGAAGGATC TTCGTTGATT 
CAGATGATTG GCACGTCCAG CGCCCCGGAC CGGGCGACTG AGGTCCGCAA CGCGCTGGTC
GACAAGCTCT GCGTGACCGG CATGATCACC TCGCTGGAGG TGGAGCGGGC GTTTCGTGCC
GTGCCGCGGC ACCTGTTCGT CCCTGAGGGC ACCTCGCTGG AGGTTGCCTA CAACGCCGAT
GACTCGGTGG CGGTCAAGCG GGCCGCGGAT GGCGTGATCA TCTCGTCGAT CAGCGCGCCG
TTCATTCAGG CTCGGATGAT CGAGCAGGCC GGGCTCGGAC CCGGGATGAG CGTGGTCGAG
ATCGGGTCCA GCGGCTACAA CGCCGCGCTC CTCGCTGAGA TCGTCGGCCC CTCGGGCCGG
GTGGTCAGCG TGGACATCGA TCCTGAGGTG ACCGATCGAG CACGCGCGCT TCTGGAAGCG
ACCGGGTACG CGGACCGGGT CACGGTCGTT CGCGCGGACG CGCAGGACGG TGTGGCCGAT
CACGGTGACC GGGTCGACGC GATCCTGGTG ACGGCCGGCG CCTGGGATCT CTCGCCGGCG
TGGCTCGCGC AGCTGGCCGA GGACGGCCGG ATCGTCGTGC CACTGCGAAT GAACGGGATC
ACCCGGTCGA TCGGGTTCCG CCGCGACGGC GATCATCTGG TGAGCACCTC GGCCGAGGTG
TGCGGGTTCG TCCCCATGCA GGGCGCCGGT GCCCACGACG AGCGGGTCTT CCTTCTGCCG
GATGGGCACG GCCGTCATGT CAGGCTGCGT TTCGACGCCA ACCCGCCCCA GGACCTGGAT
CTTCTGGACG GTGTCCTGGC GACACCTCGG TCCGAGGTGT GGTCCGGGGT CACGATCCGT
AACGGTGTGT CGTTCGCCGA CCTGCACCTG TGGTTCGCGT GCTTCCTGTC TGGCTTCTGC
CGGCTGGCGG CGGACGAGGG AACCGATCTG GCTGCTGAAC GCAAGAGCTG GTTCCCCTTC
GGGGCTGTGC AGGGCGACTC GTTCGCCTAT CTGGCGGTGC GCCCGGCGCT GGACGGCGCC
GGTGTCGAGT TCGGCGGGCG AGCGTACGGC CCGCACGGCG AGGCCGCTGC TACCGCGCTC
GTCGAGCAGA TCCAGGCCTG GGATCGCCAG GCACGGGGCG GACCGGCCCC GACCTTCGCC
TACTGGCCGA CGGGCGCCGA CCGTCCGCCG GCCGGCGAGG GCACGGCCGT GCTGGCGAAG
ACCCACGGCC TGGTCTCGAT CTCCTGGCCG GTCGCTGGCT GA
 
Protein sequence
MFRLWRRETG ARPPEGSSLI QMIGTSSAPD RATEVRNALV DKLCVTGMIT SLEVERAFRA 
VPRHLFVPEG TSLEVAYNAD DSVAVKRAAD GVIISSISAP FIQARMIEQA GLGPGMSVVE
IGSSGYNAAL LAEIVGPSGR VVSVDIDPEV TDRARALLEA TGYADRVTVV RADAQDGVAD
HGDRVDAILV TAGAWDLSPA WLAQLAEDGR IVVPLRMNGI TRSIGFRRDG DHLVSTSAEV
CGFVPMQGAG AHDERVFLLP DGHGRHVRLR FDANPPQDLD LLDGVLATPR SEVWSGVTIR
NGVSFADLHL WFACFLSGFC RLAADEGTDL AAERKSWFPF GAVQGDSFAY LAVRPALDGA
GVEFGGRAYG PHGEAAATAL VEQIQAWDRQ ARGGPAPTFA YWPTGADRPP AGEGTAVLAK
THGLVSISWP VAG