Gene Franean1_0032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0032 
Symbol 
ID5668458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp40475 
End bp41677 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content69% 
IMG OID641238961 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_001504406 
Protein GI158311898 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTCCT GCACGGCCGA CGTCTCCCTC GCGCCGCTGC GGAACGCGAT GGTCGAACGG 
ATCCTGGCCG CCAAGCCGGT GTCGGCCCCG GTCGAGGCTG CGATGCGCAC GGTTCCACGC
GAGCTGTTCC TTCCAAACCT GCCACCCGAG GTGGCGTACC AGGACCGGGC CGTGGTCCTC
AAACGCGACG TCTACGGGAA CCCTGTGGGC TCGGTGTCCC AGCCCTCGGT GATTGCGGCG
ATGCTGGAAG CGCTGCGGGT CGAACCTGGC CAGCGGATTC TCGAGCTGGG TTCCGGGGGC
TACGGGGCCG CGTTGTTGGC GAGGCTGGCC GGTCGGACAT GTTCGGTGGT GAGCATCGAC
CTCGATGAGA CCGTGATCCA CCGCACCCAC GAATACCTGC GCGCGGCCGG CTACACCGGG
ATCACGGCGC TGGTCGGCGA TGGGCGTTAC GGGTTCCGGC TCCGCGCCCC CTACGACCGG
ATCATCGTCA CGTTCGACAC GCTCGACGTG CCACAGGACT GGTTCGACCA GCTTGTCGAG
GGCGGCAGGG TCATCATCCC GCTACACCTG CGCGGTCTCA CCCGCACCAT CGCGTTCACC
AAGACCGGCG GGATCCTGAC CTCCGACTCG ATCCGACCGT TCGCGTTCGG ACCGATGCAA
GGATTCGGTG TCCCCCGCCG AACCACCGTG TCCCTCGCAG GCGGGGCACG GCTCGATGTC
GGCCCCGACC AGGACGTCGA CGTCGACGCC CTGTGCGCCG CCCTCGCCGA GCCACGGCAC
ACGGTGGGAA CCGGTGTCAT CCTCCCCCCC GGAAACACCC TGCTACCGAG CCTGGACCTG
TGGCTGGCCA CAGCCCAACA GACCTACGGC CGGCTCCACA CCGACACCCG CACCGATCAG
CAGGGCCCGG ACGCCTCGGC CATCCCGCCC GGTGTGCCAG CAGTCTGGTC CGACGACACG
ATCGCCTTCC TCGCCCTCGC ACAGACCACC GGCACGGGAC TCGAACTCGG TGTGATCGCC
TACGGGCCGC AGGCAGGCCG GCTCGGAGAC CAGCTCGCTC ACGACGTCCG TGCCTGGAAC
ACAGAGCACT ACGGCAAGCT CCCCACGATC AAGGTCTACC CCCGCGGAAT CGTCGAGAAG
TCCCCGCCAC CGGGCCGGCT TCTCGACCTG CCTTCCGCGC AGGTACTGAT CAGCTGGGGC
TGA
 
Protein sequence
MTSCTADVSL APLRNAMVER ILAAKPVSAP VEAAMRTVPR ELFLPNLPPE VAYQDRAVVL 
KRDVYGNPVG SVSQPSVIAA MLEALRVEPG QRILELGSGG YGAALLARLA GRTCSVVSID
LDETVIHRTH EYLRAAGYTG ITALVGDGRY GFRLRAPYDR IIVTFDTLDV PQDWFDQLVE
GGRVIIPLHL RGLTRTIAFT KTGGILTSDS IRPFAFGPMQ GFGVPRRTTV SLAGGARLDV
GPDQDVDVDA LCAALAEPRH TVGTGVILPP GNTLLPSLDL WLATAQQTYG RLHTDTRTDQ
QGPDASAIPP GVPAVWSDDT IAFLALAQTT GTGLELGVIA YGPQAGRLGD QLAHDVRAWN
TEHYGKLPTI KVYPRGIVEK SPPPGRLLDL PSAQVLISWG