Gene Franean1_2627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2627 
Symbol 
ID5671021 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3108574 
End bp3109875 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content69% 
IMG OID641241543 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_001506963 
Protein GI158314455 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCACCT CCACGAGCGA GACCACCCAT GAACAGGTAG CCCCCCAGCC AGATGACGCC 
GCTCGCCTGC GCGAGGAGCT CATCCGGGAA CTGCATGAGC TGGAGGCGAT CGCGACGCCG
GAGGTGGAGC GGGCGGTACG GACGGTGCCA CGGCATCTGT TCATCCCCGA GATGTCGTTG
GAGGAGGCAT ACGCCGCCGA GTGCCACTAC GTGACGAAGA CGGACAAACT GGGGATCAGC
ATCAGTTCGG TGTCCGCCGC ACGGATCCAG GCCATGATGT TGGAGCAGGC CCAGGTCCGC
CCCGGGATGC GCGTCCTGGA GATCGGCGCG GGCGGCCTCA ATGCCGCGAT GCTCGCCGAG
CTGGTGGGCG AGACCGGCCA GGTCACCTCG ATCGACATCG ATCAGGACGT CATCGACCGG
GCAGCCCGGC TCCTGCCAGC GGCGGGCTAC GACAGCATCA ACCTGCTGCG CGCCGACGGG
GAGTTCGGCG CACCGGAGCA CGCTCCCTTC GATAGGATCA TCGTCACCGT CTGCGCGTGG
GACCTGCCCC CGGCCTGGAG TGACCAGCTC GCTGAGGGCG GCCGGCTCGT CGTCCCGCTG
CGGATGCGCG GCCTGACCCG CTCGGTGGCG TTCGAGCGGG AGAACAACCG TCTGGCTGCT
CGCGGCTACG AGCTGTGCGG CTTCGTGCCC ATGCAGGGCG CCGGAGAACA GCGTGAACGC
CTGGTCCCGC TCCACGGCGA CGATGTGCGC CTGCGCCTGG ACGACGACCA GCACGCCGAC
GGCGACGCCC TGGCCGCCGC GCTGGCAATG CCGCGGAGAG AGGCGTGGTC GGGGATCACG
GTCGGGAAAG GCGTCAGGTT CGACGGCCTG TACCTGTGGA TGGCCATGAA GCTGCCCGAC
TTCGGGTTGC TCGCCGCGAC GAAAGCCGCC GTGGATCACG GACTGGTCGC TCACTCCTGG
GGGCTGGGCG TTCCGACTCT TCTCGACGGG GACAGCTTCG CCTACCTGAC CTACCGCCCC
ACCAGCGAGA CGCGCGAGCA GTTCGAGTTC GGCGCCTACG GGCACGGACC CCACGCCGAG
ATGACTGTCG AAAGGCTGGC CAGCCTCATC AGGTCCTGGG ACGGCACCAG CCTGAACGCT
CGCATCAGCG CCCACCCCGC AGGCGCTCCT GACGAGTTGC TGCCCCCCGA CGCCCTCGTG
CTCGCCCGAC GCCACAGCCG TATCGCGATC ACCTGGCCGT CTCCGCCGCC GACTGACGAC
CCGGGCAGCG CACAGGTCCG GGAGGGGGCG AAGCATGAGT AG
 
Protein sequence
MSTSTSETTH EQVAPQPDDA ARLREELIRE LHELEAIATP EVERAVRTVP RHLFIPEMSL 
EEAYAAECHY VTKTDKLGIS ISSVSAARIQ AMMLEQAQVR PGMRVLEIGA GGLNAAMLAE
LVGETGQVTS IDIDQDVIDR AARLLPAAGY DSINLLRADG EFGAPEHAPF DRIIVTVCAW
DLPPAWSDQL AEGGRLVVPL RMRGLTRSVA FERENNRLAA RGYELCGFVP MQGAGEQRER
LVPLHGDDVR LRLDDDQHAD GDALAAALAM PRREAWSGIT VGKGVRFDGL YLWMAMKLPD
FGLLAATKAA VDHGLVAHSW GLGVPTLLDG DSFAYLTYRP TSETREQFEF GAYGHGPHAE
MTVERLASLI RSWDGTSLNA RISAHPAGAP DELLPPDALV LARRHSRIAI TWPSPPPTDD
PGSAQVREGA KHE