Gene Francci3_1128 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1128 
Symbol 
ID3906607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1344035 
End bp1345201 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content71% 
IMG OID637878460 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_480237 
Protein GI86739837 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.228711 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATGACC TCGTCGCGGC GGGTTCGATC AAAACGAAGG AGGTCGAGGC CGCGTTCCGC 
GCGGTGCCGC GGCACCTGTT CGCTCCCGAG GTGTCGCTCC AGGACGCCTA CGCCGACGAC
ATCGTGCGGA CCAAGCAGGA CCAGCACGGC ATCACCGTCA GCTCGGTCTC CGCTCCCTGG
CTGCAGGCCA CGATGCTCGA ACAGGCCGGG ATCAGCCCCG GCATGCGGGT CCTCGAAGTC
GGCTCCGGGG GCTACAACGC CGCCCTGATC GCGGAACTCG TCGGCCCGAC CGGTGCGGTG
ACGACCATGG ACATCGACGC CGACGTCACC GAACGAGCCC ACCACTGCCT CGCAGCGGCA
GGCTACGACC AGGTCCACGT CGTGCTGGCC GACGCCGAGC ACGGCGTCCC CGAGCACGCG
CCCTACGACC GGATCATCGT CACCGCGGGC GCCTGGGACA TCCCACCCGC CTGGACCGAT
CAGCTCGCCC CAGACGGCCG GATCGTCGTT CCCCTGCGGA TGCGGGGGCT GAGCCGGACG
GTCGCGCTCA CCCCCCGCGA CGGCGTTCTG GTCAGTCTCG ACCATCAGAT GGCCGGCTTC
GTGCCCGTGC GCGGCGACGG CGCCCATCCC GAACGCCTCG TCCAGCTTCA CGATGCCGAC
GTGGGCCTGC GGTTCGACGA GCAGGCTCCC GAACTCGACG CCGCCGCACT GCGGGCCGCG
CTACTCGCCC CGCGGGTCGA GGCGTGGTCG GGAGTGCGGT TCGGCGGGAT GGAACCGTTC
GACGGGCTGC TGCTGTGGCT TGCCAGCAGC CTCGACGACT ACGGCTTGCT CTCCCGCGCC
CGTTCCCCGA CCGCTCGAGA GCTCGTCGAC CCCGTCTCCC CGATCGGCAC GCCCACCGCC
GTGGAAGCGG ACAGCTTCGC CTACCTCACC CTGCGAAAGG TCGAGGACGG CGACGACACC
TACGAGTTCG GCGCCTACGC CCACGGGCCC GACGCCGCCC GCCTCGCCGG CCTGGTCGTC
ACGCAGGTCC AGGCCTGGAA CCGAACCCAC CGCGACGCGT CGCCGGCCAC GATCAGCGTC
TGGCCGGCCA CAACTCCGGC AGAGGCACTT CCGCCCGGCC GGGTGATCGC CAAGCGCCAC
CGCATCGTCG TCCTGAGCTG GTCCTGA
 
Protein sequence
MDDLVAAGSI KTKEVEAAFR AVPRHLFAPE VSLQDAYADD IVRTKQDQHG ITVSSVSAPW 
LQATMLEQAG ISPGMRVLEV GSGGYNAALI AELVGPTGAV TTMDIDADVT ERAHHCLAAA
GYDQVHVVLA DAEHGVPEHA PYDRIIVTAG AWDIPPAWTD QLAPDGRIVV PLRMRGLSRT
VALTPRDGVL VSLDHQMAGF VPVRGDGAHP ERLVQLHDAD VGLRFDEQAP ELDAAALRAA
LLAPRVEAWS GVRFGGMEPF DGLLLWLASS LDDYGLLSRA RSPTARELVD PVSPIGTPTA
VEADSFAYLT LRKVEDGDDT YEFGAYAHGP DAARLAGLVV TQVQAWNRTH RDASPATISV
WPATTPAEAL PPGRVIAKRH RIVVLSWS