Gene Francci3_3316 details

Gene Information

Locus tag: Francci3_3316
Symbol:
ID: 3904102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3930523
End bp: 3931818
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 67%
IMG OID: 637880641
Product: protein-L-isoaspartate(D-aspartate) O-methyltransferase
Protein accession: YP_482402
Protein GI: 86742002
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG2518] Protein-L-isoaspartate carboxylmethyltransferase
TIGRFAM ID: [TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase
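
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 3930523
end_bp = 3931818

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1296
print(protein_length)  # 431
```

Both results agree with the Gene Length and Protein Length fields of the record.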


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.376694
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGACG TGTCTTCACC CGTAGAGCCG AACAGCGACG TCTCCGGCCT CGATCGCGCA 
GCGCAGCTGC GCAGGAAGAT GGTTGACGAT CTTCTTGCCG AGGGCACGAT CACCTCGCGG
CCGGTCGAGG CCGCGATGCG CAAGGTCCGT CGGGAAGCCT TCGCACCGGG GGTGGAGCTG
GAGGAGGCTT ACCAGCTTTA CAACGGAGTA GTTACGAAGC GCGACGACGC TGGAAGTTCT
GTGAGCTCGG TTTCCGCTCC GCAGGTCCAG GCGTACATGC TGGAGCAGGC TGCGATCACC
CCCGGCATGA GGATCTTGGA GATCGGCTCT GGGGGGTACA ACGCTGCCTT GATCGCCGAG
CTCGTCGGTC CGGCAGGTCA GGTCACCACG GTCGACATCG ACAAGGACGT CATCGACCGG
GCGCGCCATC TTCTCGCGCA GGTCGGCTAC CCTCAGGTGA ACGTTGTGCT CGCCGACGCC
GAGTTCGGTG TGCCTGAGCA CGCCCCTTAT GACCGTATTT TGGTCACGGT CGGTGCGTGG
GACGTTCCGC CGGCTTGGGT GGCTCAGCTG GCCGAGGGCG GTCGTCTGGC GGTGCCGTTG
CAGCTGCGGG GCCTGTCGCG TGTGATCACC TTCGAGCGAG CCAACGAGTA CTTGCTCAGT
CGGGCGTCCC GGTTGTTCGG CTTCGTGCCT ATCCAGGGCG CCGGAGCGCA CCAGACCACG
TTGCTGGTCC TACGCGAGGG TGAGATCACC TTGCGCTTCG AGGAGGGGCC CCCGACCGAT
CCGGGCCTGC TGGAAGGCGT ATTCGACGCG CCGCGGGTGG AGGTCTGGAG CGGGGTCACG
ATCGGCCGTT TCGAACCGTG GGCCGGTACG CAGATGTGGC TCGCCACCGT GCTGCCGGGC
TTCTGTCGCG TTGTGCTGGA CAGAAAGCTG GACACGGGTC TGCTCTCCCC ACCGGGCAGT
CACTCCGCCG CGGTGGCCGT CGTCGACGAC GACACCCTCG CCTATGTCAC GACGCGCGGT
GCGGCGGACT CGGTGGACGT CGAGTTAGGT GTGCATGCCT TCGGTCCGGG CGCGTCCGAG
CTGGCCGAAC AGGTCGCCGA GCAGCTCCAG GTCTGGGGGC GGGAACACCG CCACAGCTCG
CCACAGTTCC GCGTTGACCC CGCAGGCACC GCCGATGACC AGCTGCCGGC TGGCCGCGTC
ATCGACAAGA AGCACAGCCG GATCACGATC TCCTGGCCGG GAGCGGCGAG CGTCGCGGCC
GGCCGGGGCG CAGCATCCCG CCGAGCAGGA GGATGA
 
Protein sequence
MTDVSSPVEP NSDVSGLDRA AQLRRKMVDD LLAEGTITSR PVEAAMRKVR REAFAPGVEL 
EEAYQLYNGV VTKRDDAGSS VSSVSAPQVQ AYMLEQAAIT PGMRILEIGS GGYNAALIAE
LVGPAGQVTT VDIDKDVIDR ARHLLAQVGY PQVNVVLADA EFGVPEHAPY DRILVTVGAW
DVPPAWVAQL AEGGRLAVPL QLRGLSRVIT FERANEYLLS RASRLFGFVP IQGAGAHQTT
LLVLREGEIT LRFEEGPPTD PGLLEGVFDA PRVEVWSGVT IGRFEPWAGT QMWLATVLPG
FCRVVLDRKL DTGLLSPPGS HSAAVAVVDD DTLAYVTTRG AADSVDVELG VHAFGPGASE
LAEQVAEQLQ VWGREHRHSS PQFRVDPAGT ADDQLPAGRV IDKKHSRITI SWPGAASVAA
GRGAASRRAG G
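
The gene sequence starts with GTG while the protein starts with M, which is expected under translation table 11, where GTG can serve as an initiation codon read as methionine. The following is a minimal sketch of how the protein sequence relates to the gene sequence, not the pipeline that produced this record. The function name `translate_cds` and the start-codon handling are assumptions made for illustration; the codon table is built from the standard NCBI base and amino acid ordering.

```python
from itertools import product

# Standard NCBI ordering of bases and amino acids. Translation table 11
# uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11. Assumption for this sketch:
# any of them in the first position is read as methionine.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "GTGACCGACG TGTCTTCACC CGTAGAGCCG"
print(translate_cds(first_block))  # MTDVSSPVEP
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence, with or without the block spacing, should reproduce the complete 431-residue protein.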