Gene Francci3_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2034 
Symbol 
ID3906751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2394076 
End bp2395302 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content67% 
IMG OID637879371 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_481137 
Protein GI86740737 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.908018 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.539174 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCAG TCCAAGATTC GAGCGCCACC GACAGTGCGG CCACGTTGCG TGCCGCGATG 
GTGGACGAGC TTCGCACCAC GGGGGACGCG ATCAAGACTG GGCAGGTGGC TGCGGCTGTG
GGCAGGGTGC CTCGGCACCT TTTCGCACCC GACGAACCGC TGGAGGCGGT CTACGCCGCA
AACAAAGCAC TTGTGATTAA ACGCGATGGG AATGGTGCGG CGCTCAGCTC GCTGTCCGCC
GCGCACATTC AGGCGGTCAT GTTGGAGCAG GCCGAACTTG AACCGGGAAT GCGTGTCCTG
GAGGTTGGCT CGGGCGGCTA CAACGCCGCC CTCATCCAGG AGATGGTCGG CGACGGAGGC
TCGGTGACGT CCGTCGACAT CGACCAGGAG ATCGTGAGCC GGGCCCGCGC CTGTCTGGAC
GCGGCCGGCT ATCGAAATGT TGAAGTCGTG GCGGCCGACG CCGAGGCCGG CGTACCCGAA
AAAGCACCCT ACGACCGGAT TATCGTGACC GCAGGGGCCT GGGACATCCC ACCAGCCTGG
CAGGAGCAGC TCACGAACGG CGGCAGGCTC GTTGTGCCGC TCCGGCTACG AGGGCTGACC
CGGTCGATCG CGTTCGACCG GGTCGACGAG GACGGTGACG TCGGTCTGGT CAGCCGCAGC
TATCGCCTGT GCGGATTCGT CCCGATGCAG GGCATCGGGA CGTTCCGCGA AAGGCTCGTC
CCTATTACCG ACGAGGTCGT GCTTCGGGTC GACGACCCGT CCCAGGAGTT CGACGTCGAG
GGGCTGCGCG ACGCGCTGGC CACGCCGAGG CTGGAACGCT GGTCCGGAGC GGCCTTCGAT
CTTCCTGACG AGTTGGAGTT CTTCCTCGTC ACGAATCTGG CCCAGGTCGC GCACCTGCAT
GTGGACGAGA CGCAGGTCCA GAACGGCCGC TTCGCGCCCT CGGCCGCCAG AGGTGTGCCC
ACGCTGGTCA GCGGTGGCAG TTTCGCCTAC CGCACCAAGC GCCCGAACGA GAAGACCGGC
GGATTCGAGA GCGGCGTCGT CGCGCACGGT CCCGACGCCG ATCGCGTCGC CGAGCACTAC
GTGGAACTGC TCCGCCGGTG GGCCAGCGAT CACCGCCGTT CCGGTGCCGC CCACATCCGA
TACGTCCCGA AGGCCGCGGG GACGCCGGCG CCGTCCGTGG GGCTGGTGCC GAAACGGCAC
GGCGCTGTCG CTGTCCGCTG GCCCTGA
 
Protein sequence
MTSVQDSSAT DSAATLRAAM VDELRTTGDA IKTGQVAAAV GRVPRHLFAP DEPLEAVYAA 
NKALVIKRDG NGAALSSLSA AHIQAVMLEQ AELEPGMRVL EVGSGGYNAA LIQEMVGDGG
SVTSVDIDQE IVSRARACLD AAGYRNVEVV AADAEAGVPE KAPYDRIIVT AGAWDIPPAW
QEQLTNGGRL VVPLRLRGLT RSIAFDRVDE DGDVGLVSRS YRLCGFVPMQ GIGTFRERLV
PITDEVVLRV DDPSQEFDVE GLRDALATPR LERWSGAAFD LPDELEFFLV TNLAQVAHLH
VDETQVQNGR FAPSAARGVP TLVSGGSFAY RTKRPNEKTG GFESGVVAHG PDADRVAEHY
VELLRRWASD HRRSGAAHIR YVPKAAGTPA PSVGLVPKRH GAVAVRWP