Gene Francci3_3244 details

Gene Information

Locus tag: Francci3_3244
Symbol:
ID: 3904415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3838808
End bp: 3839875
Gene Length: 1068 bp
Protein Length: 355 aa
Translation table: 11
GC content: 69%
IMG OID: 637880569
Product: protein-L-isoaspartate(D-aspartate) O-methyltransferase
Protein accession: YP_482330
Protein GI: 86741930
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG2518] Protein-L-isoaspartate carboxylmethyltransferase
TIGRFAM ID: [TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase
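
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3838808, 3839875)
    print(length, expected_protein_length(length))
```

Under these assumptions the values agree with the record: the coordinates span 1068 bp, and 1068 / 3 - 1 gives 355 residues.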


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.618843
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCGTCC GCCGGGTGAA CCGCGAGGAT TTCATCCCGG ATGAGATCTG GGTCCGGGAC 
GAGGACGGGT TCTTCCTGGT CCCGCTCCGG CGCCAGGATG ATCCACAGCG GTGGTCGGAG
CTGTGCCGCG GCGACGACGG GATCACGACG CAGGTCGACG ACGGGACGGG CAGGTACGAC
GGTAGGGGCG TCATACCGAC CAGTTCCAGC AGCGCGCCGT GGGTCATGGC CCGGATGCTC
GACCTCCTCG ATGTGCGGGA CGGGATGAAC GTCCTCGAGA TCGGTACCGG GACCGGCTAT
AACGCGGCGC TGCTCGCCGA GCGGACCCCG ACCGGCCAGG TCACCACCAT CGAGATCGAT
CCGGGAATCG CGGGGCACGC CCGCGCGGCC CTCGCCAGAA TCGGCCGTCC GGTGACGGTG
GTCGTCGGGG ACGGCGCGGC GGGGTTTCCC GATCGGGCTC CCTACGACCG GATCATCGCC
ACGGCGTCGG TGGTTACCGT TCCCTACCCC TGGATCACGC AGACCCGGCC GGGCGGGCGG
ATCGTGCTGC CGTTCACGAG CGAGTTCGGT GGGGCGCTGC TGTCATTGAC CGTCGCGGAC
GGCACCGCCT CGGGCCATTT TCATGATGAT GCGGGTTTCA TGCGGCTGCG CGGCCAACGG
GCCGACAGAC CCGTCTGGTG GCTCGGCGAG GACGACGCCG ACGTCAGAAC AACCTGCCGG
TATCTGGATG AGCCCTTCGC GGACGCCGCG GCCGGATTCG CGGTCGGCCT GTGGCTGCCC
GGCTGCACCA CCGGGCAGAT CGAGGAAGGC GGCCCGGCGA GAACACTTCT GCTGTCCCAC
GCGGCGTCAC ATTCGTGGGC GTCACTGACC GCCGGGTCGG ACGGGCAGTC GGACGGGCAC
GAGGTCACCC AGTATGGGCC ACGACGGTTG TGGGACGAAC TCGAAATGGC CTACGACTGG
TGGGTGAACG CAGGGCGGCC GTCCTGCGCC CGCTTCGGGC TCACCGTCAC CCCCGACGGT
CAGACATCCT GGCTGGACTA CCCGGAGCGG GTCATCCCGG CCTCGTGA
 
Protein sequence
MSVRRVNRED FIPDEIWVRD EDGFFLVPLR RQDDPQRWSE LCRGDDGITT QVDDGTGRYD 
GRGVIPTSSS SAPWVMARML DLLDVRDGMN VLEIGTGTGY NAALLAERTP TGQVTTIEID
PGIAGHARAA LARIGRPVTV VVGDGAAGFP DRAPYDRIIA TASVVTVPYP WITQTRPGGR
IVLPFTSEFG GALLSLTVAD GTASGHFHDD AGFMRLRGQR ADRPVWWLGE DDADVRTTCR
YLDEPFADAA AGFAVGLWLP GCTTGQIEEG GPARTLLLSH AASHSWASLT AGSDGQSDGH
EVTQYGPRRL WDELEMAYDW WVNAGRPSCA RFGLTVTPDG QTSWLDYPER VIPAS
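
The two sequence blocks above can be related to each other, and to the GC content field, with a short script. This is a minimal sketch, not the database's own method. It assumes the sequences are pasted as space-separated blocks of ten characters, uses the standard amino acid assignments that translation table 11 shares with the standard code, and simplifies the set of start codons to ATG, GTG and TTG. All function names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Simplifying assumption: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str, first_is_start: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and first_is_start and codon in START_CODONS:
            residues.append("M")  # an alternative start codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    first_block = "GTGAGCGTCC GCCGGGTGAA CCGCGAGGAT"
    print(translate(first_block))
```

Applied to the first thirty bases of the gene sequence, the sketch yields MSVRRVNRED, matching the first block of the protein sequence. The GTG at position 1 is read as methionine because it is the start codon.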