Gene Francci3_1819 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1819 
Symbol 
ID3906210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2154800 
End bp2156032 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content71% 
IMG OID637879157 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_480924 
Protein GI86740524 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.474261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTC AGCATGAGGA GAGTTCCACC GCGGCTGCTG TGTTGCGCGC GGCCATGGTT 
GACGAGCTGC GGGAACTCGG CGCGGTCCGT GACCCGAGGG TGGCGCGTGC GTTGGCCGTG
GTGCCGCGGC ATTTGTTCGC GCCGGGCGCA GATCTGGCAG CGGCGTACGC GGCTACAGGG
ACGGTCGTCC CGGTGCGCGA CGCGGTGGGT CGGATGGTCA GTACGGTGTC GGCGCCGCAC
ATCCAGGCGA TGATGCTGGA GCAGGCGCGG GTGGCGCCGG GGATGCGGGT CCTTGAGGTC
GGTTCGGCGG GCTACAACGC GGCGCTGCTC GCGGAGCTCG TCGGCGAGAC AGGCGAGGTC
ACTACCGTCG ACATTCTGCC CGGGGTTGCC GAGCGCGCCC GGCGTTGTCT GGACGCGGCG
GGCTATGGCC GGGTGCGGGT GGTGCTGGCC GACGCCGAGG GTGGCGTGCC CGACCACGCG
CCCTATGACC TGGTGCTGGT GACGACGGCG GTCCGTGACA TTCCCTCCGC GTGGACGGAC
CAGCTCGCGC CCGGCGGCCG GCTGGTTGTG CCGCTGCGGC TGCGGGGCCA GACACGCTCG
GTCGTGTTCG AGGCCGACGG CGGGCGGCTG GTTGGCCACG ACGCCCAGGT CTGTAGTTTT
GTCCCGCTCG CGGGTGCTGG GGCTTTCCCG GAGCAGGTGC TGGCGTTGGA CGGGGACGAT
GTCGTGGTCC GGCTGGACGA CGCGGCCCCG GTGGACGTGG ACGAGGTCCG CCAAGCGCTG
ACCTGGCCGC GGGTGAAGAC CTGGTCGGGT GTCCACAGCG GCGAAGAACC GTTCGATGAC
CTGCTGCTGT GGCTCGCGGT CGGCCTGGAG AACTCTGGTC TGCTGCTGGC CCGGCAGGCT
GCGGTCACGC GGGGGACAGT CGCCCACGCG TGGAGCCTCG GCATGCCGGC GGTCGTAGGA
AAAGGCAGTC TCGCCTACTT CTCGCTCAGT GCGAAGGTTC CCGGCCGTGT GTCCCGCGAG
TTTGGCGTTC AGGCCCAGGG TCCCGCCGCC GAGCTGCTGA CCTCCCAGTT CATCGAACGG
ATCCGCGCGT GGAAGGCCGA TCGGCCGGCC CGTATCGAGG CATTCCCCGC GGGTACTCCG
GACGCCGCGC TGCCCGCCGG CGGCGTGTTT CTCGATCGCC CTCATCGGCG TGTCGTCGTC
TCCTGGCCGT CCGGCCCGGA GGTTCCCGTC TGA
 
Protein sequence
MTIQHEESST AAAVLRAAMV DELRELGAVR DPRVARALAV VPRHLFAPGA DLAAAYAATG 
TVVPVRDAVG RMVSTVSAPH IQAMMLEQAR VAPGMRVLEV GSAGYNAALL AELVGETGEV
TTVDILPGVA ERARRCLDAA GYGRVRVVLA DAEGGVPDHA PYDLVLVTTA VRDIPSAWTD
QLAPGGRLVV PLRLRGQTRS VVFEADGGRL VGHDAQVCSF VPLAGAGAFP EQVLALDGDD
VVVRLDDAAP VDVDEVRQAL TWPRVKTWSG VHSGEEPFDD LLLWLAVGLE NSGLLLARQA
AVTRGTVAHA WSLGMPAVVG KGSLAYFSLS AKVPGRVSRE FGVQAQGPAA ELLTSQFIER
IRAWKADRPA RIEAFPAGTP DAALPAGGVF LDRPHRRVVV SWPSGPEVPV