Gene Franean1_6752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6752 
Symbol 
ID5675065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8212786 
End bp8213994 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content70% 
IMG OID641245601 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_001510992 
Protein GI158318484 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.20607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAGTACCG TGACCTCCAC GCACGACGCC ACCCCCGATG ACCTGCGCGC GGCCATGGTC 
GACCGCCTCG CCACCTCGGG AGCCATCCTC ACCGCCGCGG TCGAGGACAC GATGCGCACC
GTGCCCCGCC ACCTGTTCGT GCCCGACGCC GCCCCCGGCG AGGCCTACGC CGAGCAGGCC
GTCATCACCA AACGCGCCCC TGACGGCACA TCCCTCAGTT ACGCCTCCGG GCCGGGAATC
GTGGCGATGA TGCTGGAGCA GCTCATCGTC CTACCCGGCC AGCGGATCCT CGAGATCGGG
ACCGGCACCG GCTACAACGC CGCCCTCCTC GCGCACCTGG CCGGGCCCGG CGGGCACGTC
ACCACCATCG ACATCGACCC CGACATCACC AGTGCTGCGA CCAGCGCCCT CGCCGCAGCC
GGCTTCGAAA AGGTCACGGT CCTCACCGGG GACGGCACCT TCGGCGACCC GGACAGCCAC
GTGCACGATC GGCTGATCGC CACGGTGGGA GTCTGGGACA TCTCCAGCGC CTGGTGGGAC
CAGCTCGCCC CGGGCGGCCG GCTCGTCCTG CCCCTGCATT GGCGGGGCCA GACCCGCGCG
GTGACCTTCC GCCACGACGG CAACCGGATG ATCAGCGAGT CGGTAGAGCT CTGCGGCTCC
GTCCCGATGA TCGGGCAAGC GGGCGAACGG ACCGCCGCGA TCCACCATGA TGGCCTCGTC
GCCCTGCACT GGGACCAGGA CCAAGCCATC GACCCGGGCA CTCTCACCGG CGTACTCGAC
AGGAGGCGGA CAATCGCCCT CTCCGGGGTC GAGGTCGGCC CCTACGACCC GTTCGACGGA
ATCTGGCTGC GCCTGACCGC CACCGAGCCC GGATGCTGCC GCATCGAAGC CACCCCCGAA
GCGGTGAAGT CCGCCCTGTG CGCACCCGCG ATACCCCAGC GCAGCCCCGC TCTCGTCGAC
CGCGACTCGC TGGCCTACCT CAGCCTCGCC CGCAACCACA CCGGGCCCGG CCGTCACACC
CTCAGCGTCA TCAGCCACGG CCCGAACCGT CATAACCTGG CCGACCGGCT GATCGAAGGC
ATCCACACCT GGAACAGCGA CCGCACCGCC ACCCCCACCG TCACCGCCCA CCACGGGCCC
GCCGGTCTCG CGCACCAGCC GCCCGGACTT ATCCGGAAAC CGGACAGCCC TCTGACGATT
ACCTTCTGA
 
Protein sequence
MSTVTSTHDA TPDDLRAAMV DRLATSGAIL TAAVEDTMRT VPRHLFVPDA APGEAYAEQA 
VITKRAPDGT SLSYASGPGI VAMMLEQLIV LPGQRILEIG TGTGYNAALL AHLAGPGGHV
TTIDIDPDIT SAATSALAAA GFEKVTVLTG DGTFGDPDSH VHDRLIATVG VWDISSAWWD
QLAPGGRLVL PLHWRGQTRA VTFRHDGNRM ISESVELCGS VPMIGQAGER TAAIHHDGLV
ALHWDQDQAI DPGTLTGVLD RRRTIALSGV EVGPYDPFDG IWLRLTATEP GCCRIEATPE
AVKSALCAPA IPQRSPALVD RDSLAYLSLA RNHTGPGRHT LSVISHGPNR HNLADRLIEG
IHTWNSDRTA TPTVTAHHGP AGLAHQPPGL IRKPDSPLTI TF