Gene Franean1_2801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2801 
Symbol 
ID5671190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3315505 
End bp3316701 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content69% 
IMG OID641241710 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_001507130 
Protein GI158314622 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.78104 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCA CCGCAGCCAC ATCACCAGGC ACGCTGCGCG ACCGCATGGT CGACCGCATC 
CTCACCAGCC AGAATCTGCC CCCGTGGGTC GAAACGGCGC TGCGTTCCGT CAAACGCCAC
CGCTACGTCC CCGAAGCGCC GCTGGCCGAC GCCTACGACG AGAAGGCGGT CATCACCCAC
ACCTTCCCCG ACGGCACCCA CCTCAGCTGC GCCTCCGGCC CCACCATCGT GGCCGCCATG
CTCACCGCCC TCGATGTCCG GCCCGATCAG CACATCCTGG AGATCGGCGC CGGCACCGGC
TACAACGCGG CCCTCCTCGC CACCCTCGTG GGCACCGGCG GCCAGGTCAC CACTATCGAC
ATCAACGCCG ACGTCACCGC CGCAGCACGG CGCAACCTTG ACGACACCGG CTTTCCCTAC
GTTCGCGTTC TCACCCGCGA CGGCGCTGAC GGCGCTGCCG AGGACGGCCC CTTCGATCGG
ATCATCGTCA CGGTCGGCGC CTGGGACATC CCACAGGCCT GGTGGGACCA GCTCGTCCCC
GATGGCCGCC TCGTCCTCCC GCTGCGCTGG CGCGGCACTA CCCGCGCTGT CGCACTCACC
AGGCAGGAAG ACCACTGGAA GTCCGACTGG GTCTTCCTGT GCGGCTTTGT GCCGATGCTC
GGCCAGCCCG GCGAGCGGAG GAGCGTCATC CACCCCGACG GCCTCGCCGC CCTGCACCAC
GATCTCGACC AACCCATCGA CACCGACGCC CTGCGCGGTG TCCTCGACCG GGAGAAGTCC
GTCGTCTGGT CTGACGTGAC CGTGCACGGT CAGGAACCCT TCGACCGCGT CTGGCTGCAC
CTCAGCGCCG TGGAAGACGG CACGGTCCGC ATCGAGGCCG ACCAGCAGGC CGTCGCCGAG
GGCCTGTGCA CACCCGCGAT CGCCTCACGC AGCCCAGCCC TGGTCAAAGA CGGTTCACTG
GCCTACTTCA CCATCCGGCG CGCCGACACC CCAGGGCGCT GGCAGCTCGG CGCCATCGGC
CACGGGCCCC TCGGTCGCCG TCTTGCCTCA CGGATCGTCG ACCAGATCGA CGCCTGGGAC
CACGACCGCA CTGCCGACCC CGAAATCCTC GCCTTCCCGG CCGGCACGCC GATCCCGAAC
CAGACGCAAG GCAAGATCAT AACCAAGCCG GAGAACCGCC TGGTACTGCG CTACTAG
 
Protein sequence
MDTTAATSPG TLRDRMVDRI LTSQNLPPWV ETALRSVKRH RYVPEAPLAD AYDEKAVITH 
TFPDGTHLSC ASGPTIVAAM LTALDVRPDQ HILEIGAGTG YNAALLATLV GTGGQVTTID
INADVTAAAR RNLDDTGFPY VRVLTRDGAD GAAEDGPFDR IIVTVGAWDI PQAWWDQLVP
DGRLVLPLRW RGTTRAVALT RQEDHWKSDW VFLCGFVPML GQPGERRSVI HPDGLAALHH
DLDQPIDTDA LRGVLDREKS VVWSDVTVHG QEPFDRVWLH LSAVEDGTVR IEADQQAVAE
GLCTPAIASR SPALVKDGSL AYFTIRRADT PGRWQLGAIG HGPLGRRLAS RIVDQIDAWD
HDRTADPEIL AFPAGTPIPN QTQGKIITKP ENRLVLRY