Gene Franean1_1335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1335 
Symbol 
ID5669746 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1607244 
End bp1608446 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content73% 
IMG OID641240266 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_001505693 
Protein GI158313185 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID[TIGR00080] protein-L-isoaspartate(D-aspartate) O-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACCC GCACCCCCGA GCAGCTCCGC GACGACCTGG TCGCCCACAT CCGCCGCTGG 
GGCACCTTCC GCACCGGTCA GGTCGAAGCC GCGTTCCGCA CCGTGCCGCG GCATCTGTTC
CTGCCCGACG TCGACCTGGA GACCGCCTAC GGCACGCAGG TCGTCGTGAC CCGCCGCGCA
TCCGACGGCA CGGCGCTGTC CTCGGCCTCG CAGCCCGACC TCGTCGCCGC CATGCTCGAA
CAGGCTGACG TCCACCCCGG TCACCGCATC CTGGAGATCG GCACCGCCAC CGGCATCAAC
GCCGCTCTCC TGGCAGAACT GGCCGGCCCG ACCGGTCATG TGACCACCAT CGAGATCGAC
GAGGACCTCA TCGGCGGCGC CCGCACCGCG CTGGCGGCAG CCAGCTACGA CCAGGTCGAC
GTGATCCACG GGGACGGCGC GGTCGGCTAC CCGGATGGGG CATCTTACGA CCGGATCGTG
ATCACGGCGG GGGCGTTCGA CCTTGCAGCA GCCTGGTGGG AGCAGCTCGC CCCCGCCGGC
CGGATCGTCG TACCACTACG CCTGCACGGA AGCGGCCTGA CCCGCTCCCT CGCGTTCGAC
GCGGGCGAGC CAGGCCGGCT GGTCAGCCGC TCGGCGCTCG TCTGCGGATT CATTCCGCTG
CGCGGCGCCG GCGCCCACGC CGGCCGCACG CTGACCCTCG CGGACGGAGC TGTCCTGCGC
GTCGACGACC ACGACCCCGT GGACGAGCCT GCGCTACGCG CCGCGGCAGA CAGCCCTCCC
CACGCGCTGT GGACCGGTCT GACGATCCAT GACCACGAAC CGACCGCCCA CCTCGACCTG
TGGCTCGTCA CCGCCGGCAC CCGCTTCGCC CGCCTCGCCG TCGACACCAC CGCCCGCCAC
GACAGCCAGC TCGGCCCGGC GCGGCGCTGG GCCGGGGCCA CCATCCACGA CGGCACCACG
ATCGCCTACG TCACCCTGCG CCCACTCGCC CCCGACACCG ACGAGCTCGG CGTCACCGCC
CACGGTCCCC ACGCGACCAC CCTCGCCACG CAGCTCACCG ACCTGCTCCA TCACTGGCGA
AAGGAAGGCC CCGCCGAGCC CGTCATCACC GCCCACCCCG CAGGCACCCC CGACGACCAG
CTCTCCGCTG GACACCGCAT CGACCGGCCG AACAGCCGCC TCACCGTCCG CTGGCAGCCC
TGA
 
Protein sequence
MTTRTPEQLR DDLVAHIRRW GTFRTGQVEA AFRTVPRHLF LPDVDLETAY GTQVVVTRRA 
SDGTALSSAS QPDLVAAMLE QADVHPGHRI LEIGTATGIN AALLAELAGP TGHVTTIEID
EDLIGGARTA LAAASYDQVD VIHGDGAVGY PDGASYDRIV ITAGAFDLAA AWWEQLAPAG
RIVVPLRLHG SGLTRSLAFD AGEPGRLVSR SALVCGFIPL RGAGAHAGRT LTLADGAVLR
VDDHDPVDEP ALRAAADSPP HALWTGLTIH DHEPTAHLDL WLVTAGTRFA RLAVDTTARH
DSQLGPARRW AGATIHDGTT IAYVTLRPLA PDTDELGVTA HGPHATTLAT QLTDLLHHWR
KEGPAEPVIT AHPAGTPDDQ LSAGHRIDRP NSRLTVRWQP