Gene Franean1_3518 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3518 
Symbol 
ID5671888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4179077 
End bp4180324 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content71% 
IMG OID641242405 
Productprotein-L-isoaspartate(D-aspartate) O-methyltransferase 
Protein accessionYP_001507825 
Protein GI158315317 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2518] Protein-L-isoaspartate carboxylmethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.831229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTCG GCACCACGAG CAGCGAGAGC GAGAGTCTCC GGCACGCGAT GGTCGACCGG 
CTGGCCGCCG ATCATGCGGC GAAAGGCCTT GCGCTGCGGC CCGAGGTGGA GGCGGCGATG
CGGACCGTCC CCCGCGAGCT GTACACGCCG GGCCTTCCAC CGGAGCAGGC GTACGAGAAC
GAAGCGGTTC TCAAGAAGCG GCGCGGTACG GAAGTGATCA GTTCGGTGTC GGCGCCGTTC
CTGATCGCGG AGATGCTCGG TCAGGCCGCG GACGCGCTCG GTGGGCTGGC GGGCTGTCAC
GCGCTGGAGA TCGGTAGCGG TGGCTACAAC GCGTCGCTGT TGCGGGAGCT CGTCGGCCCG
TCAGGGTCGG TCACAACCGT CGATATCGAC CCCGAGGTGA CCGGCCGGGC GGTCGCCTGC
CTGGCGGCGG CGGGCTACAC CGACCTCACC GTGGTGTGCG CGGACGCCGA GCATCCGATC
ACCTCGGGCC ACCGCTACGA CCTGATCATT GTCACCGTCG GGGCGTGGGA CATCCCGCCG
GCGTGGTGTG AGCAGCTCCG TGACGGCGGT GTGCTGGTGG TGCCGCTGCG CACGTTCGGG
ATCACCAAGT CGTGGGCACT GCGCCGCCGC GGGGACCGCC TGGTCAGTGA GAGCGACCGC
CAGTGCGGGT TCGTCTTCAT GCAGGGCGAC GGGGCCCACA AGGTGCGGTA CGTCGACATC
GCTGATGGTG TCCGCCTGCG GATGGACGAG GGCCAGCAGG TCGATCCCGC CGTTCTCGAG
GGACTTCTGG CGCAGCCTCG GGAGGAGGCC TGGGCCGGGG TGAGCCTGCC GCCCCGTACC
GACCTGATCG ACCTGAACCT GTGGCTGGCG ACCCGCCTCA CCCACGAGGC GGGCCAGTTC
GTGGTGCTGC TGGCGGAGGA GACGGCGATC GAGGCCGGCA CGGTCGCGCC CTCCTGGCAG
CACGGCACCC CGGCGACCCT GCGCGACGGC ACGCTCGCCT ACCGGTCGGC TCTGCGCTGG
ACCGGCCGGC GGTTCGATCT CGGTGCCTAC GCCCACGGGT CCAAGGCCGC CGCGGCGGCC
GGGCGGATGG TCGAACACAT GCGCGCGTGG GTGGACGCGG GCAGCCCGGC TCCGGTGCTG
CACGTCCTTC CCGCGCACAC CCCCGACGGC GACCTGCCGG CCGGAGCGGT CCTGAACAAG
CGGCACAACC GCCTCGTCCT GACCTTCACC CCCACCGCGT CCGTATAG
 
Protein sequence
MSVGTTSSES ESLRHAMVDR LAADHAAKGL ALRPEVEAAM RTVPRELYTP GLPPEQAYEN 
EAVLKKRRGT EVISSVSAPF LIAEMLGQAA DALGGLAGCH ALEIGSGGYN ASLLRELVGP
SGSVTTVDID PEVTGRAVAC LAAAGYTDLT VVCADAEHPI TSGHRYDLII VTVGAWDIPP
AWCEQLRDGG VLVVPLRTFG ITKSWALRRR GDRLVSESDR QCGFVFMQGD GAHKVRYVDI
ADGVRLRMDE GQQVDPAVLE GLLAQPREEA WAGVSLPPRT DLIDLNLWLA TRLTHEAGQF
VVLLAEETAI EAGTVAPSWQ HGTPATLRDG TLAYRSALRW TGRRFDLGAY AHGSKAAAAA
GRMVEHMRAW VDAGSPAPVL HVLPAHTPDG DLPAGAVLNK RHNRLVLTFT PTASV