Gene Franean1_3515 details


Gene Information

Locus tag: Franean1_3515
Symbol:
ID: 5671885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4176055
End bp: 4177173
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 73%
IMG OID: 641242402
Product: methyltransferase type 11
Protein accession: YP_001507822
Protein GI: 158315314
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG2518] Protein-L-isoaspartate carboxylmethyltransferase
TIGRFAM ID:
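
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based inclusive positions and that the gene ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4176055
END_BP = 4177173
GENE_LENGTH_BP = 1119
PROTEIN_LENGTH_AA = 372


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codon count, assuming exactly one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks hold for this record: 4177173 - 4176055 + 1 = 1119 bp,
# and 1119 / 3 = 373 codons, one of which is the stop codon.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```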


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAACT GGCGGGCCCA CGCCCGCACC CTGGCGGACC AGGTCACCCA TCCCGGCTCC 
CGCTGGCACC GGGCGCTCGT CGACACACCG CGACACCGGT TCGTGCCCGC CTGGTGGGAC
GACAGCGACG GGTCGTGGGC GCTGCGCCGT GGTCCGCTCG CCGGCGCCTA CGCGGACCGG
TCGCTGGTCA CCCGGGTCGG CCCGCTGCAC GCCGACCTCG CCGAGGACGA CGACCACCCG
CAGGGCCGAC CGACCTCGTC GTCGACCGCG CCGAGCCTGG CCCTGACCAT GTACCGGTAC
GGGCATCTGT CCAGGGGCCT GGACATCGCC GACGTCGGCA CCGGGTCGGG GTACGGAGCG
GCCCTGCTCG CCCGCCGCTA CGGGTCCCAG CACGTCACCA CTCTGGACGT CGATCCGTAT
CTGGTGTCCG CCGCCGCCGG CCGGCTGGCC GCCCTCGACC TGCACCCGAC GGCGCTAACC
GTGGACGCCA CCGGCCCGCT GCCCGGCACC TACGACCGGA TCGTCTCCAT GGTTTCGGTT
CCCAGCATCC CGCCGAGCTG GCTGGCCGCG CTGCGTCCCG GCGGCCGGCT GGTCACCACC
ATCCGCGGCA CGTGGATCAT CCTCACCGCG ACGAGAACCC GCGACGGGGT GTTCGGGCAG
GTGGAGCGGG ACTGGGCCGG GTTCATGGAT GTTCGCAGCG GCCCCGACTA CCCTCCGGTC
GCGGCCGTCG ACTTCGACCG GATCGCCGAA CAGGTGGGCG TCGGCCGGTA CCCGGTGCTG
CACGTCGCCG ACGCCTGGGA GTTGTCCACC ATGCTCCACC TGGCCGTCCC GGGTATCGAG
CACCGCTATC GCCGCGAGGC CGACGGCCGG CACACCGCGC TCATGGCTCA CCCTGACGGG
TCATGGGCGC GGGGAACCGC GATCGGTACC GATCCGCCGA CGGTGCACCA GGGCGGGCCC
CGCCGGCTGT GGGAGGCGCT CGACACCGTC CGGGACGACT GGCTCCGTCT CGGATGGGCC
CCGTTCCTCG GCGCGCAGGC GATGATCCGC GACGACGGCA CTATCAAGCT CATCCGCGGC
GACTGGCGGG CAACCATACA CGCCGTCTCA ACACCCTAG
 
Protein sequence
MTNWRAHART LADQVTHPGS RWHRALVDTP RHRFVPAWWD DSDGSWALRR GPLAGAYADR 
SLVTRVGPLH ADLAEDDDHP QGRPTSSSTA PSLALTMYRY GHLSRGLDIA DVGTGSGYGA
ALLARRYGSQ HVTTLDVDPY LVSAAAGRLA ALDLHPTALT VDATGPLPGT YDRIVSMVSV
PSIPPSWLAA LRPGGRLVTT IRGTWIILTA TRTRDGVFGQ VERDWAGFMD VRSGPDYPPV
AAVDFDRIAE QVGVGRYPVL HVADAWELST MLHLAVPGIE HRYRREADGR HTALMAHPDG
SWARGTAIGT DPPTVHQGGP RRLWEALDTV RDDWLRLGWA PFLGAQAMIR DDGTIKLIRG
DWRATIHAVS TP
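
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method. It builds the codon table from the standard NCBI base ordering (table 11 assigns the same amino acids to codons as the standard table and differs in the permitted start codons, which this sketch does not model), strips the 10-base spacing used in the listing, and translates up to the first stop codon. The helper names `clean_sequence`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard NCBI codon ordering: bases T, C, A, G at each of three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used in the listing and uppercase."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 up to (not including) the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence listed above.
print(translate("ATGACCAACT GGCGGGCCCA"))  # prints MTNWRA
```

Passing the full gene sequence to `translate` should give a string that can be compared with the listed protein sequence after the same whitespace cleanup, and `gc_percent` on the full gene sequence can be compared with the GC content field.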