Gene Franean1_3540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3540 
Symbol 
ID5671910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4198544 
End bp4200844 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content74% 
IMG OID641242427 
Product5-methyltetrahydropteroyltriglutamate-- homocysteine S-methyltransferase 
Protein accessionYP_001507847 
Protein GI158315339 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0620] Methionine synthase II (cobalamin-independent) 
TIGRFAM ID[TIGR01371] 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGAGCA CGACCGTCGC CGGCTACCCG CGGATCGGGC CGGCCCGCGA GCTGAAGACC 
ACGACCGAGG CGTACTGGGC CGACCGGCTC GGCGAGGACG AGCTGGTCTG GATCGCCCAG
CAGCTGCGCG CCGACGTGTG GAAGGATCTC GCGGCCGCCG GGCTGGACGC CATCCCGTCG
AACACGTTCT CGTTCTATGA CCAGGTCCTC GACACCGCCG TGCTCTTCGA CGCCGTGCCG
GACCGTTTCC GCGGCCTGAC CGGTGCGGGC GGCTCGCCCG CCACCCCGCT GGAGACGTAC
TTCGCGATGG CCCGCGGCGC CGACGGGGTC GCGCCGCTGG AGATGACCAA GTGGTTCGAC
ACCAACTACC ACTACCTTGT GCCCGAGCTG GACCCGTCGA CCCGGTTCCG CCTCGTCGGC
GACAAGCCGC TGCGCGAGGT CCGGGAGGCC CGGGAGCTGG GCGTGGAGAC CCGGCCGGTG
CTCGTCGGCC CGGTCACCTT CCTGCTGCTG GCGAAGGCCG CGGCTGGCGC GCCCGCGGGA
TTCCGGCCGC TGGACCTGCT CGACGACCTG GTCGGGCAGT ACGTGGAGCT GCTCGACGAC
CTCGCCTCCG ACAGCGTCGC CTGGGTGCAG CTGGACGAGC CGGCGCTCAC CCGGGACCTG
ACCCCGGCCG AGCTCGCGGC GACCGCGCGC GCCTACGCGC GCCTCGGCGG GGCGACCACC
CGGCCGAGGA TCCTGGTCTC GACCTTCTTC GGGGAGGCCG GCGAGGCCCT GCCCGTCCTA
CGGGACGCCC CGATCGACGG CATCGGCCTC GATCTCGTCG CCGGGCCGGG GAACCTCGAC
GCGGTCGCCC GGGCCGGCGG CGTCGGGGCG AAGACGCTGT TCGTCGGCGT CGTGAACGGC
CACAACGTCT GGCGCGCCGA CCTGCCGGCC GCGCTGGCCA CCTGTGCCAC GCTGAGCGGC
CTCGCCGCCG ACGTCGTCGT CACGACGTCG TGCTCGCTGC TGCACGTACC GATCGACGTC
GAGGCGGAGA CGAACCTCGA CCCGCGGCTG GCCGACCGGA TGGCCTTCGC CCGGCAGAAG
GTCGAGGAGG TCGTGCTGCT CGGCACCGCG CTGCGCGCGG GCCGCGACGC CGTGGCCGCG
GAGATCGCGG CCGCCGAGGC GCGCCGGCTC GCCGTCCCCG ACCAGCTCGT CGACCCACGG
GTGCGCGAGC GGCTCGCGGC GCTGGGCGAG GGCTGGCCGA CCCGCGGGGA TCTCGAGCAC
CGGGTCGAGG CGCAGGCCGC CGCGCTCGGG CTGCCGCCGC TGCCCACCAC GACGATCGGC
TCGTTCCCGC AGACGGCGGC GATCCGCGCG GCACGGGCCT CCCGCCGCGC CGGCACCCTC
GACGAGGCCG GGTACGTCCG GGCGATGAAG ACCGAGATCG ACCAGGTCGT CGCCCTGCAG
GAGGACATCG GACTCGACGT GCTCGTGCAC GGCGAGCCAG AGCGCAACGA CATGGTCCAG
TACTTCGCCG AGCTGCTGGC CGGCTACGCG GCCACCGAGC ACGGCTGGGT GCAGTCCTAC
GGCACCCGCT ACGTCCGCCC GCCGATCCTG TTCGGTGACG TCTCCCGGCC CGAGCCGATG
ACCCTGCGGT GGGCCGCCTA CGCGCAGTCC CGCACCAGCA GGCCGGTCAA GGGCATGCTC
ACCGGCCCGG TCACCATGCT CGCCTGGTCG TTCGTGCGCG ACGACCAGCC GCTGGAGGTG
ACCGCCCGCC AGGTCGCCCT GGCGCTGCGG GACGAGATCC ACGGCCTGGA GGCCGCCGGC
ATCCGGATCA TCCAGGTGGA CGAGCCGGCG CTGCGCGAGC TGCTCCCGCC GCGCCGGGCG
CTGTGGGGGG CCTACCTCGA CTGGGCGGTC GGCGCCTTCC GGCTGGCGAC CTCGTCCGTG
GCGGCGAGCA CCCAGATCCA CACTCACATG TGCTACTCGG AGTTCGGCGA CATCATCGGC
TCCATCGACG ACCTCGACGC GGACGTCGCC AGCGTCGAGG CCGCCCGCTC CCGGATGGAG
CTGGTGACCG ACCTGCGGAA GGCCGGCTAC CGGCGGGCCA TCGGCCCGGG TGTCTACGAC
ATCCACTCCC CCCGGGTGCC CACGGTGGAC GAGATCGAGA AGTCGCTGCG GCTGGCGCTC
GCCGCGGTAG AACCCGCCCG GCTGTGGGCC AATCCCGACT GCGGGCTGAA GACCCGCAGT
TTCACCGAGG TCGAGCCGGC GCTGCGCAAC ATGGTCACCG CGACGCGCAG GGTCCGCGAA
TCCCTCCCGG ACGGAGGCTG A
 
Protein sequence
MVSTTVAGYP RIGPARELKT TTEAYWADRL GEDELVWIAQ QLRADVWKDL AAAGLDAIPS 
NTFSFYDQVL DTAVLFDAVP DRFRGLTGAG GSPATPLETY FAMARGADGV APLEMTKWFD
TNYHYLVPEL DPSTRFRLVG DKPLREVREA RELGVETRPV LVGPVTFLLL AKAAAGAPAG
FRPLDLLDDL VGQYVELLDD LASDSVAWVQ LDEPALTRDL TPAELAATAR AYARLGGATT
RPRILVSTFF GEAGEALPVL RDAPIDGIGL DLVAGPGNLD AVARAGGVGA KTLFVGVVNG
HNVWRADLPA ALATCATLSG LAADVVVTTS CSLLHVPIDV EAETNLDPRL ADRMAFARQK
VEEVVLLGTA LRAGRDAVAA EIAAAEARRL AVPDQLVDPR VRERLAALGE GWPTRGDLEH
RVEAQAAALG LPPLPTTTIG SFPQTAAIRA ARASRRAGTL DEAGYVRAMK TEIDQVVALQ
EDIGLDVLVH GEPERNDMVQ YFAELLAGYA ATEHGWVQSY GTRYVRPPIL FGDVSRPEPM
TLRWAAYAQS RTSRPVKGML TGPVTMLAWS FVRDDQPLEV TARQVALALR DEIHGLEAAG
IRIIQVDEPA LRELLPPRRA LWGAYLDWAV GAFRLATSSV AASTQIHTHM CYSEFGDIIG
SIDDLDADVA SVEAARSRME LVTDLRKAGY RRAIGPGVYD IHSPRVPTVD EIEKSLRLAL
AAVEPARLWA NPDCGLKTRS FTEVEPALRN MVTATRRVRE SLPDGG