Gene Franean1_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1077 
Symbol 
ID5675672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1280648 
End bp1284049 
Gene Length3402 bp 
Protein Length1133 aa 
Translation table11 
GC content71% 
IMG OID641240009 
Productmethylmalonyl-CoA mutase, large subunit 
Protein accessionYP_001505439 
Protein GI158312931 
COG category[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG1703] Putative periplasmic protein kinase ArgK and related GTPases of G3E family
[COG1884] Methylmalonyl-CoA mutase, N-terminal domain/subunit
[COG2185] Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) 
TIGRFAM ID[TIGR00640] methylmalonyl-CoA mutase C-terminal domain
[TIGR00641] methylmalonyl-CoA mutase N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.953537 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCAG CCACCGTGGT GGGCCAGCAG GCCCCCGCCC TGCACGTGCC CGTCTGGTCG 
GTGCGCGTCG TCACCGCGGC GTCGCTGTTC GACGGTCATG ACGCGGCGAT CAACATCATG
CGCCGGATCC TGCAGGCCCA GGGAGCCGAG GTGATCCACC TCGGTCACGA CCGGGGCGTC
GACGAGATCG TGACGGCGGC GATCCAGGAG GACGCGCAGG CGGTCGCGAT CTCGTCCTAC
CAGGGTGGTC ACGTCGAGTA CTTCACCTAC CTGGTGGACC GGCTGCGGGA GCGGGGAGCC
GGGCACATCG GCGTGTACGG CGGCGGTGGC GGGGTCATCG TGCCGGACGA GATCGCGCTA
CTGCACTCCC GCGGTGTCGC GAGGATCTTC TCCCCGGAGG ATGGCCAGCG GCTCGGCCTG
CACCTGATGG TCAACAGCAT CGTCCGCGAC GGGGACGTCG ATCTCGCGGC GCGCCCGCCG
GCGGTCGAGG CGGTGCGGGC CGGCGACGAG TCCGCGCTGG CCCGGGCGAT CACCGTGCTC
GAGGATGGCC GGAACCCGGC CCTGGCCGCA GCGCTGCGCG AGGCGGCCGA GTGCGGACCG
GCCGTGCCGG TGCTCGGCAT CACCGGTACC GGCGGGTCCG GGAAGTCGTC GCTCACGGAC
GAGGTGCTGC GCCGTTTCCG TCTCGACCAG GAAGACGGCC TGCGCATCGC CGTCCTCGCG
GTCGACCCGA CCCGGCGGCG CGGCGGCGGT GCGCTGCTCG GCGACCGGAT CAGGATGAAC
GCGCTGGAAG CCGGTTTCCT CGGCGGTGAG CCCGACGGCG GCGGCCCGGC GAAGGGCCGC
CGGGGGAACG GTGCCCGGGG GAACGGTGCC CGGGGGAATG GCGTCCACGG GAGCGGGGTC
TTCTTCCGCT CGCTGGCCAC CCGCGGCGCG GGGCGCGAGC TGCCCGAGCA CCTCGCCGAC
ATCATCGCGG CCTGCAAGGC CGCCGGGTAC GACCTGATCA TCGTGGAGAC TCCGGGCATC
GGCCAGGGCG ACGCGGCGAT CGTGCCCTTC TGTGACGTCT CGATGTACGT GATGACCCCC
GAGTACGGGG CCGCGTCCCA ACTCGAGAAG ATCGACATGC TCGACTTCGC GGATGTCGTG
GCGGTCAACA AGTTCGAGCG CCGCGGAGCG GAGGACGCGC GGCGCGACGT CGCGCGTCAG
ATGGTGCGCA ACCGGGAGGA CTTCGGCGCC TCCTGGGAGG AGATGCCGGT GTTCGGCACC
TCGGCCGCGC GGTTCAGCGA CGACGGGGTG ACCGCGCTCT ACCAGTTCCT ACGCGACCTG
CTGCGCGAGC GCGGGCTGCG CGTGGTCCCG GGCCGGCTGC CGGCGGTGGA CGTCCGTGTC
TCCTCGGGGC TGACGACGGT GGTGCCGCCG GCGCGGGTCC GCTACCTCGC GGAGATCGCC
GATGCGGTGC GCGGCTACCA CGCGCACACC GAGGAGCTGT CCGAGCTCGC CCGCCGGCGC
CAGCACCTGT CCACGGCGAT CGCCGAGCTC GACGGCGCCC AGGCGGATCA CGCCAGCTCG
AATGGCGCCG GCGGTCCGGG TGACGCGCTG GCGCGGCTGC GGAGCCTCCG GGACGCGGCT
GACGCGGCGC TGGACGCGCG GACCCGCGAG CTGCTCGACG GCTGGCCCGC CCTGGTCGCC
GAGCGGTCCG GCGAGGAGAT GGTCTACACG GTGCGCGACC GGGAGATCCG CACCCCGCTG
CGGCGCACGA CGCTGTCGGG TACCGCCGTC CCCCGGGTGG CGGTGCCGCG GATCGACGAC
GACGCCCGCC TCGTGCGGTT CCTGCGGCGG GAGAACCTGC CGGGGCTCTT CCCGTTCACC
GCGGGCGTGT TCCCGTACAA GCGGGCCGGT GAGGCGCCGG CGCGGATGTT CGCCGGCGAG
GGCGACCCGT TCCGCACGAA CCGGCGGTTC CACCTGCTCT CCGCCGACTC GCCGGCGACC
CGGCTGAGTA CCGCGTTCGA CTCGGTCACG CTTTACGGGC GGGACCCGGG CCCCCGCCCG
GACGTGTACG GAAAGATCGG GACGTCCGGG GTCTCCGTGG CGACGCTGGA CGACATGAAG
GCGCTTTACG CGGGATTCGA CCTGTGCTCG CCGACGACGA GCGTCTCGAT GACGATCAAC
GGCCCGGCCC CGGCGATTCT GGCGATGTTC CTCAACACCG CGATCGATCA GCGGCTCGAG
CTGTTCCGGA CGGAGAACGG GCGCGAGCCG TCGGCGGCCG AGACGGCCGA GGTGGCAGCC
TGGGCGCTGG CGAACGTGCG CGGCACCGTG CAGGCGGACA TCCTCAAGGA GGACCAGGGC
CAGAACACCT GCATCTTCTC GACGGAGTTC GCCCTGCGCT GCATGGCGGA CGTGCAGGAA
TGGTTCATCG ACCACCGCGT CCGAAACTTC TACTCGGTTT CGATCTCCGG CTACCACATC
GCGGAGGCCG GGGCGAACCC GATCAGTCAG CTCGCGTTCA CGCTCGCCAA CGGGTTCACC
TACGTCGAGG CCTACCTGGC GCGCGGAATG CGGATCGACG ACTTCGCGCC CAACCTGTCG
TTCTTCTTCT CCAACGGCAT GGACGCCGAG TACAGCGTGA TCGGCCGGGT GGCCCGGCGG
ATCTGGGCGG TCGCGATGCG CGAGCGCTAC GGCGCGGCGG AGCGCTCCCA GAAGCTCAAG
TACCACGTGC AGACCTCGGG TCGGTCGCTG CACGCGCGGG AGATGAACTT CAACGACATC
CGGACGACCT TGCAGGCATT GTGCGCGATC TACGACAACT GCAACAGCCT GCACACCAAC
GCCTACGACG AGGCTGTGAC GACGCCCACG GAGCAGTCCG TGCGGCGGGC GCTGGCCATC
CAGATGATCA TCGATCAGGA ATGGGGGCTC GCCGGGAACG AGAACCCGCT GCAGGGCTCC
TTCGTGATCG ACGAGCTGAC CGACCTCGTC GAGGAGGCGG TGCTGGCCGA GTTCGAGCGG
ATCTCCGAGC GGGGCGGTGT TCTCGGCGCG ATGGAGACCG GTTACCAGCG CGGGCGCATC
CAGGATGAGT CGATGCTCTA CGAGCGGCGC AAGCACGACG GCTCCCTGCC GATCATCGGG
GTGAACACCT TCATCGGGCC GGACGCCGAC GACAAGGGAG CCGGTCCGCT GGAGCTCGCC
CGGGCGACCG AGGAGGAGAA GCAGTCCCAG CTGCGGCGCC TGGCGGACTT CACCAGCCGT
AACCGCGAGC CCGCCCAGCA GGCTCTCGAG CGGCTGCGGC AGGTCGCCGC GGCGGGCGGT
AACACTTTCG AGGTGCTGAT GGACGCCGTG CGGGTGTGTT CCCTGGGCCA GATCAGCGCG
GCGTTCTTCG AGGTCGGCGG ACAGTACAGG CGTAACATCT GA
 
Protein sequence
MSAATVVGQQ APALHVPVWS VRVVTAASLF DGHDAAINIM RRILQAQGAE VIHLGHDRGV 
DEIVTAAIQE DAQAVAISSY QGGHVEYFTY LVDRLRERGA GHIGVYGGGG GVIVPDEIAL
LHSRGVARIF SPEDGQRLGL HLMVNSIVRD GDVDLAARPP AVEAVRAGDE SALARAITVL
EDGRNPALAA ALREAAECGP AVPVLGITGT GGSGKSSLTD EVLRRFRLDQ EDGLRIAVLA
VDPTRRRGGG ALLGDRIRMN ALEAGFLGGE PDGGGPAKGR RGNGARGNGA RGNGVHGSGV
FFRSLATRGA GRELPEHLAD IIAACKAAGY DLIIVETPGI GQGDAAIVPF CDVSMYVMTP
EYGAASQLEK IDMLDFADVV AVNKFERRGA EDARRDVARQ MVRNREDFGA SWEEMPVFGT
SAARFSDDGV TALYQFLRDL LRERGLRVVP GRLPAVDVRV SSGLTTVVPP ARVRYLAEIA
DAVRGYHAHT EELSELARRR QHLSTAIAEL DGAQADHASS NGAGGPGDAL ARLRSLRDAA
DAALDARTRE LLDGWPALVA ERSGEEMVYT VRDREIRTPL RRTTLSGTAV PRVAVPRIDD
DARLVRFLRR ENLPGLFPFT AGVFPYKRAG EAPARMFAGE GDPFRTNRRF HLLSADSPAT
RLSTAFDSVT LYGRDPGPRP DVYGKIGTSG VSVATLDDMK ALYAGFDLCS PTTSVSMTIN
GPAPAILAMF LNTAIDQRLE LFRTENGREP SAAETAEVAA WALANVRGTV QADILKEDQG
QNTCIFSTEF ALRCMADVQE WFIDHRVRNF YSVSISGYHI AEAGANPISQ LAFTLANGFT
YVEAYLARGM RIDDFAPNLS FFFSNGMDAE YSVIGRVARR IWAVAMRERY GAAERSQKLK
YHVQTSGRSL HAREMNFNDI RTTLQALCAI YDNCNSLHTN AYDEAVTTPT EQSVRRALAI
QMIIDQEWGL AGNENPLQGS FVIDELTDLV EEAVLAEFER ISERGGVLGA METGYQRGRI
QDESMLYERR KHDGSLPIIG VNTFIGPDAD DKGAGPLELA RATEEEKQSQ LRRLADFTSR
NREPAQQALE RLRQVAAAGG NTFEVLMDAV RVCSLGQISA AFFEVGGQYR RNI