Gene Franean1_4310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4310 
Symbol 
ID5672665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5148085 
End bp5149662 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content70% 
IMG OID641243183 
Productmethylmalonyl-CoA mutase, large subunit 
Protein accessionYP_001508600 
Protein GI158316092 
COG category[I] Lipid transport and metabolism 
COG ID[COG1884] Methylmalonyl-CoA mutase, N-terminal domain/subunit 
TIGRFAM ID[TIGR00641] methylmalonyl-CoA mutase N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.57946 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGATT CTGTCCAGAC CAACTCCGGC CTGCCGGTGG AGCCGGTGTA CGGGCCGGCG 
GACCGCGGCA CCGAACCGCC CGCGCCGGGC AAGTTTCCGT TCACCCGCGG CAACTACGCC
TCCGGGTACC GCGGGCGCAC GTGGACGTTC CGCCAGTACT CCGGCTTCGG CACCGCCGAG
GAGTCCAACA ACCGCTACCG CTACCTGCTC GAGCAGGGCG GCACGGGCCT GTCGGTGGCC
CTCGACCTGC CCACCCAGTG CGGGTACGAC TCCGACGACC CGGAGGTGAC CGAGGAGGTC
GGCCGGGTCG GCGTCGCCGT CGACACGCTC GCCGACGCCG AGGTGCTCTT CGAGGGCATC
CCGCTGGACA AGATCAGCAC CAGCTTCACG ATCAACGGCA CCGCCGCGAT CCTGCTGGCC
TTCTACGTGG CGGCCGCGGA GCGCTCCGGG GTGCCCCGGG CCAAGCTGAC CGGGACGATC
CAGAACGACA TCCTCAAGGA GTACGCCTCC CGCGGGACGT GGATCTGGCC GCCGGAGCCG
TCGCTGCGCC TGATCGCCGA CACCATCGAG TTCTGCGCCG AGGAGGTGCC GCGCTTCAAC
GCGATCTCCG TGGCCGGCGC GCACTTCCGG GACGCCGGCG CGAACGCGGT ACAGGAGATG
GCGTTCACCC TCGCCGACGG CGTCACCTAC TGCGACACGG TGATCGAGCG CGGTCGGCTG
TCGATCGAGA AGTTCGCGCC GCAGGTGTCG TTCTTCTTCT ACACCCACGG CGACTTCTTC
GAGGAGATCG CCAAGTACCG GGCGGGCCGG CGGCGGTGGG CGACCATCGT CCGGGAGCGC
TACGGCGCCA CCGCCGACAA GGCGTCGATG TTCCGCTTCG GCTGCGTGGC CGGCGGGGCG
TCGCTGTACG CGCCGCAGGC GCAGAACAAC ATCGTCCGGG TCGCCTACGA GGCGATGGCC
TCGGTGCTCG GCGGCGTGCA GTCGATGTTC ACCGCGGCCT GGGACGAGCC GTTCGCGCTG
CCCAGCGAGG AGTCGGCGAC GCTCGCGCTG CGCACCCAGC AGATCCTCGC CTACGAGACG
GGGGTGACCC GCACCGCCGA CCCGCTGGGC GGCTCGTACT TCGTCGAGGC GCTCACCGAC
GCCACCGAGG CCCGCATCAT CGAGATCATG GACGACCTCG AGCAGCACGG CGGCATGGTC
CGCGCGATCG AGGACGGCTA CCTCCAGGGC ATGATCGCCG ACGAGGCGTA CCAGCTGCAC
CAGGACATCG AGAGCGGAAA GGTGCCGATC GTCGGCGTGA ACCGGTTCGT CTCCGACGAG
CCGGCGCCCG ACCTCGCCAC CTACGAACTG GACGCCGAAG GCCGCGAGCG GCAGCTGAAG
CGGCTGGCGA AGGTCAAGGG CGAGCGCGAC GCCGCGGCCG TCCGGGCCAG CCTCGACGCG
CTGGCCCGCG CCGCGGAGGG GAACGGGAAC CTGATGCACA ACCTGATCGA CTGCGCCAAC
GCCTACTGCA CGGTGGGCGA GATGGTCGCC ACCCTCAAGA ACGTCTGGGG CGAGTTCCAG
CAGCCGGTGG TGTTCTGA
 
Protein sequence
MSDSVQTNSG LPVEPVYGPA DRGTEPPAPG KFPFTRGNYA SGYRGRTWTF RQYSGFGTAE 
ESNNRYRYLL EQGGTGLSVA LDLPTQCGYD SDDPEVTEEV GRVGVAVDTL ADAEVLFEGI
PLDKISTSFT INGTAAILLA FYVAAAERSG VPRAKLTGTI QNDILKEYAS RGTWIWPPEP
SLRLIADTIE FCAEEVPRFN AISVAGAHFR DAGANAVQEM AFTLADGVTY CDTVIERGRL
SIEKFAPQVS FFFYTHGDFF EEIAKYRAGR RRWATIVRER YGATADKASM FRFGCVAGGA
SLYAPQAQNN IVRVAYEAMA SVLGGVQSMF TAAWDEPFAL PSEESATLAL RTQQILAYET
GVTRTADPLG GSYFVEALTD ATEARIIEIM DDLEQHGGMV RAIEDGYLQG MIADEAYQLH
QDIESGKVPI VGVNRFVSDE PAPDLATYEL DAEGRERQLK RLAKVKGERD AAAVRASLDA
LARAAEGNGN LMHNLIDCAN AYCTVGEMVA TLKNVWGEFQ QPVVF