Gene Francci3_1228 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1228 
Symbol 
ID3902973 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1470493 
End bp1472118 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content71% 
IMG OID637878561 
Productmalate synthase 
Protein accessionYP_480335 
Protein GI86739935 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01344] malate synthase A 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.112102 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.512222 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTCGT TGGACGGGGT GACGGTGCAC GGGGTGTCCG CGGTGCGGAC CGTGCCGGGG 
CTCACCGCGG AGCAGGTCGA CGCGGTGCTC TCCGACAACG CGCTGGCCTT CGTCGCCGGG
TTGCACCGCA CGTTCGCCGG CCGGCGTGCC GAACTGCTCG CCGCCCGGGC CGCCCGCCGC
GCCGCGATCG CAGCCGGGGC CACCCTGGAT TTCCTGCCGC AGACCGCCGA CATCCGGGCC
GGGAACTGGC GGGTCGCGTC GCCGGCGCCG GGGCTGGTCG ACCGTCGGGC CGAGATCACC
GGCCCGACCG ACGCCAAGAT GCTCATCAAC GCGCTCAACA GCGGTGCCCG GGTCTTCATG
GCGGACCTCG AGGACGCCAA CGTGCCGACC TGGTCGAACA TGGTCGTCGG ACAGCACAAC
CTGAGTGAGG CCGTCGCCGG CACGCTCGCC TTCACCTCGC CCGACGGCCG TCGTTACGAG
CTCGACGAGT CCACGGCGAC GCTCGTGGTG CGCCCGCGCG GCTGGCACCT GCCGGAGCGG
CACGTCACCG TCGACGGGGA GCCGATCGTC GCCGCGCTCT TCGACGCCGG AATGTACCTG
GTCCGCAACG CGCACGCCCT GCGGGCTACG GGCGTGGCGC CGTACTTCTA CCTGCCGAAG
CTGGAGAGTC ATCTCGAGGC CCGGCTGTGG AACGACGTGT TCACCGCGGC GCAGGCCGAG
CTCGGCCTGC CCGTCGGCAC CATCCGGGCG ACCGTTCTCA TCGAGACGCT GCCCGCCGCC
TTCGAGATGG AGGAGATCCT CTACGAGCTG CGGGAGCATT CCGCCGGACT CAACGCTGGC
CGCTGGGACT ACATGTTCTC CACCATCAAG ACGTTCGCGT CCCGGCCGAC TGAGTTCCTG
CTGCCCGACC GCAACGGCGT GACGATGACG GTGCCGTTTC TGCGCGCCTA CACCGAGCTG
CTGGTCTCCA CCTGTCATCG CCGCGGCGCG CACGCGATCG GCGGGATGGC GGCGTTCATC
CCGTCGCGGC GCGACCCCGA GATCAACGCC GCGGCCCTGG CGAAGGTGCG CGCCGACAAG
GAGCGGGAGT CCGCGGACGG GTTCGACGGG AGCTGGGTGG CGCACCCTGA CCTGGTGCCG
GTGTGCACCG AGGTCTTCGA TGCGGTGCTC GGCGACGAGC CGAACCAGCT CACCCGGTTG
CGTGACGACG TCAAGGTCGG CGCCGGCGAC CTGCTGGCGG TGCGCGACAC TCCGGGTTCC
GTCACCGCGG CCGGGGTCCG CGGGAACATC AGCGTCGGGG TGCGCTACCT GGAGAGCTGG
CTGCGCGGGA TCGGGGCGGT CGGCATCGAC AACCTGATGG AGGACGCCGC CACCGCTGAG
ATCTCCCGTA GTCAGATTTT CCAGTGGATA GCCGCCGGGG TCGTGCTCGA CGACGGCCGT
CCGGTCACGG CCGATCTGGT CCGGACCGCT CTGGCGGAGG TGCTGGACCA GATCCGGCTC
TCCATCGGCG CCGCCGCCTT CGACAACGGT CGCTGGAAGG ACGCGGCGGC GGTGTTCGAG
GAGACGGCGC TCGGCGAGAC CTTCGTCGAG TTCCTTACTC TTCCCGCCTA CGAGCGGATC
GACTGA
 
Protein sequence
MGSLDGVTVH GVSAVRTVPG LTAEQVDAVL SDNALAFVAG LHRTFAGRRA ELLAARAARR 
AAIAAGATLD FLPQTADIRA GNWRVASPAP GLVDRRAEIT GPTDAKMLIN ALNSGARVFM
ADLEDANVPT WSNMVVGQHN LSEAVAGTLA FTSPDGRRYE LDESTATLVV RPRGWHLPER
HVTVDGEPIV AALFDAGMYL VRNAHALRAT GVAPYFYLPK LESHLEARLW NDVFTAAQAE
LGLPVGTIRA TVLIETLPAA FEMEEILYEL REHSAGLNAG RWDYMFSTIK TFASRPTEFL
LPDRNGVTMT VPFLRAYTEL LVSTCHRRGA HAIGGMAAFI PSRRDPEINA AALAKVRADK
ERESADGFDG SWVAHPDLVP VCTEVFDAVL GDEPNQLTRL RDDVKVGAGD LLAVRDTPGS
VTAAGVRGNI SVGVRYLESW LRGIGAVGID NLMEDAATAE ISRSQIFQWI AAGVVLDDGR
PVTADLVRTA LAEVLDQIRL SIGAAAFDNG RWKDAAAVFE ETALGETFVE FLTLPAYERI
D