Gene Francci3_3751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3751 
Symbol 
ID3906035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4499491 
End bp4500717 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content71% 
IMG OID637881077 
Productmalate dehydrogenase 
Protein accessionYP_482831 
Protein GI86742431 
COG category[C] Energy production and conversion 
COG ID[COG0281] Malic enzyme 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.131564 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCAA CCACTGACCA GCCGACTCTC GGTTCCACCA CGTCGGACTT CGCCGGGCTC 
AGCCCGCGGC CGTCCGACGA CGATCCCGCG TTCGCCCTGC ACCGCGGAGG AAAGATCAAC
ATCAACGCTA CCGCTCCATT GAACAGTCGG GAAGACCTGT CCCTGGCCTA TACGCCCGGA
GTCGCGCGGG TGTGTACCGC CATCGCCGGG ACTCCGGCCC TTGCCGACGA CTACACCTGG
CGGTCTAACA CCGTTGCCGT GGTCACGGAT GGGACGGCCG TGCTCGGCCT CGGTGACATC
GGCCCCGCGG CCTCCCTGCC GGTGATGGAG GGCAAGGCCG CGTTGTTCAA GCGCTTCGGT
GGTGTGGACG CGGTGCCACT GGCCCTTGCC TGCACGGACG TCGAGGAGAT CGTCGACACC
GTCGCGCGCC TCGCTCCGAG CTTCGGCGGG ATCAACCTCG AGGACATCTC GGCACCGCGC
TGCTTCGCCA TCGAGCGCCT CCTGCAGGAC CGCGTCGACA TCCCGGTCTT CCACGACGAC
CAGCACGGAA CGGCCATCGT CGCGCTGGCC GCGCTGGAGA ATGCGGCGAA GCTCACCGGA
CGCACCCTCG GTGATCTGCG TGCCGTGGTG TCCGGCGCGG GCGCTGCCGG CGCCGCCGTG
GCCCGCATCC TGCTGGCCGC CGGCATCGGG GACATCGCGG TTGGCGACAG CCGCGGCATC
CTCTACGCCG GTCGGGACAA CCTCACCCCG AGCAAGGCCG CGCTCGCCGC CGAGACCAAC
CGTGCCGGCC TGGCCGGGTC CATCACCGAC GCGCTCGCCG GCGCCGATGT CTTCCTCGGG
CTGTCCGCCG GGCAGGTCCC GGAGGAGGCG GTCGCCACCA TGGCGCCGGA GGCGATCATT
TTCGCGATGG CCAACCCCGA TCCCGAGATC GACCCGGCGG TCGCCCACAA GTACGCGCGG
ATCGTGGCGA CCGGTCGCAG CGACTACCCG AACCAGATCA ACAACGTCCT CGCCTTCCCC
GGAATCTTCC GGGGAGCGCT CGATGTCCGG GCCAGCCGCG TGACGGAGGG CATGAAGCTC
GCCGCCGCGA GGGCGCTCGC CGCGGTGATC GCGGACGAGC TGGCCGAGGA TCTCATCATC
CCGAGCGTGT TCGACGACCG GGTCGCCCCG GCCGTCGCCG CGGCGACGGC CGCGGCCGCG
CGGGCGGACG GGGTGGCCCG CCGCTGA
 
Protein sequence
MTATTDQPTL GSTTSDFAGL SPRPSDDDPA FALHRGGKIN INATAPLNSR EDLSLAYTPG 
VARVCTAIAG TPALADDYTW RSNTVAVVTD GTAVLGLGDI GPAASLPVME GKAALFKRFG
GVDAVPLALA CTDVEEIVDT VARLAPSFGG INLEDISAPR CFAIERLLQD RVDIPVFHDD
QHGTAIVALA ALENAAKLTG RTLGDLRAVV SGAGAAGAAV ARILLAAGIG DIAVGDSRGI
LYAGRDNLTP SKAALAAETN RAGLAGSITD ALAGADVFLG LSAGQVPEEA VATMAPEAII
FAMANPDPEI DPAVAHKYAR IVATGRSDYP NQINNVLAFP GIFRGALDVR ASRVTEGMKL
AAARALAAVI ADELAEDLII PSVFDDRVAP AVAAATAAAA RADGVARR