Gene Francci3_0058 details


Gene Information

Locus tag: Francci3_0058
Symbol:
ID: 3905393
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 73539
End bp: 74738
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 71%
IMG OID: 637877388
Product: pyruvate dehydrogenase
Protein accession: YP_479181
Protein GI: 86738781
COG category: [C] Energy production and conversion
COG ID: [COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit
TIGRFAM ID: [TIGR03181] pyruvate dehydrogenase E1 component, alpha subunit
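
The coordinate and length fields listed above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 73539
END_BP = 74738
GENE_LENGTH_BP = 1200
PROTEIN_LENGTH_AA = 399


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should be True if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 74738 - 73539 + 1 gives 1200 bp, and 1200 / 3 - 1 gives 399 aa, which agrees with the listed values.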


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCTGC TCACCCCGAC CGGCCACCGG ATCACCACCG CGTCCACGAC GACGGCAGCC 
GGACCCGACG CGCCGGGACC CGATCCCTCC AGAGCCCGCG CCGGCGGGGT GGACGCCGCG
GGAGCGGACC CGCCCCGGAC AGCCGTCGAT CTGGCCGCCC AGATCGCCGA CCTGACCGAC
GACGACCTGT ACGGGCTGTA TCGGGACATG GTGTTCATCC GCCGGTTCGA TGAGGAGGCG
ACCAGCCTGC AGCGCCAGGG CGAGCTGGGG CTGTGGGCGA GCCTGCGGGG CCAGGAGGCG
GCCCAAGTCG GATCCGGCCG GGCGCTGAGG CCCGGCGACA TGGTGTTCCC GTCGTACCGG
GAACACGGGG TGGCCTGGTG CCGGGGAGTC AGGCCGAGGG AAATCCTGGC GATCTATCGC
GGCACGACGC TCGGTGGCTG GGATCCGGGC ACCCACGGGT GCGCCCTCTA CTCGATCGTC
GTGGGCTCGC AGGCACTGCA TGCCACCGGC TACGCGATGG GAATCGCCCG GGACGGCACC
AACGACGCGG CCATTGCGTA CTTCGGTGAC GGGGCGTCAA GCGAGGGCGA TGTCAGCGAG
GCGTTCGGAT GGGCGAGCGT GTTCGCCGCC CCGCTGGTGT TCTTCTGCCA GAACAACCAG
TGGGCCATCT CGGCACCTGC CCGGCGCCAG AGCCGGATCG AGATCGTCCA CCGGGCCGCG
GGCTTCGGTT TCCCGGGCGT GCGGGTGGAC GGCAACGACG TGCTGGCGTG TCTCGCGGTG
ACCCGGTGGG CCCTGGCGAC CGCGCGTGCC GGGCGTGGTC CGGTGCTGGT GGAGGCCGTG
ACCTACCGGA TGAATCCACA CACCACGGCA GACGATCCCA GCCGCTACCG TCCGAAGGGT
GAACTGGACA TGTGGCGTCG GCGTGATCCG TTGGACCGGA TGCGCGCCTA CCTGACGGCG
CGGGGCCTGC TCACCGAGGA GAGTCTGCGG CAGCTGGCCA TCGAGGCGGA CTCCTTCGCG
CACGAGCTGC GCGCGCAGTG CGTCGCGCTG CCGGATCCGA TCCCGGCCAG CCTGTTCGAC
CATGTGCAGG TGGCTGAGAA TGAACTCGTG ACGGCCCAGC GATCGGCGTT GGCGGCCATG
TTCGACGTGG AGGGGCCCGG CGTGGAGGAC GAGATGGGCG TACCCGAGGG GAACCGGTGA
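
The GC content reported for this gene is presumably the percentage of G and C bases in the gene sequence above. A minimal sketch of that calculation follows. The helper name `gc_content` is hypothetical, and the function ignores the spaces and line breaks used to lay out the sequence in blocks of ten.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Whitespace and any other characters are skipped, so the block-formatted
    text can be pasted in as-is.
    """
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Demonstration on the first ten bases of the gene only; paste the full
# gene sequence to compare against the value listed in the record.
print(gc_content("ATGCAGCTGC"))  # 60.0
```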
 
Protein sequence
MQLLTPTGHR ITTASTTTAA GPDAPGPDPS RARAGGVDAA GADPPRTAVD LAAQIADLTD 
DDLYGLYRDM VFIRRFDEEA TSLQRQGELG LWASLRGQEA AQVGSGRALR PGDMVFPSYR
EHGVAWCRGV RPREILAIYR GTTLGGWDPG THGCALYSIV VGSQALHATG YAMGIARDGT
NDAAIAYFGD GASSEGDVSE AFGWASVFAA PLVFFCQNNQ WAISAPARRQ SRIEIVHRAA
GFGFPGVRVD GNDVLACLAV TRWALATARA GRGPVLVEAV TYRMNPHTTA DDPSRYRPKG
ELDMWRRRDP LDRMRAYLTA RGLLTEESLR QLAIEADSFA HELRAQCVAL PDPIPASLFD
HVQVAENELV TAQRSALAAM FDVEGPGVED EMGVPEGNR
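
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python, assuming the gene sequence is given 5' to 3' on the coding strand. It uses the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# NCBI codon ordering: bases in the order T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first. Codons with ambiguous bases raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGCAGCTGC TCACCCCGAC CGGCCACCGG"))  # MQLLTPTGHR
```

The last four codons of the gene, GGG AAC CGG TGA, translate to GNR followed by a stop, matching the end of the protein sequence.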