Gene Francci3_2785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2785 
Symbol 
ID3904931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3279007 
End bp3280260 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content69% 
IMG OID637880107 
Productpyruvate dehydrogenase 
Protein accessionYP_481873 
Protein GI86741473 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGTCG TACGGGATCC GGAACTGATT ACCGGCGGGC CGATGCGGCG GCTCGAACCG 
GATTCGGTGG ACGGTCTCGC TCCGCATCGT TTTCCTGGCG CGTTCGGGGA GAACACCTCA
CCCGCGGCGG GTGAACTCAA CGACGGCGGC ATCCGACTGC TCGCCCCCGA TGGTTCGCTG
GTCGCCGATT CTCGGTTCTC GGTGCTCGCC GACCACGAGC TGCGCATGGA ATTCTACACC
TCCATGGTCC TGGCCCGCCG GCTGGACGAG GAGGCCACGG CCCTGCAACG TCAGGGTGAG
CTGGTCCTGT GGATCCCGCT GCGGGGGCAG GAGGCTGCGC AGGTCGGATC CGCCGCGGCC
GCGCGTCCCC GGGATTACCT GTTCCCCAGT TACCGGGAAC ACGCGGTCGC CTGGCACCGC
GGTGTCCCGG CGGTCGAGGT GATTCGGCTG CTGCGGGGAG TGAGTCACGA TGGCTGGGAC
ACGCACCGTC ACAATATGGC TAATTACACG ATTGTTCTGG CGGCCCAGAC GCTGCACGCC
GTCGGTTTCG GGATGGGGGT GCTCCTTGAC GGCGCAGCCG GCACCGGCAC CGGCACCGGC
GGTACCGGCG GCACCGGCAC CGGCGGCACC GATACCGATA CCGACATGGC GGTGATGGTG
TATCTCGGTG ACGGTGCGAT GAGTCAGGGT GATGCCAACG AGGCATTCGT GTGGGCGGCG
AGTTTCGGTG CGCCCGTCGT CTTCTTCTGC CAGAATAATC AGTGGGCGAT TTCCACGCCG
AGCCGACGGC AGTCCCCGGT GCGGTTGGCC CGCCGGGCCG ACGGGTTCGG CTTCCCCGGC
CTGCGCGTCG ATGGTAACGA CGTGCTCGCG GTGCACGCGG TGACCACTTG GGCGCTCGAC
CGTGCGCGCA GCGGTCGCGG CCCGGTGCTC ATCGAGGCCA ACACCTACCG GATGGCCCCG
CACACCACCT CCGACGACGC GACTCGGTAC CAGCCGCCGG ACGAGATCAC CGCCTGGCAG
GCCCGGGATC CCATCGAGCG CCTCCGTCGG CTGCTCGCCG CCGAGGTCGA GGCCGGCTGG
TTCGACGAGG TCCGCCGGCG GGCCGACGTC GCCGCGGCCG ATCTGCGTCG GGACACCCTC
GCCCTCGACC CGCCGGACCC CTCCTCGATG CTCGACCATG TCGCCGCGGA GGAGACGGCC
GATCAGCGCC GGCAACGGGA GTGGTTCGCC GCGTACCGCG ACTCGTTCCT CTGA
 
Protein sequence
MNVVRDPELI TGGPMRRLEP DSVDGLAPHR FPGAFGENTS PAAGELNDGG IRLLAPDGSL 
VADSRFSVLA DHELRMEFYT SMVLARRLDE EATALQRQGE LVLWIPLRGQ EAAQVGSAAA
ARPRDYLFPS YREHAVAWHR GVPAVEVIRL LRGVSHDGWD THRHNMANYT IVLAAQTLHA
VGFGMGVLLD GAAGTGTGTG GTGGTGTGGT DTDTDMAVMV YLGDGAMSQG DANEAFVWAA
SFGAPVVFFC QNNQWAISTP SRRQSPVRLA RRADGFGFPG LRVDGNDVLA VHAVTTWALD
RARSGRGPVL IEANTYRMAP HTTSDDATRY QPPDEITAWQ ARDPIERLRR LLAAEVEAGW
FDEVRRRADV AAADLRRDTL ALDPPDPSSM LDHVAAEETA DQRRQREWFA AYRDSFL