Gene Francci3_2488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2488 
Symbol 
ID3904866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2934426 
End bp2935592 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content72% 
IMG OID637879818 
Productpyruvate dehydrogenase 
Protein accessionYP_481584 
Protein GI86741184 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID[TIGR03181] pyruvate dehydrogenase E1 component, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.915026 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.661007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAATCC TCGACGACAG CACCGCGCCC GGCTTCGTCG GGCCGCCGAG CTCCCACGGT 
CCGCGTCGTG ACCCGGCTCC GCTCCTGCCC GACCCCGAGC CGGTCCGGGT CCTCGGCACG
GAGGCGGCGG GGAAGGTCGA CACCGATCTG CTGCGCGTCC TGTACCACCG TCTGGTGCTG
GGACGGCGCT TCAACCAGCA GGCCACGACC CTGGCCAGGC AGGGCCGCCT CGCCGTGTAC
CCGGCGTCCA CCGGCCAGGA GGCATGCCAG ATCGCCGCGG CCATGGTGCT GCGGGAGTCG
GACTGGCTGT TCCCCAGCTA CCGCGACACG CTGGCGGTGG TGTCGAGGGG CGTGCGCCCG
GTGGACGCGC TGACGCTGAT GCGCGGGAAC GCGCACAGCG GCTACGATCC GCGGGAGCAC
CGGATCGCGC CGCTGTCGAC TCCGCTGGCC ACCCAGGCCT GCCATGCGGT GGGCCTGGCC
CACGCCGCTC GCCTGCGCGC GGCCTCGGAT CCGTGGGCGG CGGAGGACGT CGTGGCGCTT
GCCCTGATCG GCGACGGCGG CACCAGCGAG GGCGACTTCC ACGAGGCGCT GAACTTCGCC
GGGGTGCTGA ACGCGCCAGT GGTGTTCCTG GTACAGAACA ACGGCTATGC GATTTCGGTG
CCGCTGGCCC AGCAGTCCGC GGCGCCGACG CTGGCACACA AGGCGGTGGG CCATGGGATC
ATCGGTCGTT TGGTGGACGG CAACGACGCG CCCGCGGTGC ACGGTGTGCT GCGCGCGGCG
GTCGAGCACG CGCGGTCGGG TCGCGGCCCG GTGCTGGTCG AGGCGGTCAC CTACCGGCTG
GAGGCGCACA CCAACGCCGA CGACGCGACC CGCTACCGCA CCTCGGAGGA GGTCGCCGCC
TGGCAGGCCC GCGATCCGCT GACGCTGCTG GAGCGGCAGC TACGCAAGGC CGGTCTCCTC
GACGACGCCG GCGTCGCCGC AGTCGCGCGG GCCGCCGAGG AACTCGCCGC CGAGATGCGC
GCCCAGTTCG ATCGTGTGCC TGATCTCGAT CCGGGCTCAC TGTTCACGCA CGTCTATGCC
CAGCCGACCA GCCAGCTTCG TGAGCAGGCC GCCGAGCTGA TGGCCTGGCA GGCCGCGGAC
GCAGCCAAGA GCGACGACGC ACGATGA
 
Protein sequence
MTILDDSTAP GFVGPPSSHG PRRDPAPLLP DPEPVRVLGT EAAGKVDTDL LRVLYHRLVL 
GRRFNQQATT LARQGRLAVY PASTGQEACQ IAAAMVLRES DWLFPSYRDT LAVVSRGVRP
VDALTLMRGN AHSGYDPREH RIAPLSTPLA TQACHAVGLA HAARLRAASD PWAAEDVVAL
ALIGDGGTSE GDFHEALNFA GVLNAPVVFL VQNNGYAISV PLAQQSAAPT LAHKAVGHGI
IGRLVDGNDA PAVHGVLRAA VEHARSGRGP VLVEAVTYRL EAHTNADDAT RYRTSEEVAA
WQARDPLTLL ERQLRKAGLL DDAGVAAVAR AAEELAAEMR AQFDRVPDLD PGSLFTHVYA
QPTSQLREQA AELMAWQAAD AAKSDDAR