Gene Francci3_3135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3135 
Symbol 
ID3903932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3707484 
End bp3708947 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content72% 
IMG OID637880456 
Product2-oxoglutarate dehydrogenase E2 component 
Protein accessionYP_482221 
Protein GI86741821 
COG category[C] Energy production and conversion 
COG ID[COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes 
TIGRFAM ID[TIGR02927] 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00928935 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTAT CTGTCACAAT GCCCCGGCTC GGGGAGAGTG TCTCCGAGGG AACCGTCACG 
CGGTGGCTGA AGCAGGAGGG TGAACGCGTC GAGGCCGACG AGCCACTGCT CGAAGTCAGC
ACCGACAAGG TCGACACCGA GATCCCCGCC CCCGCCTCCG GTGTCGTCAG CTCGATCAAG
GTCGCCGAGG ATGAGACCGT CGAGGTCGGC GTCGAGCTCG CGGTGATCGA CGACGGGTCC
GCGGGTGGCG GCACCGCACC GGCGCAGGCC ACGCAGGCGC CCGCCGCGCC GGAGCCGGAA
CCCGAACCGG CGCCGGAACC CGTGAAGCCG GCTGCCGCGG CACCGCCGCC TCCGCCCACT
CCGACGCCCG CACCCGCGCC GACGCCCGCA CCCGCGCCGA CGCCCGCACC CGCGCCGACG
CCCGCGCCGG CACCGGCGTT CGCTCGTCAG CCGGAGCTCA CCCCGGTCCC GGCCCAGACC
CCGGCCGCCG CCAACGGGAA CGGCGGCGGC ATCGGTCGCT ACGTCACCCC GCTGGTGCGC
AAGATGGCCG CCGAGCTCGG CGTGGACCTG GGAACCGTGG CGGGAACCGG GCCGGGTGGA
CGCGTCACCA AGCAGGACAT CCAGGACGCC GCGCGGCCGC GGGACGCGGC TCCCGCGGCG
CAGGAGACTC CCGCGACCCC CCCCACAGCC CCGACAGCCC CAGCCGTGGC GCCGACCGTG
GCGCCGACCG GCGCTGCCCC CGTCCGCGGC CAGACCGAGA AGCTCTCCCG CCTGCGGGCG
TTGGTCGCTC GCCGCATGGT CGAGTCGCTG CAGATCAGCG CGCAGCTCAC CACCGTGGTG
GAGGCCGACG TCACCCGGAT CGCGCGGCTG CGGGACCGGG CGAAGAGCGG CTTCCAGGCT
CGGGAGGGCA TCAAGCTGTC ATTCCTGCCG TTCTTCGCCT TGGCGACCTG TGCGGCGCTG
CGTGAGTTCC CGCAGCTCAA CTCCAGCATC GACGTCGAGG CGGGCACGGT CACCTACCAC
GGAGAGGAGA ACCTCGGCAT CGCCGTGGAC TCCGAGCGTG GTCTGGTCGT GCCCGTCATC
CACAACGCCG GCGATCTCAA CCTCATCGGG CTGGCCCGCA AGATCGACGA CCTGGCGAGC
CGCACCCGGG CCAACCGAAT CTCGCCCGAC GAGCTCGGCG GCGGCACCTT CACCCTGACG
AACACCGGCA GTCGAGGCGC CCTGTTCGAC ACGCCGATCA TCAACCAGCC GCAGGTGGGC
ATCCTCGGCA CCGGCATCGT GACGAAGAAG CCGGCCGTCG TCGACGATCC GGAGCTGGGC
GAGATCATCG CGGTTCGTTC GACGGTGTAC CTGTCCCTCA CCTACGACCA CCGCATCGTC
GACGGTGCCG ACGCGGCTCG CTTCCTGGCC TTCACCAAGC ACCGGCTGGA AAACGGAGCC
TTCGAAGCCG AACTCGGCCT GTAG
 
Protein sequence
MSVSVTMPRL GESVSEGTVT RWLKQEGERV EADEPLLEVS TDKVDTEIPA PASGVVSSIK 
VAEDETVEVG VELAVIDDGS AGGGTAPAQA TQAPAAPEPE PEPAPEPVKP AAAAPPPPPT
PTPAPAPTPA PAPTPAPAPT PAPAPAFARQ PELTPVPAQT PAAANGNGGG IGRYVTPLVR
KMAAELGVDL GTVAGTGPGG RVTKQDIQDA ARPRDAAPAA QETPATPPTA PTAPAVAPTV
APTGAAPVRG QTEKLSRLRA LVARRMVESL QISAQLTTVV EADVTRIARL RDRAKSGFQA
REGIKLSFLP FFALATCAAL REFPQLNSSI DVEAGTVTYH GEENLGIAVD SERGLVVPVI
HNAGDLNLIG LARKIDDLAS RTRANRISPD ELGGGTFTLT NTGSRGALFD TPIINQPQVG
ILGTGIVTKK PAVVDDPELG EIIAVRSTVY LSLTYDHRIV DGADAARFLA FTKHRLENGA
FEAELGL