Gene Information |
Locus tag | Francci3_3135 |
Symbol | |
ID | 3903932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3707484 |
End bp | 3708947 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 637880456 |
Product | 2-oxoglutarate dehydrogenase E2 component |
Protein accession | YP_482221 |
Protein GI | 86741821 |
COG category | [C] Energy production and conversion |
COG ID | [COG0508] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide acyltransferase (E2) component, and related enzymes |
TIGRFAM ID | [TIGR02927] 2-oxoglutarate dehydrogenase, E2 component, dihydrolipoamide succinyltransferase |
|
|
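Note: the coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but the numbers agree with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 3707484
end_bp = 3708947

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 1464 487
```

Both results match the Gene Length (1464 bp) and Protein Length (487 aa) fields of the record.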
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00928935 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGTAT CTGTCACAAT GCCCCGGCTC GGGGAGAGTG TCTCCGAGGG AACCGTCACG CGGTGGCTGA AGCAGGAGGG TGAACGCGTC GAGGCCGACG AGCCACTGCT CGAAGTCAGC ACCGACAAGG TCGACACCGA GATCCCCGCC CCCGCCTCCG GTGTCGTCAG CTCGATCAAG GTCGCCGAGG ATGAGACCGT CGAGGTCGGC GTCGAGCTCG CGGTGATCGA CGACGGGTCC GCGGGTGGCG GCACCGCACC GGCGCAGGCC ACGCAGGCGC CCGCCGCGCC GGAGCCGGAA CCCGAACCGG CGCCGGAACC CGTGAAGCCG GCTGCCGCGG CACCGCCGCC TCCGCCCACT CCGACGCCCG CACCCGCGCC GACGCCCGCA CCCGCGCCGA CGCCCGCACC CGCGCCGACG CCCGCGCCGG CACCGGCGTT CGCTCGTCAG CCGGAGCTCA CCCCGGTCCC GGCCCAGACC CCGGCCGCCG CCAACGGGAA CGGCGGCGGC ATCGGTCGCT ACGTCACCCC GCTGGTGCGC AAGATGGCCG CCGAGCTCGG CGTGGACCTG GGAACCGTGG CGGGAACCGG GCCGGGTGGA CGCGTCACCA AGCAGGACAT CCAGGACGCC GCGCGGCCGC GGGACGCGGC TCCCGCGGCG CAGGAGACTC CCGCGACCCC CCCCACAGCC CCGACAGCCC CAGCCGTGGC GCCGACCGTG GCGCCGACCG GCGCTGCCCC CGTCCGCGGC CAGACCGAGA AGCTCTCCCG CCTGCGGGCG TTGGTCGCTC GCCGCATGGT CGAGTCGCTG CAGATCAGCG CGCAGCTCAC CACCGTGGTG GAGGCCGACG TCACCCGGAT CGCGCGGCTG CGGGACCGGG CGAAGAGCGG CTTCCAGGCT CGGGAGGGCA TCAAGCTGTC ATTCCTGCCG TTCTTCGCCT TGGCGACCTG TGCGGCGCTG CGTGAGTTCC CGCAGCTCAA CTCCAGCATC GACGTCGAGG CGGGCACGGT CACCTACCAC GGAGAGGAGA ACCTCGGCAT CGCCGTGGAC TCCGAGCGTG GTCTGGTCGT GCCCGTCATC CACAACGCCG GCGATCTCAA CCTCATCGGG CTGGCCCGCA AGATCGACGA CCTGGCGAGC CGCACCCGGG CCAACCGAAT CTCGCCCGAC GAGCTCGGCG GCGGCACCTT CACCCTGACG AACACCGGCA GTCGAGGCGC CCTGTTCGAC ACGCCGATCA TCAACCAGCC GCAGGTGGGC ATCCTCGGCA CCGGCATCGT GACGAAGAAG CCGGCCGTCG TCGACGATCC GGAGCTGGGC GAGATCATCG CGGTTCGTTC GACGGTGTAC CTGTCCCTCA CCTACGACCA CCGCATCGTC GACGGTGCCG ACGCGGCTCG CTTCCTGGCC TTCACCAAGC ACCGGCTGGA AAACGGAGCC TTCGAAGCCG AACTCGGCCT GTAG
|
Protein sequence | MSVSVTMPRL GESVSEGTVT RWLKQEGERV EADEPLLEVS TDKVDTEIPA PASGVVSSIK VAEDETVEVG VELAVIDDGS AGGGTAPAQA TQAPAAPEPE PEPAPEPVKP AAAAPPPPPT PTPAPAPTPA PAPTPAPAPT PAPAPAFARQ PELTPVPAQT PAAANGNGGG IGRYVTPLVR KMAAELGVDL GTVAGTGPGG RVTKQDIQDA ARPRDAAPAA QETPATPPTA PTAPAVAPTV APTGAAPVRG QTEKLSRLRA LVARRMVESL QISAQLTTVV EADVTRIARL RDRAKSGFQA REGIKLSFLP FFALATCAAL REFPQLNSSI DVEAGTVTYH GEENLGIAVD SERGLVVPVI HNAGDLNLIG LARKIDDLAS RTRANRISPD ELGGGTFTLT NTGSRGALFD TPIINQPQVG ILGTGIVTKK PAVVDDPELG EIIAVRSTVY LSLTYDHRIV DGADAARFLA FTKHRLENGA FEAELGL
|
| |
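Note: the protein sequence follows from the gene sequence by codon-wise translation. The sketch below is a minimal, dependency-free illustration rather than the method the database itself uses. It assumes the amino acid assignments of translation table 11 are the same as those of the standard code (the two tables differ in which codons may act as start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and only the first 30 bases of the gene sequence are used as input.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces used to group bases in blocks of ten and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence listed above.
gene_prefix = "ATGTCTGTAT CTGTCACAAT GCCCCGGCTC"
protein_prefix = translate(gene_prefix)
print(protein_prefix)  # MSVSVTMPRL
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field; the figure of 72% in the record refers to the whole gene, not to this 30-base prefix.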