Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4699 |
Symbol | |
ID | 5673041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5613718 |
End bp | 5616051 |
Gene Length | 2334 bp |
Protein Length | 777 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641243556 |
Product | carbon-monoxide dehydrogenase (acceptor) |
Protein accession | YP_001508972 |
Protein GI | 158316464 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGATTCG TCGGCGAGCG GGTGGCAAGA GTCGAGGACC CGCGGTTGTT GACGGGCCGT GGAAAATTCG TCGACGACGT GACGTTGCCG CGCATGCTCC ATGTCGCTTT CGTACGCAGC CCCATGGCCC ACGCTCGGAT CAACAGCATC GACACCACGG CGGCGCGAGC GGTGCCCGGC GTCGTGGCGG TGTTCACCGG CGCCGACCTG GTAGCCGGCT GCAATCCCCT CACGGGATTG CGCAACGGAC CGAACTGGCC GACGTTCCGC GCCTTGCCCA CGGACAAGGC AAGGTTCGTC GGCGATCCGC TCGTGATGGT CGTGGCCGAC AGCCGGTACA TAGCCGAGGA CGCGTGCGAG CTGGTCGACA TCGACTACGA GCCGCTTCCG GCGGTGACCG GTTACGAAGC TGCCCTCGAC CCCGCATCAC CGGCCATCTT CGACGAACTC GGCGACAACA TCGTGGCGAC GAGCCCACCC GTCACCCTGG GTGACATCGA TGCCGCCTTC GCCGAGGCCG ACCAGGTGGT GCGGGCGACC TTGCGCCAAC ACCGTGTCGC CAACGTCCCC CTGGAAACGA GGGGCGGGAT CGCCGACTAC GACCCGGCCT CGGGGGAGCT GACCTTCATC GCCTCGACCC AGACCCCGCA TGGCCTGCGG CAGGCACTGG CCCAGGCTCT CGACCATCCC CTGGAACGGC TGCGGGTGCT TGCGGGCGAC GTGGGTGGTG GCTTCGGCCT CAAGGGTGTG GTGGGGCGCG AGGACTTCTG CATCGCGCTG GCCAGCAAGC GGCTCGGCCG TCCGGTCAAA TGGGTGGAGG ACCGCAACGA GCACCTTCTC GCGTCCGGCC ACGCCCGCGA AGAGAAGATC GACGCGGAAC TCGCGCTGAA GGCCGACGGC ACGCTGCTCG GGTTGAAGGT GAAGCTCGTC CTGGACGGCG GCGCCTACCC GGCGGTGCCG TTCAGCTGCA CGATCTACCC TGAGATGATC TGTACGTCGC TGCCTGGCCC TTACCACATC AAAGCCTACA AATATGAAAG CGCTGTCGTC GCCAGCAACA AGGCTACGTA CGTCGCCTAC CGGGGCCCGT GGGAGATGGA GGTGTGGACG CGGGAACGGC TGCTTGACAT CGCGGCCCAC GAGCTGGGGC TCGACCCCGC CGACATCCGG CGCCGGAACC TGGTGGCCGG CGAGCCCGGC GACCAGATCA TCACTGGCCG AGGCCTGGAG GGCATCACAT CACGCCGCTC CCTTGAGCAG GCCCTGGACC TCGTCGATTA CGACGGGTTC CGGAAGGAGC AGGTAGCCGC GCGTGCCGCG GGCCGCTACC TCGGCATCGG GTTCGCCATC TTCATGGAAG CGGCGCCCGG GCCGCCGGAG TTGCGGGGCA AAGGTGCGCC CTTCGGCGGC GAACAGGCGA AGGCGGCGCT CCAGGCCGAC GGCCATCTCC TCGTGACCAC CGGTCAGGCC CCGCACGGCC AGGGCCACGA GACGACGCTC GCCCAGGTCG CGGCCGACCA GATGGGTATT CCGCTGGACC ACGTGCGGGT GGTCCACGGC GACACGCGTC AGACCCCGTT CAACCTCATC GGCACCGGCG GCAGCCGGGC CGGCACGTGG GCGACGGGTG CCGTGATCGT GACGACCAGG CGGCTGAAGG AAAAGGTGCT GGACATCGCG GCCCACCGGC TGCAGATCGA TCCTGGCGAC CTCGACATCG TCGACGGGAT GGTGACGCCG AAGGGGGCGC CGGACAAGGC GATCCCGCTG GCCGACGTGG CGAAGCTGGC GATGATGGCG CCCATGTTCC TCCCGCCGGG CACCGACGTG TCCTTGACGG CGCAGGAGCG CTTCGACGGG TCCGCCGTCA CCAGCAGCGG TTGGTCGGGC GGCGCACATG TCTGCACCGT CGAGGTGGAC ATCGCCACGG GGCAGGTCCG GATACTTCGT TACGTCGTCG TCGAGGACTG CGGACGGGTG GTCAACCCGG CGATCGTCGA GGGCCAGATC TGCGGCGGTA TCGCGCAGGG CATCGGCGAG GTGCTGTACG AGAACGCCGC CTACGACGCC GACGGCAACT TCCTCGCCGC CACGTTCATG GACTACCTGC TCCCGACCGC CGCCGAGATC CCGCACATCG AGATCGAACA CATCGATTCC GCAGGCCTCG GCGACTTCGA CTTCCACGGT GTCGGCGAAG GCGGGGCGCT GGCCGCCCCG GCGACGCTGA CCAACGCCGT GGCGGACGCT CTGCTGCCCT TCGGAGCCCG GGTGGTCGAC CAGTACCTTC CCCCAGCGAA GATCCTCGAG CTCGCGGGCG TGATCTCTGC CTGA
|
Protein sequence | MRFVGERVAR VEDPRLLTGR GKFVDDVTLP RMLHVAFVRS PMAHARINSI DTTAARAVPG VVAVFTGADL VAGCNPLTGL RNGPNWPTFR ALPTDKARFV GDPLVMVVAD SRYIAEDACE LVDIDYEPLP AVTGYEAALD PASPAIFDEL GDNIVATSPP VTLGDIDAAF AEADQVVRAT LRQHRVANVP LETRGGIADY DPASGELTFI ASTQTPHGLR QALAQALDHP LERLRVLAGD VGGGFGLKGV VGREDFCIAL ASKRLGRPVK WVEDRNEHLL ASGHAREEKI DAELALKADG TLLGLKVKLV LDGGAYPAVP FSCTIYPEMI CTSLPGPYHI KAYKYESAVV ASNKATYVAY RGPWEMEVWT RERLLDIAAH ELGLDPADIR RRNLVAGEPG DQIITGRGLE GITSRRSLEQ ALDLVDYDGF RKEQVAARAA GRYLGIGFAI FMEAAPGPPE LRGKGAPFGG EQAKAALQAD GHLLVTTGQA PHGQGHETTL AQVAADQMGI PLDHVRVVHG DTRQTPFNLI GTGGSRAGTW ATGAVIVTTR RLKEKVLDIA AHRLQIDPGD LDIVDGMVTP KGAPDKAIPL ADVAKLAMMA PMFLPPGTDV SLTAQERFDG SAVTSSGWSG GAHVCTVEVD IATGQVRILR YVVVEDCGRV VNPAIVEGQI CGGIAQGIGE VLYENAAYDA DGNFLAATFM DYLLPTAAEI PHIEIEHIDS AGLGDFDFHG VGEGGALAAP ATLTNAVADA LLPFGARVVD QYLPPAKILE LAGVISA
|
| |