Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3211 |
Symbol | |
ID | 5671587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3788847 |
End bp | 3791216 |
Gene Length | 2370 bp |
Protein Length | 789 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641242105 |
Product | carbon-monoxide dehydrogenase (acceptor) |
Protein accession | YP_001507525 |
Protein GI | 158315017 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAACT CCACTGTCAC GACAGCCGCC GCCACGCCGA CCGCCGCGCG GTACGCGGGC ACCCGCGTGA ACCGGGTCGA GGACCCGCGG CTGCTTACCG GCCAGGGCAC CTATGTCGAC GATGTCAAGC GGCCGGGGAT GCTGCACGCG TGCTTTGTAC GCAGCCCGTT CGCCCGCGCG CGCATCATCG GCATCGACGC GTCGTCGGCG TTGGAGCTCG AGGGCGTGCA CGCCGTGTTC GTGGCCGAGG ATCTCAACCC GGGCGTCCAC GAGCACTGGT TCGGGATCAT CGGTCGCGAC GTCCCGGACA CCCCGCGGCC GGCGCTGGCC CAGGACGAGG TCCGCTTCGT CGGCGACCCG GTGGCCCTCG TCATCGCGGA GGACCGCTAC ATCGCGGAGG ACGCGGTTGA GCTGGTGGAG GTGGATTACG ACCCGCTGCC TCCGGTCGCC TCCTACATCA GCGCCCAGGA GTCGACCGAG CTCGTGCACG AGGCCTACCA GAACAACCGG GCCGGACTAC TCAAAGGCGG CGACCAGGGG CGGGTCGACG AGGCGTGCGC CGGCGCGGCG TTCGTCGTCG AGGAGACGAT CTACCAGCAG GGGTACGCGC CCGTCCCGAT GGAGACCCGC GGGATCGTCG TGGAGTGGTC GGGTGGCGAG CTCACGATCT GGGCGGCGTC CCAGGCCCCC CATGAGCTGC GGGCCTTCGC CGCGCGGTAC CTCGGCCTGC CGGAGAACCG GGTTCGCGTG ATCATGCGGG ACGCGGGCGG GGGCTTCGGG CAGAAGGTCA ACCCGATGCG CGAGGACATG TGCATGCTGC TCGCCGCCCG GAAGGTTCCC GCGGCGCTGA AGTGGATCGA GGACCGGCGC GAGAACCTCA TGGCCGCGAA CTCGGCCCGC CACGAGCACG GTGCCGCCCG TCTCGCGTTC GACGCGGAGG GCAGGATCGT CGCGGCGCAG ATCGACCATG TCCAGGACGT GGGTTCCTAC CCCACACCGT GGCCGGTGGG TACCGCGGTG GCGGTCTGCA TGATGTTCCC CGGGCCGTAC CGCATCCCGG TCGCGGCGTG GTCGTCGGCG TCGGTGTTCT CGAACACGCT GGGTCGTGGC GCCTACCGGG GGCCGTGGCA GTACGAGTCG GTCGCTCGCG AGGTACTGCT GGACGTCGCC GCGCGCCGGA TGGGCCTCGA CCCGGTCGAG CTGCGCCGGC GCAACTTCCT GGCCCGCGCG GACCTGCCGT ACGTGAACCC GTGCGGCATG CCGTACGACC ACATGTCCCC GCGTGAGGTG TTCGAGAAGG CACTGGAGCA CTTCGACTAC GACGCGTTCC GCCGTGAGCA GGCCGAGGCC AGGGCGAACG GCCGGTACAT CGGTATCGGG ACGTGCAGCT ACGTGGAGCC CACCACGACC GGGATGTCCT TCTACGCGAC CGAGGGCGCC ACGATCCGCA TCGAGCCGAG TGGCACGGTC AACGTGTACC TCGCTGGCGG ATCGACGGGC AACAGCCTGG AGACCACCGC CGTACAGATC GCGGCCGACG CGCTGGGTGT CGACATCAGG GACGTCAACA CGATCCAGGG AGACACCGCC GTCACGCCCT TCGGCGGGGG CACCGGCGGC AGTCGCAGCG GTTCGATGAT CGCCGGGGCG GTCGGAGTCA CCGCGGGCGA ACTACGCGAG CGCATCATCG CCATCGCGGC GCACCGTCTG GAGGCGGCGG CCGAGGACAT CGAGCTCGCG GACGGGCGGG CCAATGTGCG TGGTACCCCC TCGATCGGCA TGTCGCTCGC CGAGATCGCG AATGTCGCGT ATTTCGATCC CGCCGGGCTG CCGCCCGGTG TGCAGCCCGG ATTGGAAGTC AGCGGCCGGT ATCAGGCGCA GGCGCCGATG CTGTGGGCCA ACGCCACGCA CATCTGTACC TGCGAGGTGG ACACGGAGAC CGGGGTCGTG ACCTTCCTTC GGTACCTCGT CAGCGAGGAC TGCGGCCCGC GGATCAACCC GAGCATCGTC GAGGGCCAGG TCGACGGTGG AACCGTGCAG GGCATCGGCG GGGCTCTCTA CGAGGACCTG GCCTATGACG AGGACGGCAA CCCGGTCGCG ACGACGTTCA TCGACTACCT GTTGCCGACC ATCGCCGAGA TGCCGCTCAT CGAGCACGTG CACCTCGAGA CCCCGGGACC GGGGCCGGGC GGCTACAAAG GCGCCGGTGA GGGCGGCGCG ATCGGCGCGC CCCCGGCAGT CATCAACGCG GTCGCCGACG CGCTCGCGCC GTTCGGGGTC TCCATCACCC ACCTTCCGCT CACGCCCGCG ACGATTGTCG CGCTTCTCGA CGAGGGGAAG TCCGGCCAGG AGGACCAGCA GCCTCACTGA
|
Protein sequence | MGNSTVTTAA ATPTAARYAG TRVNRVEDPR LLTGQGTYVD DVKRPGMLHA CFVRSPFARA RIIGIDASSA LELEGVHAVF VAEDLNPGVH EHWFGIIGRD VPDTPRPALA QDEVRFVGDP VALVIAEDRY IAEDAVELVE VDYDPLPPVA SYISAQESTE LVHEAYQNNR AGLLKGGDQG RVDEACAGAA FVVEETIYQQ GYAPVPMETR GIVVEWSGGE LTIWAASQAP HELRAFAARY LGLPENRVRV IMRDAGGGFG QKVNPMREDM CMLLAARKVP AALKWIEDRR ENLMAANSAR HEHGAARLAF DAEGRIVAAQ IDHVQDVGSY PTPWPVGTAV AVCMMFPGPY RIPVAAWSSA SVFSNTLGRG AYRGPWQYES VAREVLLDVA ARRMGLDPVE LRRRNFLARA DLPYVNPCGM PYDHMSPREV FEKALEHFDY DAFRREQAEA RANGRYIGIG TCSYVEPTTT GMSFYATEGA TIRIEPSGTV NVYLAGGSTG NSLETTAVQI AADALGVDIR DVNTIQGDTA VTPFGGGTGG SRSGSMIAGA VGVTAGELRE RIIAIAAHRL EAAAEDIELA DGRANVRGTP SIGMSLAEIA NVAYFDPAGL PPGVQPGLEV SGRYQAQAPM LWANATHICT CEVDTETGVV TFLRYLVSED CGPRINPSIV EGQVDGGTVQ GIGGALYEDL AYDEDGNPVA TTFIDYLLPT IAEMPLIEHV HLETPGPGPG GYKGAGEGGA IGAPPAVINA VADALAPFGV SITHLPLTPA TIVALLDEGK SGQEDQQPH
|
| |