Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Nther_0845 |
Symbol | |
ID | 6315656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 890132 |
End bp | 891469 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642643218 |
Product | biotin carboxylase |
Protein accession | YP_001917018 |
Protein GI | 188585473 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit |
TIGRFAM ID | [TIGR00514] acetyl-CoA carboxylase, biotin carboxylase subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.717453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 0.153916 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAATA AAATTTTGGT GGCTAATAGA GGTGAAATTG CTGTTAGAAT AATCAGAGCT TGTAGAGAAA TGGGGATTGA AACAGTAGCT GTTTATTCTA CTGTAGATGA AGAAGCTCTA CACGTCCAAA TGGCAGATGA ATCCGTTTGT ATCGGCACGG CAACCGATAA CTGCTATACG AATATTAGCA GGATATTGAC TGCTGCGGAA ATCACTGATG CTGAAGCCAT CCATCCTGGA TTTGGGTTTT TATCAGAAAA CAGTCGTTTT GCCGATGTCT GTAGAAAATG TAATATTACA TTTATTGGTC CCAGGCCAGA GGTGATTGAG AACATGGGAA ACAAGTCCAA TGCCAGGAAG TTAATGGCAA ATGCTGATGT TCCAGTGGTC CCAGGTTCTA AGAAACCTTT AACAGATGAG CAAGATGCTG TACAGATGGC TGACGAACTT GGATATCCGG TGATGATTAA AGCATCAGCC GGTGGTGGTG GCAGAGGTAT ACGAATTATC AGAAGCCAAG CCGAACTGTT AAATGCTCTT GGTACAGCCA AACAAGAAGC CGCTACTTTT TTCGGTGATG ACACAATGTA TATGGAAAAG TATTTAGAAA AGCCAAGACA CGTGGAAATA CAAATACTTG CTGATAAATA CGGGAATGTA CTGCACCTAG GGGAAAGAGA TTGCTCTATC CAGAGAAGGA ATCAAAAAGT TTTAGAAGAA GCCCCATGTC CAGTGATGAC TGAGGAACTC AGAGAAAAAA TGGGACAAGC TGCAGTCAGG GCAGCTAAAT ATGTGGGATA TCAAAATGCG GGAACAATTG AATTTTTATT AGACAAACAC AACAATTTCT ATTTTATGGA AATGAATACC AGGATTCAAG TTGAACACCC GGTTACAGAA CAAATAGTGG GACTGGATTT GATAAAGGAG CAAATAAAAA TTGCTGCAGG ACAAAGGCTC GAAAAAACTC AAGAGGAGAT AAATTTAAAC GGACATTCCA TCGAATGTAG AATCAATGCC GAGGATCCGC AAAAAAATTT CATGCCATGT CCGGGTAAGG TAAACGGGCT ATTTATACCT AGTGGTTTAG GAGTAAGAGT TGATTCTTTG TTGTATGATG GCTATGAGAT ACCATCAACC TACGATTCAA TGATAGCAAA ACTGATAGTC CATGGCAAAG ATAGGGAAGA AGCAATTAAT AAAATGAAGA GAGCCTTAGG AGAATTTGTG ATACTTGGTG TTAAAAATAA TGTTGAATTC CACTTAAATA TCTTAAATAA TGATGATTTT GTGAAAGGCA TTTATGATAC TAATTTCATA AATGACAAAA TCTTTTAA
|
Protein sequence | MFNKILVANR GEIAVRIIRA CREMGIETVA VYSTVDEEAL HVQMADESVC IGTATDNCYT NISRILTAAE ITDAEAIHPG FGFLSENSRF ADVCRKCNIT FIGPRPEVIE NMGNKSNARK LMANADVPVV PGSKKPLTDE QDAVQMADEL GYPVMIKASA GGGGRGIRII RSQAELLNAL GTAKQEAATF FGDDTMYMEK YLEKPRHVEI QILADKYGNV LHLGERDCSI QRRNQKVLEE APCPVMTEEL REKMGQAAVR AAKYVGYQNA GTIEFLLDKH NNFYFMEMNT RIQVEHPVTE QIVGLDLIKE QIKIAAGQRL EKTQEEINLN GHSIECRINA EDPQKNFMPC PGKVNGLFIP SGLGVRVDSL LYDGYEIPST YDSMIAKLIV HGKDREEAIN KMKRALGEFV ILGVKNNVEF HLNILNNDDF VKGIYDTNFI NDKIF
|
| |