Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0471 |
Symbol | |
ID | 3706642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 505981 |
End bp | 507786 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637736980 |
Product | thiamine pyrophosphate protein |
Protein accession | YP_342524 |
Protein GI | 77163999 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA TAGTCAGCGA TTTTCTCTTA CACCGATTGA ACGAATGGGG CATCAACCGG ATTTACGGCT ATCCCGGGGA TGGCATCAAT GGAATCGTCG GCGCCCTGGA CCGGCTTCAA GACCGGATAG AGTTTATTCA AACCCGGCAT GAGGAAATGG CGGCTTTTAT GGCCTGCGCC CATGCTAAAT TTACCGGCGA AGTAGGCGTC TGCCTGGCCA CTTCAGGACC CGGGGCCATC CACCTGCTGA ATGGCCTTTA TGATGCCAAA CTGGACCATC AGCCGGTGGT GGCCATTGTG GGCCAACAAT CCCGCGCCGC CCTCGGGGGA GATTATCAAC AAGAAGTGGA TCTCATTTCC TTGTTCAAGG ATGTCGCCCA TGAATACGTA CATATGTGCG CTACTCCCGC CCAGGTGCGC CATTTAATTG ATCGCGCGGT CCGCATTGCC AAAACAGAGC GCACCGTGAC CTGCCTTATC TTTCCCAATG ACGTGCAGGA ATTGGAAGCC GTTGAGAAAC CGCCACGGGC TCACGGCACC ATCCATTCCA GCACCGGGTA TACGATCCCC CGGGTGATTC CTCATCAGCA AGATCTCCAA CAAGCTGCCG AGGTGCTCAA TAGAGGCAAA AAGGTCGCTA TCCTGGTGGG AGCTGGCGCT TTGGGGGCCA CGGATGAAGT TATTCAGGTC GCTGAACTGC TCGGCGCAGG GGTAGCGAAA GCCTTGCTGG GCAAGGGCGC TCTGCCTGAT GAACTTCCCT TCGTGACCGG CGCTATCGGC CTGCTGGGGA CTAAACCGAG CTGGGAATTA ATGGACGGCT GCGATACGCT GTTGATGATT GGTTCAAGTT TCCCCTATTC CGAATTCCTG CCGGAGGAAG GCCAAGCCCG GGGCGTGCAG ATTGACCTGG ACGGGCGCAT GCTGGGAATC CGCTATCCCA TGGAAGTGAA TCTGGTGGGA GACAGTGCGG AAACCCTGCG GGCCTTAATC CCTCTTCTCA CACGAAAAAC GAACCGGGCC TGGCGAGAGA AGATCGAAAA AGACGTGGCC CAATGGTGGC AGGTACTCGA AAGCCGCGCC ATGCACGATG CGGACCCTAT TAACCCCCAG CGGGTTTTCT GGGAGCTTTC TTCCCGACTG CCGGATAACT GCATCATCAG CAGCGACTCC GGTTCCGCCG CCAACTGGTA TGCCCGGGAT CTTAAAATCC GCCGAGGTAT GATGTGCTCT CTCTCGGGGG GCTTGGCGAC CATGGGCCCC GGCGTTCCCT ATGCCATTGC GGCCAAATTC GCCTTTCCGG ATCGGGTGGC TATTGCCCTT GTAGGGGATG GAGCCATGCA GATGAACGGC AACAGCGAAC TGGTCACCGC AGCTAAATAT TGGCAACAAT GGCAAGATCC CCGGCTGATT GTCTTGGTAC TCAATAATCG GGATCTCAAT CAAGTCACCT GGGAGCAGCG GGTGATGTCG GGCGATCCCA AGTTCGAAGG CTCCCAAAGC TTGCCCGACT TTCCCTATGC CCGTTATGCC GAACTACTTG GCTTTAAAGG CATCCGCGTT GATCGGCCGG AAAGTATCGG CCCCGCTTGG GAGGAAGCCC TAGCCGCTGA CCGACCCGTA ATACTAGAAG CGTATACCGA TGGGAACGTG CCGCCCTTGC CTCCCCATAT CAAGCTGGAA CAGGCCAAAG CCTATGTCTC CGCCTTGCTG CACCGAGATC CGGAAGCCAT CAACATTATT AAGCAGTCCA TCAAGGAAAT TAAAGAAAGC TGGTTTTCCA GTGGTCAAGA AGAAAAGGGC AATTAG
|
Protein sequence | MSQIVSDFLL HRLNEWGINR IYGYPGDGIN GIVGALDRLQ DRIEFIQTRH EEMAAFMACA HAKFTGEVGV CLATSGPGAI HLLNGLYDAK LDHQPVVAIV GQQSRAALGG DYQQEVDLIS LFKDVAHEYV HMCATPAQVR HLIDRAVRIA KTERTVTCLI FPNDVQELEA VEKPPRAHGT IHSSTGYTIP RVIPHQQDLQ QAAEVLNRGK KVAILVGAGA LGATDEVIQV AELLGAGVAK ALLGKGALPD ELPFVTGAIG LLGTKPSWEL MDGCDTLLMI GSSFPYSEFL PEEGQARGVQ IDLDGRMLGI RYPMEVNLVG DSAETLRALI PLLTRKTNRA WREKIEKDVA QWWQVLESRA MHDADPINPQ RVFWELSSRL PDNCIISSDS GSAANWYARD LKIRRGMMCS LSGGLATMGP GVPYAIAAKF AFPDRVAIAL VGDGAMQMNG NSELVTAAKY WQQWQDPRLI VLVLNNRDLN QVTWEQRVMS GDPKFEGSQS LPDFPYARYA ELLGFKGIRV DRPESIGPAW EEALAADRPV ILEAYTDGNV PPLPPHIKLE QAKAYVSALL HRDPEAINII KQSIKEIKES WFSSGQEEKG N
|
| |