Gene Information |
Locus tag | Swit_3976 |
Symbol | |
ID | 5197931 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sphingomonas wittichii RW1 |
Kingdom | Bacteria |
Replicon accession | NC_009511 |
Strand | - |
Start bp | 4363731 |
End bp | 4365911 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640583532 |
Product | aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding |
Protein accession | YP_001264459 |
Protein GI | 148556877 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | |
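
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4363731, 4365911)
print(length, protein_length(length))  # 2181 726
```

Under these assumptions the result agrees with the Gene Length (2181 bp) and Protein Length (726 aa) fields.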
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.218094 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.540973 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGACCATGG GCGCGCAGCA CACCACCCTC TCCCGCCGGC CCCTCTCCCG CCGTGACGCG CTCAAGGGCG CGGGCGCGCT CGTCATCGGC CTGAGCCTGC CGTTCGGGGC GGGCAAGGCG CTGGCGCAGG CCAAGGGCCC GGCCCCGGTC GATCCCAACG CCTTCATCCG CATCGGCGCG GACGACGTCG TGACGGTGAT CGTCAAGCAT ACCGAGATGG GCCAGGGCCC CTATACCGGC CTCGCCACCA TCGCCGCCGA GGAGCTCGAC GCCGACTGGT CGCAGATGCG GGTCCAGAGC GCCCCCGCCA ACGTCGCGAT CTACGCCAAC ACCCTGCTCG GCGCGCAGCT CACCGGCGGC TCGACCGCGA TCGCCAACAG CTTCGACCAG CTCCGCCGCG CCGGCGCGAT GGCGCGGGCG ATGCTGGTCC AGGCCGCCGC CGCCGAATGG AAGGTGTCGG CCGCCGACAT CGCGGTGGGC AAGGGCGTCA TCAGCCACAA GGCAAGCGGC CGTTCGGCCG GCTTCGGCAA ATTCGCCGCG GCGGCGGCCA AGCTGCCGGC CCCGAAGCAG GTCAGGCTCA AGGACCCGTC GGCCTTCACC CTGATCGGCA AGGACCAGTC GGGCCGCCGC GTCGACAGCG CCGACAAGTC GACCGGCCGC GCCAAGTTCA CCATCGACAT GACCGCGCCG GACATGCTGA CCGTCCTGGT CGCGCGCAGC CCGCGCTTCG GCGGGACGGT GAAGAGCTTC GACGCCGCCG AGACGCTGAA GGTCAAGGGC GTCGTCGCCG TCCGCCAGAT ATCGAGCGGG GTCGCCGTCT ATGCGCGCGG CATGTGGCCC GCGATCAAGG GCCGCAAGGC GCTCAAGGTC GCCTGGGACG ACAGCAAGGC CGAGATGCGC GGCACCGACG AGATGATCGC CGCCTATCTC GAACAGACGC GGAAACCCGG CCGCGTCCAC CATGCGAGCG GCGACGTCGA CAAGGCGCTG GCCGCCGGCG GCGAGGTGAT CGAGGCGGAC TATGCCTTCC CCTATCTCGC CCATGCGCCG ATGGAGCCGC TCGACGGCTT CATGGTCTGG GACGGCACCA CCGCCCGCGC GCGCTTCGGA TCGCAGGGAC AGACGATCGA CCAGGGCGCG ATCGCCAGGG TGTTCGGCAT CCCGATGGAG AAGGTCGAGA TCGAGACGCT GCTCGCCGGC GGCAGCTTCG GCCGCCGCGC CCAGGCGACC GCGCACCTCG CCGCCGAGCT GGCCGAAGTC GCCAAGGCGA TGCCGGTCGG CACCCCGGTC AAGCTGGTGT GGACGCGCGA GGACGACATC CATGGCGGCT ATTACCGCCC GCTGTTCGTC CACCGCTTCC GCGGCGCGGT GAAGGACGGC CGGATCACCG CCTGGTCGAA CACGATGATG GGCCAGTCCT TCATGATCGG ATCGCCGTTC GAATCCTTCG CCGTCAAGGA CGGCATCGAT TCGATCATGA GCGAGGGCGC CGCCGAGCTG CTCTACGACA TCGCCGACTT CCGCTGCGAC GTCCATGTCG CCCAGTCGCC GGTGCCGACA TTGTGGTGGC GGTCGGTCGG CCACACCCAT ACCGGCTATG CCGTCGAATG CTTCGTCGAC CAGCTCCTGA AGGCGGCGGG GCAGGACCCG GTGGCCGGGC GGCTGGCGAT GATGGGCAAG GCGCCGCGCG CAGCAGGCGC GCTCAAGGCG GTCGCCGAGC TGGCGCAGTG GAAGGGGTCG CAGGCGGCGA ACGGCCGCGC CCGCGGCGTC GCGGTGGTCG AGAGCTTCAA CACGTTCGTC 
GCCCAGATCG CCGAGGTCTC GGTCGGTGCC GACGGCGAGC CGCGCGTCCA CAAGGTGTGG GCGGCGGTCG ACTGCGGCAT CGCGGTGAAC CCCGACATCA TCCGCGCCCA GGTCGAGGGC GGGATCGGCT ATGCGCTGGG CCACGCCCTC TATGCCGAGG TGCCGCTGGT CGAAGGCGTT CCCGCCGTCT CCAACTTCAA CGACTATCGA TCGCTGAGGA TCAACGAGAT GCCCGAAATC GAGGTCGTCG TCGTCCGCTC GGCCGAGCCC CCCACCGGGA TCGGCGAGCC GGGGGTTCCC CCGCTCGCCC CCGCCGTCGC CAACGCGCTC GCCGCGCTCG GCGGCAAGCG TCCGGCCCGC CTGCCGATGG TGCGGGCATG A
Protein sequence | MTMGAQHTTL SRRPLSRRDA LKGAGALVIG LSLPFGAGKA LAQAKGPAPV DPNAFIRIGA DDVVTVIVKH TEMGQGPYTG LATIAAEELD ADWSQMRVQS APANVAIYAN TLLGAQLTGG STAIANSFDQ LRRAGAMARA MLVQAAAAEW KVSAADIAVG KGVISHKASG RSAGFGKFAA AAAKLPAPKQ VRLKDPSAFT LIGKDQSGRR VDSADKSTGR AKFTIDMTAP DMLTVLVARS PRFGGTVKSF DAAETLKVKG VVAVRQISSG VAVYARGMWP AIKGRKALKV AWDDSKAEMR GTDEMIAAYL EQTRKPGRVH HASGDVDKAL AAGGEVIEAD YAFPYLAHAP MEPLDGFMVW DGTTARARFG SQGQTIDQGA IARVFGIPME KVEIETLLAG GSFGRRAQAT AHLAAELAEV AKAMPVGTPV KLVWTREDDI HGGYYRPLFV HRFRGAVKDG RITAWSNTMM GQSFMIGSPF ESFAVKDGID SIMSEGAAEL LYDIADFRCD VHVAQSPVPT LWWRSVGHTH TGYAVECFVD QLLKAAGQDP VAGRLAMMGK APRAAGALKA VAELAQWKGS QAANGRARGV AVVESFNTFV AQIAEVSVGA DGEPRVHKVW AAVDCGIAVN PDIIRAQVEG GIGYALGHAL YAEVPLVEGV PAVSNFNDYR SLRINEMPEI EVVVVRSAEP PTGIGEPGVP PLAPAVANAL AALGGKRPAR LPMVRA
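
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that step, assuming whitespace is the only non-sequence content; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = "ATGACCATGG GCGCGCAGCA"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_fraction(first_blocks))          # 0.65
```

Applying the same functions to the complete gene sequence would allow a comparison with the GC content and Gene Length fields; that comparison is not carried out here.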