Gene Information |
Locus tag | Dshi_3719 |
Symbol | |
ID | 5714248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009955 |
Strand | + |
Start bp | 124654 |
End bp | 127605 |
Gene Length | 2952 bp |
Protein Length | 983 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641276635 |
Product | aldehyde oxidase and xanthine dehydrogenase molybdopterin binding |
Protein accession | YP_001541931 |
Protein GI | 159046259 |
COG category | [C] Energy production and conversion; [S] Function unknown |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs; [COG3427] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
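The coordinate and length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 124654
END_BP = 127605


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 2952, matching "Gene Length"
    print(protein_length(length))  # 983, matching "Protein Length"
```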
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAGG ATGGGCTGCA TCAGACGATG TTGCGGGTGG AAGACGCCCC GCTTCTGACC GGGCGCGGCC AGTTCTTTGA CGATCTGCCG ACGCCGCACG GAACCTTGCA GGCCGCGATC CTGCGCAGCC CCTATGCCCA CGCAAGGATC ACCGACATTT CGGCCGATGC CGCCCGCAGG CTGGCAGGCG TTCACGCCGT CCTGACAGGC GAGGATTACG CAAGCGTCGC ATCGCCGCTG ATGGTCGGGG TCAAGCTGCC CATCGAATGC TGGCCGATCG CACGCGACAA GGTGCGTTAT GTAGGCGAGC CGGTCGCGGT GGTTCTGGCC GATGATCGCT ACCTCGCCGA GGACGCGCTC GATCTCATCG AAGTCAGCTA TGATACGCTC GCCCCCGTCA TTGACCCGCT AAAGGCCCTC GCGGACGACG CGCCTGTGCT GCACGACGAG ATGGGCGGCA ATCTTGGCGC CGATCGCCGT TTTTCTTATG GCGACCCTGA CTCCGCCTTT GCCGAGGCCG ACCATGTCGT CGAGATTTCG GTGCGCTATC CACGCAATTC CTGTACGCCC ATCGAAACCT ACGGCGTGCT GACCCGCTGG AATCCGCACA CGCGGTCCTA TGACGTCACC TCGAATTTTC AGGGGCCGTT CTCGATCCAT CCGGTGGTCG CGCGGGCGCT GAACGTGCCT GGCAACCGTC TGCGCCTGCG CACGCCCGAG GATTCCGGCG GCAGCTTCGG TATCAAGCAG GGCGTGTTTC CCTATATCGC GCTTCTGTCC GCCTGTGCGC GGCTGGCCGA CCGTCCCGTC AAGTGGGTCG AGGACCGGCT CGAGCATCTG ACCGCCTCGG TGTCTGCCAC AAATCGCGCC ATCACGCTGC GCGCAGCGGT TACGGCAAAG GGACGCGTGA CCGCGCTCGA CTACGATCAG GTCGAGGATG TCGGCGCCTA TCTGCGGGCG CCGGAGCCGG CGACGCTCTA CCGCATGCAC GGCAACCTCT GCGGTGCCTA TGACATCGCC AACCTGCGGG TGCGCAACCG CATCGTGATG ACGAACAAGA CGCCGACGGG GCTGAACCGC GGGTTCGGCG GCCCGCAGAT CTATTTCGCG CTGGAACGGC TGATGCAACG TATCGCCATC GAGCTGGGTC TCGATCCGGT GAATGTTGCC CGCGACAACC TGATCCCCAA GGGTGCCTTT CCCTACCGCA CCGCCAGCGG TGCCACCTAC GATTCGGGCG ATTATCAAAA TGCCTTCGAC ACGGCGTTGA AGGACGGCCG CTATGACGCG CTGTGCCAAA AGCGCGACGC GGCCCGTGCC GAGGGTCGGC TTTACGGCAT CGGCCTTGCC GTGGCGGTGG AGCCGTCGGT ATCGAACATG GGATACCTTT CGACCGCTCT GACACCTGCG GAACGGGCCA AGGCGGGGCC GAAAAACGGC GCGCAATCCT GCGCGACCAT CGCCATCGAC CCGGTCGGCA GCATCACGGT GCAGATCGAT TCCGTGCCCC AGGGCCAGGG CCACAAGACC GTCGCCGCCA GCATCGTCGC CGAACGGTTC GGGCTGGAGC CGGGCGCGAT CACGGTCGTG GCCGCGATGG ACACCGGCAA GGACGCCTGG TCCATTGCCG CCGGGAACTA TTCCAGCCGC TTTGCCGCCG CCTCCGCCGG GGCGGTGCGC ATCGCCGCCG ACCGCCTGCG CGAGCGGATC GCCGCAATTG CCGCAAGCCA ACTCAACGCG CGGCCCGAGG ATATCGAATT CGAGAATGGC AAGGTTTTCA TCGCGTCCAA CCCCGGGGCC 
GCGCTCGTCT TCGCCCGTGT CGCCTCCACC GGGCATTGGG CTCCCGGCAC CCTGCCTGAC GGAATTGCCG CGCCGATGCG CGAGACCGCG ATCTGGACCG CCCCCGAACT GACCGCCCCG ACCGATGAGG ACGGGATCAA CTCATCGCTG TGCCACGCGT TCATCTTCGA TTTCTGCGGG GTGGAGATCG ATCCCGTGAC CCATGAAGCC CGCGTCGACC ACTACGTCAC GATGCACGAC TGTGGCACCA TCCTGCACGA AGGCATGGTC GAGGGGCAGA TCCGCGGTGC GTTCGCGCAG GCGGTCGGTG CGGCGCTTTA CGAGGAATAC GTCTATGACG ACGACGGCGC GTTCCTTGCG GGCACCCTGG CCGATTACCC GGTGCCCACG GTGCATGAAA TCCCTGACCT TGAGATATTG CATACCTGCA CACCTTCGCC GCTGACCCCG CTGGGGGCCA AGGGCGTGGG CGAGGGCAAT TGCATGTCCA CGCCGGTTTG CATTGCCAAT GCCGTCGCCG ATGCACTGGC ACCGGCTCGC GGCAATGTCG ATGTCACCCT GCCGCTTTCG CCTGCGCGTA TCGCATCGTA TCTGCCGGAC GAACCCACAG CACCCGAAGG CATCGCGCAG CCACGGCCCA GCTCCGGCAA GGGGCTGTCC GGTCAGGGCG AGGCGCGCGT CGGTGCCGCA CCGCAGGAGA TATGGGATCT GCTGATGGAC GCAGACCAAC TCGCCGCGAT CATCCCCGGC GCACATGGCG TCAAGCGCCT GTCGCCAACC CGCTTTCTGG CCGATGTCAC GCTGGGCGTC GGGCCGGTCA AGGGGCGTTA CCGGGTGCAG GTCGGGCTAT CGGATCTGGA CGCGCCCAAG AGCGCTACAT TGACCGGCAA GGCCGGCGGC GCGCTTGGCA CCGGCGAGGG GTCGGGCCGG GTCACGCTGA CACCGGACGG ATCGGGCGGC ACGGTGATCG GCTACAGCTA CGAGGCCGCC GTGGGCGGCA AGGTCGCATC CGTCGGTGGG CGGCTGCTGG ACGGTGCTGC GAAAATCGTC ATTGGCCAGT TTTTTGCCGC CCTCGGACGC AGGGCCGGAG ACGCACCACA GCCCGACGGC ATGATCGCCC GCATTCGCGG CTGGATCGGG GGGACGCGAT GA
|
Protein sequence | MPKDGLHQTM LRVEDAPLLT GRGQFFDDLP TPHGTLQAAI LRSPYAHARI TDISADAARR LAGVHAVLTG EDYASVASPL MVGVKLPIEC WPIARDKVRY VGEPVAVVLA DDRYLAEDAL DLIEVSYDTL APVIDPLKAL ADDAPVLHDE MGGNLGADRR FSYGDPDSAF AEADHVVEIS VRYPRNSCTP IETYGVLTRW NPHTRSYDVT SNFQGPFSIH PVVARALNVP GNRLRLRTPE DSGGSFGIKQ GVFPYIALLS ACARLADRPV KWVEDRLEHL TASVSATNRA ITLRAAVTAK GRVTALDYDQ VEDVGAYLRA PEPATLYRMH GNLCGAYDIA NLRVRNRIVM TNKTPTGLNR GFGGPQIYFA LERLMQRIAI ELGLDPVNVA RDNLIPKGAF PYRTASGATY DSGDYQNAFD TALKDGRYDA LCQKRDAARA EGRLYGIGLA VAVEPSVSNM GYLSTALTPA ERAKAGPKNG AQSCATIAID PVGSITVQID SVPQGQGHKT VAASIVAERF GLEPGAITVV AAMDTGKDAW SIAAGNYSSR FAAASAGAVR IAADRLRERI AAIAASQLNA RPEDIEFENG KVFIASNPGA ALVFARVAST GHWAPGTLPD GIAAPMRETA IWTAPELTAP TDEDGINSSL CHAFIFDFCG VEIDPVTHEA RVDHYVTMHD CGTILHEGMV EGQIRGAFAQ AVGAALYEEY VYDDDGAFLA GTLADYPVPT VHEIPDLEIL HTCTPSPLTP LGAKGVGEGN CMSTPVCIAN AVADALAPAR GNVDVTLPLS PARIASYLPD EPTAPEGIAQ PRPSSGKGLS GQGEARVGAA PQEIWDLLMD ADQLAAIIPG AHGVKRLSPT RFLADVTLGV GPVKGRYRVQ VGLSDLDAPK SATLTGKAGG ALGTGEGSGR VTLTPDGSGG TVIGYSYEAA VGGKVASVGG RLLDGAAKIV IGQFFAALGR RAGDAPQPDG MIARIRGWIG GTR
|
| |
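The sequence fields above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate the coding sequence. It relies on the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ only in permitted start codons). The helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical, and only the first three blocks of the gene sequence are used as sample input.

```python
from itertools import product

# Codon table built in the conventional TCAG order; the amino acid
# assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace between the 10-character blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    sample = clean_sequence("ATGCCAAAGG ATGGGCTGCA TCAGACGATG")
    print(translate(sample))  # MPKDGLHQTM, the first ten residues of the protein
    print(round(gc_fraction(sample), 3))
```

Applied to the full gene sequence, `translate` should reproduce the listed protein sequence and `gc_fraction` can be compared with the reported GC content.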