Gene Dshi_3719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3719 
Symbol 
ID5714248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009955 
Strand
Start bp124654 
End bp127605 
Gene Length2952 bp 
Protein Length983 aa 
Translation table11 
GC content66% 
IMG OID641276635 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_001541931 
Protein GI159046259 
COG category[S] Function unknown
[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
[COG3427] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAAGG ATGGGCTGCA TCAGACGATG TTGCGGGTGG AAGACGCCCC GCTTCTGACC 
GGGCGCGGCC AGTTCTTTGA CGATCTGCCG ACGCCGCACG GAACCTTGCA GGCCGCGATC
CTGCGCAGCC CCTATGCCCA CGCAAGGATC ACCGACATTT CGGCCGATGC CGCCCGCAGG
CTGGCAGGCG TTCACGCCGT CCTGACAGGC GAGGATTACG CAAGCGTCGC ATCGCCGCTG
ATGGTCGGGG TCAAGCTGCC CATCGAATGC TGGCCGATCG CACGCGACAA GGTGCGTTAT
GTAGGCGAGC CGGTCGCGGT GGTTCTGGCC GATGATCGCT ACCTCGCCGA GGACGCGCTC
GATCTCATCG AAGTCAGCTA TGATACGCTC GCCCCCGTCA TTGACCCGCT AAAGGCCCTC
GCGGACGACG CGCCTGTGCT GCACGACGAG ATGGGCGGCA ATCTTGGCGC CGATCGCCGT
TTTTCTTATG GCGACCCTGA CTCCGCCTTT GCCGAGGCCG ACCATGTCGT CGAGATTTCG
GTGCGCTATC CACGCAATTC CTGTACGCCC ATCGAAACCT ACGGCGTGCT GACCCGCTGG
AATCCGCACA CGCGGTCCTA TGACGTCACC TCGAATTTTC AGGGGCCGTT CTCGATCCAT
CCGGTGGTCG CGCGGGCGCT GAACGTGCCT GGCAACCGTC TGCGCCTGCG CACGCCCGAG
GATTCCGGCG GCAGCTTCGG TATCAAGCAG GGCGTGTTTC CCTATATCGC GCTTCTGTCC
GCCTGTGCGC GGCTGGCCGA CCGTCCCGTC AAGTGGGTCG AGGACCGGCT CGAGCATCTG
ACCGCCTCGG TGTCTGCCAC AAATCGCGCC ATCACGCTGC GCGCAGCGGT TACGGCAAAG
GGACGCGTGA CCGCGCTCGA CTACGATCAG GTCGAGGATG TCGGCGCCTA TCTGCGGGCG
CCGGAGCCGG CGACGCTCTA CCGCATGCAC GGCAACCTCT GCGGTGCCTA TGACATCGCC
AACCTGCGGG TGCGCAACCG CATCGTGATG ACGAACAAGA CGCCGACGGG GCTGAACCGC
GGGTTCGGCG GCCCGCAGAT CTATTTCGCG CTGGAACGGC TGATGCAACG TATCGCCATC
GAGCTGGGTC TCGATCCGGT GAATGTTGCC CGCGACAACC TGATCCCCAA GGGTGCCTTT
CCCTACCGCA CCGCCAGCGG TGCCACCTAC GATTCGGGCG ATTATCAAAA TGCCTTCGAC
ACGGCGTTGA AGGACGGCCG CTATGACGCG CTGTGCCAAA AGCGCGACGC GGCCCGTGCC
GAGGGTCGGC TTTACGGCAT CGGCCTTGCC GTGGCGGTGG AGCCGTCGGT ATCGAACATG
GGATACCTTT CGACCGCTCT GACACCTGCG GAACGGGCCA AGGCGGGGCC GAAAAACGGC
GCGCAATCCT GCGCGACCAT CGCCATCGAC CCGGTCGGCA GCATCACGGT GCAGATCGAT
TCCGTGCCCC AGGGCCAGGG CCACAAGACC GTCGCCGCCA GCATCGTCGC CGAACGGTTC
GGGCTGGAGC CGGGCGCGAT CACGGTCGTG GCCGCGATGG ACACCGGCAA GGACGCCTGG
TCCATTGCCG CCGGGAACTA TTCCAGCCGC TTTGCCGCCG CCTCCGCCGG GGCGGTGCGC
ATCGCCGCCG ACCGCCTGCG CGAGCGGATC GCCGCAATTG CCGCAAGCCA ACTCAACGCG
CGGCCCGAGG ATATCGAATT CGAGAATGGC AAGGTTTTCA TCGCGTCCAA CCCCGGGGCC
GCGCTCGTCT TCGCCCGTGT CGCCTCCACC GGGCATTGGG CTCCCGGCAC CCTGCCTGAC
GGAATTGCCG CGCCGATGCG CGAGACCGCG ATCTGGACCG CCCCCGAACT GACCGCCCCG
ACCGATGAGG ACGGGATCAA CTCATCGCTG TGCCACGCGT TCATCTTCGA TTTCTGCGGG
GTGGAGATCG ATCCCGTGAC CCATGAAGCC CGCGTCGACC ACTACGTCAC GATGCACGAC
TGTGGCACCA TCCTGCACGA AGGCATGGTC GAGGGGCAGA TCCGCGGTGC GTTCGCGCAG
GCGGTCGGTG CGGCGCTTTA CGAGGAATAC GTCTATGACG ACGACGGCGC GTTCCTTGCG
GGCACCCTGG CCGATTACCC GGTGCCCACG GTGCATGAAA TCCCTGACCT TGAGATATTG
CATACCTGCA CACCTTCGCC GCTGACCCCG CTGGGGGCCA AGGGCGTGGG CGAGGGCAAT
TGCATGTCCA CGCCGGTTTG CATTGCCAAT GCCGTCGCCG ATGCACTGGC ACCGGCTCGC
GGCAATGTCG ATGTCACCCT GCCGCTTTCG CCTGCGCGTA TCGCATCGTA TCTGCCGGAC
GAACCCACAG CACCCGAAGG CATCGCGCAG CCACGGCCCA GCTCCGGCAA GGGGCTGTCC
GGTCAGGGCG AGGCGCGCGT CGGTGCCGCA CCGCAGGAGA TATGGGATCT GCTGATGGAC
GCAGACCAAC TCGCCGCGAT CATCCCCGGC GCACATGGCG TCAAGCGCCT GTCGCCAACC
CGCTTTCTGG CCGATGTCAC GCTGGGCGTC GGGCCGGTCA AGGGGCGTTA CCGGGTGCAG
GTCGGGCTAT CGGATCTGGA CGCGCCCAAG AGCGCTACAT TGACCGGCAA GGCCGGCGGC
GCGCTTGGCA CCGGCGAGGG GTCGGGCCGG GTCACGCTGA CACCGGACGG ATCGGGCGGC
ACGGTGATCG GCTACAGCTA CGAGGCCGCC GTGGGCGGCA AGGTCGCATC CGTCGGTGGG
CGGCTGCTGG ACGGTGCTGC GAAAATCGTC ATTGGCCAGT TTTTTGCCGC CCTCGGACGC
AGGGCCGGAG ACGCACCACA GCCCGACGGC ATGATCGCCC GCATTCGCGG CTGGATCGGG
GGGACGCGAT GA
 
Protein sequence
MPKDGLHQTM LRVEDAPLLT GRGQFFDDLP TPHGTLQAAI LRSPYAHARI TDISADAARR 
LAGVHAVLTG EDYASVASPL MVGVKLPIEC WPIARDKVRY VGEPVAVVLA DDRYLAEDAL
DLIEVSYDTL APVIDPLKAL ADDAPVLHDE MGGNLGADRR FSYGDPDSAF AEADHVVEIS
VRYPRNSCTP IETYGVLTRW NPHTRSYDVT SNFQGPFSIH PVVARALNVP GNRLRLRTPE
DSGGSFGIKQ GVFPYIALLS ACARLADRPV KWVEDRLEHL TASVSATNRA ITLRAAVTAK
GRVTALDYDQ VEDVGAYLRA PEPATLYRMH GNLCGAYDIA NLRVRNRIVM TNKTPTGLNR
GFGGPQIYFA LERLMQRIAI ELGLDPVNVA RDNLIPKGAF PYRTASGATY DSGDYQNAFD
TALKDGRYDA LCQKRDAARA EGRLYGIGLA VAVEPSVSNM GYLSTALTPA ERAKAGPKNG
AQSCATIAID PVGSITVQID SVPQGQGHKT VAASIVAERF GLEPGAITVV AAMDTGKDAW
SIAAGNYSSR FAAASAGAVR IAADRLRERI AAIAASQLNA RPEDIEFENG KVFIASNPGA
ALVFARVAST GHWAPGTLPD GIAAPMRETA IWTAPELTAP TDEDGINSSL CHAFIFDFCG
VEIDPVTHEA RVDHYVTMHD CGTILHEGMV EGQIRGAFAQ AVGAALYEEY VYDDDGAFLA
GTLADYPVPT VHEIPDLEIL HTCTPSPLTP LGAKGVGEGN CMSTPVCIAN AVADALAPAR
GNVDVTLPLS PARIASYLPD EPTAPEGIAQ PRPSSGKGLS GQGEARVGAA PQEIWDLLMD
ADQLAAIIPG AHGVKRLSPT RFLADVTLGV GPVKGRYRVQ VGLSDLDAPK SATLTGKAGG
ALGTGEGSGR VTLTPDGSGG TVIGYSYEAA VGGKVASVGG RLLDGAAKIV IGQFFAALGR
RAGDAPQPDG MIARIRGWIG GTR