Gene Dshi_2027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_2027 
Symbol 
ID5713022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2147043 
End bp2148059 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content74% 
IMG OID641267951 
Productputative allophanate hydrolase subunit 2 
Protein accessionYP_001533367 
Protein GI159044573 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.124744 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCGC GCCTGGAGAT TTGCGCCGCC GGTCCCGGCC TGACCGTGCA GGATGCCGGG 
TTCCTGGGCT ATATCGGGCA GGGTCTGTCT CGGGGCGGGG CGGCGGACAC CCGGGCCCTG
GCCGAAGGCG CCGCATTGCT GCGCCAGTCC CCCGACCTCG CCGCGATCGA GATGGCCGGC
AGTGGTGGGA CCTTCCGGGT CACCCGCGAC GCCCGGCTCG CGCTGACCGG CGCGCCGATG
CAGGCGACGC TCGATGGCGC GCCGCTCGCC TGGCACGCCT GTCACGCGTG GCCCGCCGGG
GCCGAACTGC GCATCGGCGC CGTGCAGGGG GGCAGCTATG GCTACCTGCA TGTGGGCGGC
GGGATCGACA CGCCTGTGGA GCTCGGCTCA CGCTCCACCC ACCTCACCGC CGGGCTGGGC
CGGGCGCTGG CGGCGGGCGA CAGCCTGCCC CTGGGCCGCG ATCCCGGTGG CCCGGTGGGG
CAGGGGCTGC AGGTCGAGGA TCGCTTCTGT GGCGGCGAGA TCCGCATCAT CCGCAGCTTC
CAGAGCGACA GCTTCGCGCC CGAGGATGTC ACCCGCCTCT CCGAGACGCC CTTCACCCGC
GACCCGCGCG GCAACCGGAT GGGTGTGCGT CTCGCCCACG CGGGCGACGG GTTCTTTGCC
CGGGGCGGGC TCACGGTGCT GTCCGAGATC GTCGTGCCCG GCGACATCCA GGTCACCGGC
GAGGGCGCGC CTTATATCCT CGGGGCCGAA AGCCAGACCA CCGGCGGCTA TCCCCGCATC
GCCACCGTCA TCCCCTGCGA TCTGCCGCGC GCGATGCAGG CCGGGCCGGG CGCGCCGATC
CGCCTCGCCC TCGTGGATCG CGCCACCGCC CTTGCGGCCG AGCGCGCCGA GGCCAAGCTG
CTGCAGGCCT TGCCGAAACA GGTCCGCCCC CTCTTGCGCG ACCCGCGCGA GATGTCGGAC
CTTCTGTCCT ACCAACTGAT CAGCGGCGTC ACCGCCGGGG AGGAGCCCTC GCCATGA
 
Protein sequence
MTARLEICAA GPGLTVQDAG FLGYIGQGLS RGGAADTRAL AEGAALLRQS PDLAAIEMAG 
SGGTFRVTRD ARLALTGAPM QATLDGAPLA WHACHAWPAG AELRIGAVQG GSYGYLHVGG
GIDTPVELGS RSTHLTAGLG RALAAGDSLP LGRDPGGPVG QGLQVEDRFC GGEIRIIRSF
QSDSFAPEDV TRLSETPFTR DPRGNRMGVR LAHAGDGFFA RGGLTVLSEI VVPGDIQVTG
EGAPYILGAE SQTTGGYPRI ATVIPCDLPR AMQAGPGAPI RLALVDRATA LAAERAEAKL
LQALPKQVRP LLRDPREMSD LLSYQLISGV TAGEEPSP