Gene Dshi_2266 details

Gene Information

Locus tag: Dshi_2266
Symbol:
ID: 5713919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2389926
End bp: 2391176
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 63%
IMG OID: 641268188
Product: putative sarcosine oxidase, beta subunit
Protein accession: YP_001533603
Protein GI: 159044809
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
TIGRFAM ID: [TIGR01373] sarcosine oxidase, beta subunit family, heterotetrameric form
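
The listed coordinates and lengths are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, but both agree with the numbers given. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon, which encodes no residue


length = gene_length(2389926, 2391176)
print(length)                  # 1251
print(protein_length(length))  # 416
```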


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.547041
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCTATT CCGCGCTGGA ACTTCTCAAA CAAGGGCTCT CGGGCAACAA GGGTTGGCCC 
CTGCTGTGGC GCAGCCCCGC GCCGAAGCCC GCCTACGATG CGCTCATCAT CGGCGGTGGC
GGGCATGGGC TCGCCACGGC CTATTACCTT GCGCGCAATC ATGGGATGAC CAATGTTGCG
GTGCTGGAAA AGGGGTATCT CGGCAGCGGC AATATCGGCC GTAACACCAC TATCGTACGT
GCCAACTACC TGCTGGCCGG TAATTCCGAG TTCTATTCCC ATTCCTTGAA GCTATGGGAG
GGGATGGAGA CGGATCTGAA CTTCAACGCG ATGCACTCCC AGCGCGGGAT CATCAATCTG
TTCCACTCCG ACGGGCAGCG CGATGCCTTT GCCCGCCGCG GCAACGCCAT GATCAACCAG
GGTGACGATG CGATCCTGCT GGACCGGGAG GGTGTGCGCC GCGAGGTGCC CTATCTGGAT
TTCGACCAGA CCCGCTATCC GATCTATGGC GGGCTGTGGC ACAAGCGCGG CGGGACCGCG
CGCCACGACG CTGTCGCCTG GGGCTATGCC CGCGGAGCCG ACCAGCGTGG TGTTGACCTG
ATCCAGAATT GCGAAGTCAC CGGATTTCTG ATCGAGAACG GCTCTGTACA GGGCGTGCAG
ACCACGCGCG GGGAGATCCG CGCCCCCAAG GTCGGCATGG TCGTGGCGGG TCGCTCGGGG
CAGGTGGCCG AGATGGCGGG CTTCCGTCTG CCCATCGAGA GCCACATCCT GCAGGCCTTC
GTGACCGAAG GGCTGAAACC ATGCATCGAC CAGGTGATTA CCTATGGCAT GGGTCACTTC
TATATCAGCC AGTCCGACAA GGGCGGTCTG GTGTTTGGCG GGGACCTGGA TTTCTACGCC
TCCTATGCGC AGCGCGGCAA CCTGCCGATG GCCGAGCACG TTGTTGAGGC GGGAATGACT
CTGATGCCGA TGATCGGCAA GGCGAAGATG CTGCGCTCCT GGGGCGGGAT CATGGACATG
ACGCCCGACG GCTCGCCAAT CATCGACCAC AGCCCCGTAC AAGGGCTCTA TGTCAACGGG
GGCTGGTGCT ATGGCGGGTT CAAGGCAGTG CCCGCCTCAG GCTGGTGCAT GGCGCATCTC
ATGGCCACGG ATACGGTACA CGAGATCGCT CGACGCTATC GGCTCGACCG GTTCCGCACG
GGGCACCTGA TCGACGAGGA AGCCACGGGC AGTCAGCACA ATCTACACTG A
 
Protein sequence
MRYSALELLK QGLSGNKGWP LLWRSPAPKP AYDALIIGGG GHGLATAYYL ARNHGMTNVA 
VLEKGYLGSG NIGRNTTIVR ANYLLAGNSE FYSHSLKLWE GMETDLNFNA MHSQRGIINL
FHSDGQRDAF ARRGNAMINQ GDDAILLDRE GVRREVPYLD FDQTRYPIYG GLWHKRGGTA
RHDAVAWGYA RGADQRGVDL IQNCEVTGFL IENGSVQGVQ TTRGEIRAPK VGMVVAGRSG
QVAEMAGFRL PIESHILQAF VTEGLKPCID QVITYGMGHF YISQSDKGGL VFGGDLDFYA
SYAQRGNLPM AEHVVEAGMT LMPMIGKAKM LRSWGGIMDM TPDGSPIIDH SPVQGLYVNG
GWCYGGFKAV PASGWCMAHL MATDTVHEIA RRYRLDRFRT GHLIDEEATG SQHNLH
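
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the alternative initiation codons, and an initiator codon is read as methionine. The following is a minimal sketch of that rule, not the database's own pipeline: the function name is illustrative, and the start-codon set is a commonly used subset of the table 11 initiation codons rather than the full list.

```python
# Amino acids for table 11, in NCBI order (bases cycle T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a subset of the table 11 initiation codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading an initiator codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGCTATT CCGCGCTGGA ACTTCTCAAA"))  # MRYSALELLK
```

Passing the full gene sequence to `translate_cds` should give a string that can be compared directly with the protein sequence once the spaces are removed from the latter.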