Gene Dshi_1457 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1457 
Symbol 
ID5712634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1514658 
End bp1516022 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content60% 
IMG OID641267370 
Producttype III restriction protein res subunit 
Protein accessionYP_001532800 
Protein GI159044006 
COG category[S] Function unknown 
COG ID[COG3421] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.0747485 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAGC CCGACATCTC TAACGACATC ACCGGCAACC TTGCGCCCCG GATCGAGTTG 
CGTCCCTATC AGCGCACGGC GCTGGAACGC TGGCTGTTCT ACATCGACAA GTACGACGGA
CGGCCCAAGG CCCCGCACCT GCTGTTCCAC ATGGCCACGG GCAGCGGCAA GACGGTCCTG
ATGGCGGCAC TGATCCTGGA CCTGTACCGG CGCGGCTACC GCAACTTCCT GTTCTTCGTG
AACTCGGCCC AGATCATCGA GAAGACCAAG GAAAACTTCC TTAATGCCGC ATCGGCCAAG
CACCTGTTCG CGCCCACGAT CCGCATTGAC GACAAGCCCG TGGACATCCG CGCGGTGGAC
ACCTTCGACG CGGTGTCAGG CGATGCGATC AACATCCACT TTACCACCAT CCAAGGGCTG
CACACGCGGA TGCAGGCCCC GAAAGAGAAC GCCGTCACCA TCGAGGATTT CCGCGACTAC
AAGGTGGTGA TGATCTCGGA CGAAGCACAC CACCTGAACG CAGAGACCAA GAAGACGCTT
ACGGAAGGCG AGAAGGCAGA GAAGGCCAGT TGGGAAGGCA CGGTATCCGA GATTTTCCGC
CAGCACCCCG AGAACATGCT GTTGGAGCTT ACGGCAACCG TGGACCTGAG CCACGATGCG
ATCCGCGCGA AGTACGCCGA CAAGATACTC TACGACTATT CCCTGCGCCA GATTCGCGAA
GACGGCTATT CCAAGGACAT CGAGTTGCGG CAGGCCGATT TACCACCGGC GGAACGGATG
ATGCAGGCAA TGGTTCTGAG CCAGTACCGC CGCAAGGTGG CCGAGGCGCA TGGGCTGCAT
TGCAAGCCGG TGATCCTGAT CAAGTCCAAG ACGATCAAGG ACAGCGCCGA TAACGAGGCC
GCGTTCACGG CGATGGTGGC CGGGCTGACG GGCGAGGCGC TGGACGCGCT CAGGGCGGCC
TCTGAGGGCG ATGAGACCCT TTCTCGGGCC TTCACCTTCA TCATGGATGA AAGGGCCATG
AGCGGTGCGG ATTTCGCCCG TGAGCTACAG GGCGATTTTG CCCCCGAGAA GGTGGTGAAC
GTTAACAACC CCAAGGATTT GGAAAACAGA CAGATAGAAC TCAATGCCTT GGAGGACCGC
GACAACGAGA TACGGGTGAT CTTTGCCGTC GATAAGCTGA ACGAGGGCTG GGACGCGCTG
AACCTGTTCG ACATCGTGCG GCTGTACGAT ACGCGCGACG GCAAGGCCAA CAAAGTGGGC
AAGACCACAA TGGCCGAGGC TCAGTTGATC GGACGCGGTG CCCGGTACTT CCCCTTCATG
GCTCCGGACC AGCCCGACGC GGCGCGGGAA AAAGCGCAAG TATGA
 
Protein sequence
MLKPDISNDI TGNLAPRIEL RPYQRTALER WLFYIDKYDG RPKAPHLLFH MATGSGKTVL 
MAALILDLYR RGYRNFLFFV NSAQIIEKTK ENFLNAASAK HLFAPTIRID DKPVDIRAVD
TFDAVSGDAI NIHFTTIQGL HTRMQAPKEN AVTIEDFRDY KVVMISDEAH HLNAETKKTL
TEGEKAEKAS WEGTVSEIFR QHPENMLLEL TATVDLSHDA IRAKYADKIL YDYSLRQIRE
DGYSKDIELR QADLPPAERM MQAMVLSQYR RKVAEAHGLH CKPVILIKSK TIKDSADNEA
AFTAMVAGLT GEALDALRAA SEGDETLSRA FTFIMDERAM SGADFARELQ GDFAPEKVVN
VNNPKDLENR QIELNALEDR DNEIRVIFAV DKLNEGWDAL NLFDIVRLYD TRDGKANKVG
KTTMAEAQLI GRGARYFPFM APDQPDAARE KAQV