Gene Dshi_3863 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3863 
Symbol 
ID5714392 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009956 
Strand
Start bp72755 
End bp73885 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content65% 
IMG OID641276776 
Producthypothetical protein 
Protein accessionYP_001542072 
Protein GI159046401 
COG category[S] Function unknown 
COG ID[COG2327] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0984668 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACC CCTATATTTC CAGCCTCTAC CCCGAGGTCC CCGGCCCGCC CTTCCCCAAG 
ACGCAAGAGT ACCTGCAATG CGCGGGCAAC AACCTGGGCA ACTTCATGTT CTGCTCCTCG
GTCCGCCGGA TCGTGCGCAC CACGACCCAC CCGCGCGGGG ATTTTCGCCG GCTCGACCTC
AAGACCATCG CCGCGGAATG CGACGGGATC GTCATCGCGG CGGCCAACTG GCTTCAGCCC
AAGCAGAACT ATGGCGGCCT GGCCGACCAG ATCGAAAAGG CGAACGTGCC CGCGGCGATC
ACCGGCATCG GCGCCCAGAG CTCCGGCGGC AAGATCCCCG AATTGCTGCC CGGCATGCTG
CGGCTGCTCA AGGTGGTCTC CGAGCGGTCC CACTCGATCT CCGTGCGTGG CCCGTTCAGC
GCCGAGGTGC TCAACCACTA CGGCATCCAG AATGTCACCG TCACCGGCTG CCCGTCGCTG
CTGTGGCACC GGGACCACCC CGCCGAGATC ACCCGCCTGC CCCGGGACGG CCGGGTCGGC
CGGGTCACGC TCAACGGCAC CCTGCACCGC TTCGACATCC CCAAGACCCC AGGCAAGGTG
GTCAAGCTGA CCCGGTTCAC CCTCCTGCAG GCCATGGCCT GGGGCTGCGA CTACGTGGTG
CAGAACGAAC GCCCCTTCCT GCAGGCCCAT CTGGGCGAGC TCGCCGAGGA CGACCAAGAC
AGCTGGGACT TCCTGCATTA CGTGTTCGAC GAGCCGGACC GCGCGATTCT GAAAACCTAC
CTGGAACGCC ATATCCAGGC CTTCCCGAAT ATTCCCGAAT GGATGGCCTA TTGCGCGAAT
CATGACCTGG TGCTGGGCAG CCGCCTGCAC GGGGTGATCG TGGGGCTGCT CTCGGGAACA
CCGGGGGTGC TGATCACCCA TGACAACCGG ACAGAGGAAA TGGGCCGCTT TGCCGGCATT
CCGACCATCA CCGCCGAGGA TTTCATGTCG CGGCCCAAGA TCGACCCGGA CGCGATCCTC
GCCGAGGCCG ATTTCGACGC CTTCAATGCC CGGCAGAAAG ACTATTTCCG GGACTTTGTG
GCCTGGTTCG ACGCCAACGA GATCCCGCAT CGGTTGACCG TCACCCCATA G
 
Protein sequence
MKNPYISSLY PEVPGPPFPK TQEYLQCAGN NLGNFMFCSS VRRIVRTTTH PRGDFRRLDL 
KTIAAECDGI VIAAANWLQP KQNYGGLADQ IEKANVPAAI TGIGAQSSGG KIPELLPGML
RLLKVVSERS HSISVRGPFS AEVLNHYGIQ NVTVTGCPSL LWHRDHPAEI TRLPRDGRVG
RVTLNGTLHR FDIPKTPGKV VKLTRFTLLQ AMAWGCDYVV QNERPFLQAH LGELAEDDQD
SWDFLHYVFD EPDRAILKTY LERHIQAFPN IPEWMAYCAN HDLVLGSRLH GVIVGLLSGT
PGVLITHDNR TEEMGRFAGI PTITAEDFMS RPKIDPDAIL AEADFDAFNA RQKDYFRDFV
AWFDANEIPH RLTVTP