Gene Dshi_4021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_4021 
Symbol 
ID5714550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009957 
Strand
Start bp87675 
End bp88910 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content57% 
IMG OID641276933 
Productreplication initiation protein RepC 
Protein accessionYP_001542229 
Protein GI159046559 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0836066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTACA CGCCCGTAAC GCCGTTCCGG CGAACGATAG ATGCTGCCAT CCTGAAACAT 
CAGGCAGCGA CCCAAGAAGA CCTGCCCCCA GCCGGCGCCA ACAAGTGGGA GGTCCTGAGG
GAGCTCGCTG CCGCTCGAGT CGCGTTCGGC TTGTCCGATC GGGATTTGAC GGTGCTTCAG
GCGCTGGTCA GCTTTCACCA AGCGACAATT CTCGGAGGCA ATGACAGCGA ATTGATTGTA
CATCCGTCCA ACAAGGCGAT TTGCGAGCGC CTGAACGGCA TGCCCTGCTC GACGATGCGG
CGCCACCTCT CCAACCTTGT GCAGACTGGC TTTGTTGTCC GGCGCGATAG CCCCAATGGG
AAGCGCTATG CCCGCCGCTA CGGCGACGAA AAGGTTGCGT TTGGGTTCGA CCTCTCTCCG
CTCGTTCGAC GCTTCCAGGA AGTTTGTGAG GCTGCTGAGA CCGTCCGGGC CGCAGAAGAG
CGGTACAAGC GCCTACGTGC CACTGTGAGC CTCATGCGGC GTGACCTCGC AGGGCTGGCC
GAGTACGGGC GCTCACTTCG TCCGGATCAG GGCGTCTGGG ACCAATTCTC TGATCTTGCG
GCCCTAATGG CCCGAGATCT TCGCAGAAAA CTCGAAATGG AAGACCTTAG GCGCATCGAA
GACGCTTTGG GGTCAGCTTT AGATCACGCC CGAAGCCTTC TGGATGGCTG TGAAACAGAA
AATATGAGCA CCAATGATGC TGTTTCTGAG CAGCATTATC AGAATTCAAA TAAAGACTCT
TATGATCTTG AACCTCGCTT AGAAAAAGCG CGGGGCGGAG GCGCTGTGCG CGAAACTCCA
GAAGTTGCCA ATAGTCATCT GTGTTCTGAA GATGAGGGCA ACTCAACGGC AACTATTGAC
GATCAACTGA TGCCGAACAT ACCGCTTGGT CTCGTCCTCG CTTCCTGTCA GGAATTCAAA
GCGTATTCCG AGCAGCCCGT GCGCCACTGG CACGATCTGG TCCGGGTGGC TGATGTGGTC
AGGCCCATGA TGGGTATTTC CCCGTCCGCG TGGGACGAGG CGAAACGCTA TATGGGTCCC
GAAGAAGCGT CTGTTGTGAT CGTTGCAATG CTTGAACGGT TTGCGGATAT CCGATCACCT
GGAGGCTACT TGAGAACCCT ATCTTCAAAG GCAGCAATTG GGGAGTTCTC CTGCGGTCCG
ATGATCATGG CCTTGATGCG GCGGGATGCT GCATGA
 
Protein sequence
MDYTPVTPFR RTIDAAILKH QAATQEDLPP AGANKWEVLR ELAAARVAFG LSDRDLTVLQ 
ALVSFHQATI LGGNDSELIV HPSNKAICER LNGMPCSTMR RHLSNLVQTG FVVRRDSPNG
KRYARRYGDE KVAFGFDLSP LVRRFQEVCE AAETVRAAEE RYKRLRATVS LMRRDLAGLA
EYGRSLRPDQ GVWDQFSDLA ALMARDLRRK LEMEDLRRIE DALGSALDHA RSLLDGCETE
NMSTNDAVSE QHYQNSNKDS YDLEPRLEKA RGGGAVRETP EVANSHLCSE DEGNSTATID
DQLMPNIPLG LVLASCQEFK AYSEQPVRHW HDLVRVADVV RPMMGISPSA WDEAKRYMGP
EEASVVIVAM LERFADIRSP GGYLRTLSSK AAIGEFSCGP MIMALMRRDA A