Gene Dshi_1840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1840 
Symbol 
ID5712832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1921699 
End bp1922916 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content70% 
IMG OID641267764 
Producthypothetical protein 
Protein accessionYP_001533183 
Protein GI159044389 
COG category[E] Amino acid transport and metabolism
[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases
[COG1765] Predicted redox protein, regulator of disulfide bond formation 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.949047 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTACCG AAAAGCTGAC CTTCACCGGC CATTCCGGCG ACACCCTTGC CGCGCGTCTC 
GACCTGCCCG AGGGCCCGCA CCTGGCCACC GCGCTCTTTG CCCATTGCTT CACCTGCTCC
AAGGACATCC CCGCCGCGCG CCGGATCGCG CAGCGGCTCG CGGCCATGGG GATCGCGGTT
TTGCGGTTCG ATTTCACCGG GCTGGGGCAT TCGGGGGGCG AGTTCAGGAA CACCACGTTT
TCGTCCAACG TCGCCGACCT GCGGCTGGCG GCGGAGGCGC TGGCGGCCCG CGGCATGGCG
CCCAGCCTGT TGATCGGGCA CAGCCTGGGC GGGGCGGCGG TGCTGAAGGC GGTGCGCAGC
ATTCCGGGGG TCAAGGCGGT GGCGACGATC GGCGCGCCCT TCGATCCCGG GCATGTCACC
CATAATTTCG CCGAGGCGCT GGAGACGATC GCGGCGCAGG GCGAGGCCGA GGTGCAGCTG
GGCGGGCGGC CCTTCCGGAT CCGCAAGGCG TTCGTCGAGG ATGTGACGGC CGAGAAACTG
GCCCCCGAGA TCGCCGCGAT GAAGGCCGCC CTTCTGGTGC TGCACGCGCC GCTGGACGCG
CAGGTGGGGA TCGAGAACGC CACGCAGATC TTCGCGGCCG CGAAACATCC CAAGAGCTTC
GTCACCCTCG ACGATGCGGA CCACCTGATC ACCCGCGCTG CGGATGCGGA TTACGCCGCC
GAAGTGATCG CCGCTTGGGT CGGGCGATAC CTGGACCTGC GCCCGCCCGC CCCGCCCCCG
GGCGTGCCCG AGGGGATCAC CCGGGTGTCG GAGGCCGATC CCGCCGGGTT CCTGCAGGAC
GTGAGCGCGG GCCCCTCCCA TCACATCCAG GCCGATGAGC CGCTGGCCTA TGGCGGCACC
AATCGCGGGC TGACACCCTA CCAGTTGCTG GCCGCCGGGC TGGGCGCCTG CACCTCGATG
ACCCTGCGGA TGTATGCCCG CCAGAAGGGC TGGCCCCTAA CCCATGTCTC GGTCGACGTG
ATGCACGACA AGGTGCACGG CCAGGACGCC AAGGGCGCTC ATGACCGCAT CGACAGTTTC
GTGCGCCGCA TCCACCTGGA GGGCGATCTG GACACGGCGC AGCAGGAGCG GCTGCTGGAG
ATCGCGGACA AGTGCCCGGT GCATCGCACG CTCGAGACCG GCGCACGCAT CGTCACCGAA
CTGGCCGTGC CCGCCTGA
 
Protein sequence
MPTEKLTFTG HSGDTLAARL DLPEGPHLAT ALFAHCFTCS KDIPAARRIA QRLAAMGIAV 
LRFDFTGLGH SGGEFRNTTF SSNVADLRLA AEALAARGMA PSLLIGHSLG GAAVLKAVRS
IPGVKAVATI GAPFDPGHVT HNFAEALETI AAQGEAEVQL GGRPFRIRKA FVEDVTAEKL
APEIAAMKAA LLVLHAPLDA QVGIENATQI FAAAKHPKSF VTLDDADHLI TRAADADYAA
EVIAAWVGRY LDLRPPAPPP GVPEGITRVS EADPAGFLQD VSAGPSHHIQ ADEPLAYGGT
NRGLTPYQLL AAGLGACTSM TLRMYARQKG WPLTHVSVDV MHDKVHGQDA KGAHDRIDSF
VRRIHLEGDL DTAQQERLLE IADKCPVHRT LETGARIVTE LAVPA