Gene Dshi_1797 details


Gene Information

Locus tag: Dshi_1797
Symbol:
ID: 5712785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 1868767
End bp: 1870593
Gene Length: 1827 bp
Protein Length: 608 aa
Translation table: 11
GC content: 77%
IMG OID: 641267717
Product: hypothetical protein
Protein accession: YP_001533140
Protein GI: 159044346
COG category: [S] Function unknown
COG ID: [COG2861] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
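
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon (the record lists the gene as not spliced, and the gene sequence ends in TAG).

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1868767
END_BP = 1870593
GENE_LENGTH_BP = 1827
PROTEIN_LENGTH_AA = 608


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks compare a derived value against the value listed in the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for the listed values: 1870593 - 1868767 + 1 = 1827 bp, and 1827 / 3 - 1 = 608 aa.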


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000479521
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.132264
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTAGAG GCATACTCTC GGGCCTGTTC TGGGGCGGTG CGGCCAGCAT GATCGTGCTG 
ATGGCCGCGT CGGTTTATTT TCCCCTGCGG GACATCAGCG ACCGCACCGC GCCGCGCGCG
CCGGTGCGCG AGAACGTGTT GCTCGCCGAC CCGCCCCCCC TCCCGCCGGG GGCGGAGGTG
CAGCCGAGCG TTGCGGAGCC TGAACCTTCC GTCACGACCG ATCGTGTCGC GGCCCCGTTG
CAGACCCCGC GTCCAGACAG CGGCCCGCCG CCCGATGCCC CTGCACCGGC CGCGCCGACG
GTTGCCGAGG CGCCGCCCGC GCCGCCGACG GACCCCGCCG CGCCCGCTGG CCCGGGCCCG
GAACTTGCCG AGACCGCCCC GGCGAATGTT GCGCAGCCCG CGCCCGGCCA AGCGGTCCCA
ACCGCGCCTG GGCCCGATAC CCCGTCGGCC ACCGCGCCGC GGATGGCCGA CGTGCCGCCG
GGCCTGCCGG TGCCCGGTGC CGTGCGCGAC CGCGCCCCGG GAGAGGCGCT CGCAGCCCTG
CAAACCCCTT TGCCGGTCCC CGAGGTTGCC CAGGAGCCGG GCGATTTGAC GGGCCCTGCC
GCGATCCTGC CCGCCCCCGG CGCGCGGCAA ATCGGCGAGG CGGCGACGCG GACGCCGGAC
CGGCCCGCGG ATCTGCCTGT GCCTGGCACG GCCGCCGAAC CACCGGTCGC GTCGGCGCCT
GTCGCGCCGC CCCGGCCGCC GGAGCTCGCC GCGGAGCCAT CGATGCCGTC CGGCCCGCCG
GAGCCGGTCG AGCTTGCCCG TCCCGACAGC TCGCCGCCGC GGCTGGCCCT GGCACCCGCA
GACCTGCCGC TGCCTCCCGT GGCCGCGGCC CCCATGGAAG CGTCGGAGCC GCCCGTCCCG
GCCGCAGGCC CTCTGCCCGC CGACCTGCGC CCGGCCGCCG ACACGCCTGC ACCGCCCGCC
GACGACCGGC GCACGGCCCT GGTCGTGCCG CCGCGGGAGC TGCCCCGCCG TCTGGTGCTC
GGCAGCGACC AGAGCTTCGG CACCCGCGGC CCTGGCCTGT CCTCGCGGAT CCCCCGGATC
GGCGAGACCG CCGCGGTGGC CGAGGCCGAC GATCCCCTGA CGCCAGAGCC GGAGGTGCCC
GCGCCGCTTG GTGCCCTGGC CCGAAATGCG CTGCCTTTCG AAGGGGCGGA GGGTGTGCCG
CGGCTCGCCC TGGTGCTGCG CGCGACCTCT GACGCGCGGG CGATCACCGA TGTGCTGAGC
CGGATCGCCG ATCCCGTGGC CGTGGCGCTC GACCCGACAT GGCCGGAGGC GGATGCGCGC
GCGGCCGAGC TGCGCGCGGC GGGGCACGAG GTGCTGATCA CCCTGACGGG GCTACCCGAC
CCGGTCGAGC CGCGCGACAT CGATACCGCC CTCGCGGTCC ATATCGCGCG CCTGCCGGGC
GCGATGGGGG TCTGGCTGCC CCGGACGAGC CCGGTCTTCG GCGATCGGGA ACTGCTGCGC
CACCTGGTGG CGGTGTTGGG CGACACGGGC CACGGGCTGG TGGCGCCGCT CAGCGGGCTC
GACGCGGTGG GGCAGGAGGC GCGGGCGATC GGTCTGCCCG CGATTTCGGT GGGCCGGGTC
CTTGGCGGGT CCGGAGAGGG TGAGGACGCC CTGCGCAGAA GCCTCGACCA GGGGGCGTTG
CGCGCCGGCG CGGACGGGCA GGCGGTCCTG CTGGGCGAGA CCCGGGCCGA GACGCTGTCG
GCCCTGCGCG ACTGGAGCGC CGCGCAGGAC CCGGATGCCT TGCGCCTTGC ACCGATCTCC
GCGCTTTTGC TGGCCCCGGG GTCCTAG
 
Protein sequence
MGRGILSGLF WGGAASMIVL MAASVYFPLR DISDRTAPRA PVRENVLLAD PPPLPPGAEV 
QPSVAEPEPS VTTDRVAAPL QTPRPDSGPP PDAPAPAAPT VAEAPPAPPT DPAAPAGPGP
ELAETAPANV AQPAPGQAVP TAPGPDTPSA TAPRMADVPP GLPVPGAVRD RAPGEALAAL
QTPLPVPEVA QEPGDLTGPA AILPAPGARQ IGEAATRTPD RPADLPVPGT AAEPPVASAP
VAPPRPPELA AEPSMPSGPP EPVELARPDS SPPRLALAPA DLPLPPVAAA PMEASEPPVP
AAGPLPADLR PAADTPAPPA DDRRTALVVP PRELPRRLVL GSDQSFGTRG PGLSSRIPRI
GETAAVAEAD DPLTPEPEVP APLGALARNA LPFEGAEGVP RLALVLRATS DARAITDVLS
RIADPVAVAL DPTWPEADAR AAELRAAGHE VLITLTGLPD PVEPRDIDTA LAVHIARLPG
AMGVWLPRTS PVFGDRELLR HLVAVLGDTG HGLVAPLSGL DAVGQEARAI GLPAISVGRV
LGGSGEGEDA LRRSLDQGAL RAGADGQAVL LGETRAETLS ALRDWSAAQD PDALRLAPIS
ALLLAPGS