Gene Dshi_1974 details

Gene Information

Locus tag: Dshi_1974
Symbol:
ID: 5712968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2087391
End bp: 2088665
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 73%
IMG OID: 641267897
Product: hypothetical protein
Protein accession: YP_001533314
Protein GI: 159044520
COG category: [S] Function unknown
COG ID: [COG3864] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
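
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the gene is a stop codon that contributes no residue; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2087391
END_BP = 2088665
GENE_LENGTH_BP = 1275
PROTEIN_LENGTH_AA = 424


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len):
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


print(gene_length(START_BP, END_BP))   # 1275
print(protein_length(GENE_LENGTH_BP))  # 424
```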


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0525296
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.788975
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCG CGCAATCCCA CAGCCGCCGC GCCACCCATG CGCTGCAAAA GCTGTCGGAG 
GCCGATCCGG CGCTCGCCGC TCTCGCGCTC TGGTGCGCCC ACCGCGACGC CGATCTTGCG
GGCGACCTGC CCGCCGACAG CGACGGGCAC ACCATCCGCT ACGCACCCGG CTTCGCGGCG
CTGTCGCTGC CGGAGCAGAT GGGCCTCTGC GCCCACCACA TCCTGCACAT CGCCCTGCGC
CATTCCGCCC GCAGCGAAAC GCTCCGCCTG CGGCTCGGAC CCGGCTTCGA TCCGGACCTC
TTCGGCATCG CCGCCGACAT CCTGATCAAC GAAACCCTGC TGCAGGCGGG CTATATCCAG
CCCCGCCCCC ATGTCAGCCA CGCCACCGTG AAACGCGAGC TTGGCATCGA CAGCCCCGCC
GACCTGCTCC GGAGCTTCGA CGCCGAACGC CTCTTCACGG AGATGCGCCG CGACGCCGCC
GCGAAGCCCG AGGGCCAGGG CAAGACCGAC AAGATCAAGG CCATGGCGGG CGCGGACGGC
TTTCGGCCCG ACATCGCCCC CAGCCCCACG GGCGAGGACG GCGACGAGGA CACGCCCGAG
GCCCGCGATT TCGAGTGGCG CCAGCACCTC GCCCGGGCGC TGGAGGCCGG CAAACTGGCT
GGCCGGGGGA TCGGCGCGCT CGGCTTCCGG CTCGCCGACA TCCCCGAGAC CACCACCCCG
TGGGAGGTGA TCCTGCGCGG CCTGCTGGAC CGCGCGACAC GGGCCGACCC GCGCCGCAGC
TTCCGCCGCC CCGCGGGCCG CTGGGTCGCG GGCGAGGCCG AGGCGCGCGC ACGCGGCCGC
CCCGTCCCGG TCTTCGAGCC CGCCCTGCAG CGCGAGACGA CCCAGCCCCG GATCGTGCTG
GCCATCGACA GCTCCGGCTC GGTCACCGGC GATCAGCTGG CCCATTTCGC CGCGCAAATC
GCCCGGATCG GGCGCCGGGT GCTGGCCGAG ATCCACGTGC TGATCTTCGA CGAGACCGTG
CAATCGGCCC ACAAGATGCG CGGCACCCAC TGGGCCGCGA CCCTGGCGGG CTGGGACTTC
GCCCGCGACG GTGGGACGAG TTTCGTCGAT GTGCTGGAAC GCGCCGCCGC GCTGACCCCC
TCGGCGGTCG TGGTGCTCAC CGATCTCGAC GGCCCCATGG GCGCCGCACC CGGCCGCGCC
CCGGTAATCT GGGCCTGCCC CAAACCACCC GAGAGCCCCC CACCCTTCGG TCGCGTGCTG
GTGCTGGACC GCTGA
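
The GC content reported in the gene information could be recomputed from the sequence above along these lines. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; the helper name `gc_content` is hypothetical, and the blanks that split the sequence into 10-base blocks are stripped before counting.

```python
def gc_content(sequence):
    """Fraction of G/C bases in a sequence; whitespace is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, as an illustration. The full sequence
# would be passed the same way, as one string with all lines concatenated.
first_line = "ATGAGCCGCG CGCAATCCCA CAGCCGCCGC GCCACCCATG CGCTGCAAAA GCTGTCGGAG"
print(f"{gc_content(first_line):.0%}")
```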
 
Protein sequence
MSRAQSHSRR ATHALQKLSE ADPALAALAL WCAHRDADLA GDLPADSDGH TIRYAPGFAA 
LSLPEQMGLC AHHILHIALR HSARSETLRL RLGPGFDPDL FGIAADILIN ETLLQAGYIQ
PRPHVSHATV KRELGIDSPA DLLRSFDAER LFTEMRRDAA AKPEGQGKTD KIKAMAGADG
FRPDIAPSPT GEDGDEDTPE ARDFEWRQHL ARALEAGKLA GRGIGALGFR LADIPETTTP
WEVILRGLLD RATRADPRRS FRRPAGRWVA GEAEARARGR PVPVFEPALQ RETTQPRIVL
AIDSSGSVTG DQLAHFAAQI ARIGRRVLAE IHVLIFDETV QSAHKMRGTH WAATLAGWDF
ARDGGTSFVD VLERAAALTP SAVVVLTDLD GPMGAAPGRA PVIWACPKPP ESPPPFGRVL
VLDR
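
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below builds the codon table from the usual compact NCBI layout (bases ordered T, C, A, G) and translates up to the first stop codon. The function and variable names are illustrative. Table 11 assigns the same amino acids to codons as the standard code and differs in its set of permitted start codons; this sketch assumes the gene starts with ATG, as this one does, and does not handle alternative start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI compact layout).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(sequence):
    """Translate a DNA sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGAGCCGCG CGCAATCCCA CAGCCGCCGC"))  # MSRAQSHSRR
print(translate("GTGCTGGACC GCTGA"))                  # VLDR
```

The two outputs match the first ten and last four residues of the protein sequence listed above.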