Gene Dshi_1992 details


Gene Information

Locus tag: Dshi_1992
Symbol:
ID: 5712987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dinoroseobacter shibae DFL 12
Kingdom: Bacteria
Replicon accession: NC_009952
Strand:
Start bp: 2108817
End bp: 2109965
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 71%
IMG OID: 641267916
Product: hypothetical protein
Protein accession: YP_001533332
Protein GI: 159044538
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
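
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced CDS whose length includes a terminal stop codon.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2108817, 2109965)
print(length, protein_length_aa(length))  # prints "1149 382"
```

Under those assumptions, the result matches the listed gene length (1149 bp) and protein length (382 aa).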


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.073088
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value: 0.53449
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTAG CTTATGTCTC GACCGATCCC GGCATATCGC CCACCGGCAC CAAGGGGGCG 
TCGATCCATG TGCGCGCGAT CCTCGGGGCG TTGCTGCGCA TGGGCGCAGA GGTGACATTG
TTCGCGCCCC CGTCCCGCGC GCCGTTGCCG GAGGATTTGG CCGCGGTGAC CTGGGTGCCG
CTGCCGAAAC CGGCCAAGGG CGCGCCCGAG GTCCGCGAAC GCGCGCTGAT CGCGGCCAAT
GCGCGCCTGG CGCAGGCGAT GGAGGACCAC GGACCTTTCG ATCTGATCTA TGAGCGGCAC
GCCCTGTTTT CGGACGCGGC CATGCAATTC GGTGCGGCGC GCCGGATCCC CAGCGTGCTG
GAAGTCAACG CGCCGCTTCT GGAAGAACAG CGCCGCCACC GGGTTCTGCA GAATTCGGAC
GAGGCGGCGG CTCGTGCCCG GTCCTCGATC TCGGCGGCGG ATCGGATCAT CGCCGTCTCC
GATGCGGTCG GCGCCTATGC CGAAGGCTTC GGCGCCCGGT CGGTCAAGGT CGTGCCGAAT
GGCGTCGATG CGGACCGCTT TGCGGTGCCA CCCGGGTTCC GGCCGCCTTT CACCCTCGGG
TTTGTCGGCA CGCTCAAGCC CTGGCACGAT GTGGCCTGCC TGATCGATGC GCTGACGCTG
GTCCGGCGCT CGGTGCGCGA TGCGCGGCTG CTGGTGGTCG GCGACGGTCC GGAGCGCGCG
GCCCTCGAGG CGCAGGCGCG CGAGGGCGGT CTTGCCGACG CGGTCGACTT CCATGGCGCG
GCGCCGTCGC AGGACATCCC GGCGCTGCTG GCCCGGATGC ATGTGGGGCT CGCCCCCTAT
CGCGGGGGGG ATCCGTTCTA TTTCTCGCCG CTCAAGATCT ACGAATACAT GGCGGCGGGC
CTGCCCGTTC TCGTCAGTGA CCGGGGCAAC ATGCGCGATG TGGTCCTGCC GCCCCGGGCG
GGCGCGGTGG TGCCGCCCGA TGACCCCGCT GCGCTGGCCG AGGCGATCAT CCACCTGGCG
CAGAACCCGT CGGTCGGGCG CGCGCAGGGC CAGCGCGGGC GCGCCCATGT GATCCGCACC
GCCAGTTGGG ATCACGTCCT GCGGGCGAGC CTGAACGGCC TGCCGCTCCC TTCGGTCCTG
GCCGCCTGA
 
Protein sequence
MRVAYVSTDP GISPTGTKGA SIHVRAILGA LLRMGAEVTL FAPPSRAPLP EDLAAVTWVP 
LPKPAKGAPE VRERALIAAN ARLAQAMEDH GPFDLIYERH ALFSDAAMQF GAARRIPSVL
EVNAPLLEEQ RRHRVLQNSD EAAARARSSI SAADRIIAVS DAVGAYAEGF GARSVKVVPN
GVDADRFAVP PGFRPPFTLG FVGTLKPWHD VACLIDALTL VRRSVRDARL LVVGDGPERA
ALEAQAREGG LADAVDFHGA APSQDIPALL ARMHVGLAPY RGGDPFYFSP LKIYEYMAAG
LPVLVSDRGN MRDVVLPPRA GAVVPPDDPA ALAEAIIHLA QNPSVGRAQG QRGRAHVIRT
ASWDHVLRAS LNGLPLPSVL AA
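
The sequences above are printed in blocks of ten residues separated by spaces, so they need to be joined before any analysis. The following is a minimal Python sketch of that step, together with a GC-fraction calculation of the kind that could be compared against the GC content field. The helper names are hypothetical, and the sketch assumes the sequence contains only unambiguous bases (A, C, G, T); only the first line of the gene sequence is used for illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes only A, C, G, T)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "ATGCGCGTAG CTTATGTCTC GACCGATCCC GGCATATCGC CCACCGGCAC CAAGGGGGCG"
seq = clean_sequence(first_line)
print(len(seq), seq.startswith("ATG"))
print(f"GC fraction of this fragment: {gc_fraction(seq):.2f}")
```

Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare with the reported GC content.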