Gene Dshi_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1994 
Symbol 
ID5712989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2111233 
End bp2112393 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content67% 
IMG OID641267918 
Productputative glycosyltransferase 
Protein accessionYP_001533334 
Protein GI159044540 
COG category[R] General function prediction only 
COG ID[COG4671] Predicted glycosyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0252697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.74194 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGCCA ACCGTATCGC CCTTTATTCC CATGACACGC TCGGTTTTGG CCACTTTCGG 
CGCAACCTGA TGCTGGCCAA GAAGCTGCGC GCGTTGCCGT CGAAGCCGGA TGTGATGCTG
GTGGCCGGGA CCTACGAAGT CGGGGCCTTT GACATTCCCG ACGGCATCGA GGTTCTGACG
CTGCCGGCCT ATGCCAAGCA CGCGGATGGC CAATACACTG CCCGGCGCCT GAACATGGAG
CTGTGCGAGC TGCGCGCCCT GCGCGAGGCG ATCCTGGCGG CCACCTTGAA GCGGTTCGCG
CCCGACCTGC TGATCGTGGA CAATGTCCCG CTTGGCGCCC AGGGGGAACT GGAGGGGCCG
CTGCGCAAGC TTCGCAAACG GGGCAAGACG CGGCTTGTGC TGGGGTGTCG CGATATCCTC
GATGATCCGG CGACCGTGCG GCGGCAATGG CTGCGCCAGC GCCATGTCGA GACCATCAAC
ACCTATTTCG ATGCGGTGTG GATCTATGGG GACCCGGCCG TCTATGACGT GTTCAAGGAC
TGCGATCTGA CCGGCATCAC CGCCGAGATC GTGCATACCG GCTACTTGTT GAAGGACTGG
CCCGCCGAGG TCGCGCCGAG TGGTGGGGAG GCGCCGCTGG TTCTGTGCAC CGTGGGCGGC
GGGCGCGACG GTCTCGACCT GTGCAAGGCG TTCGCGGCGG CGGAGCTTCC GGCGGGTCAC
CGCGGCATCA TCGTGCCGGG CACGCAAATG GATGCGGACG CCCTGGCCCG TATCCGGCAG
ATCGCGGCGG GCAATCGCGG CATGCAGGTG GTGCCCTTCG TGCCGGACCT CGTGCCGCTG
ATGGCCGCGG CGCGCCGGAT CGTGGCGATG GGGGGCTACA ACACGACCTG CGAGATCCTG
GCCCTGAAGA AACCGGCGCT GATCGTGCCA AGGGTCGCGC CGCGCACGGA GCAGCTGATC
CGCGCGCGCG CCCTGAGCGA CCGCGGGCTT GTCGATATCT GCCACCCGAG GGGGCTGTCC
CCGACGGCCT TGTCGGAGTG GATGGCGCGC CCGATCCCGC GCGCGGCCTC TCATGGGATC
AGGACCGATG GGCTGGCCTC TGTTGCCGCC CTCGCCCAGT CCGCGCTTTA CCCCGATTAC
CAACAGATCG CCGCAGAGTG A
 
Protein sequence
MQANRIALYS HDTLGFGHFR RNLMLAKKLR ALPSKPDVML VAGTYEVGAF DIPDGIEVLT 
LPAYAKHADG QYTARRLNME LCELRALREA ILAATLKRFA PDLLIVDNVP LGAQGELEGP
LRKLRKRGKT RLVLGCRDIL DDPATVRRQW LRQRHVETIN TYFDAVWIYG DPAVYDVFKD
CDLTGITAEI VHTGYLLKDW PAEVAPSGGE APLVLCTVGG GRDGLDLCKA FAAAELPAGH
RGIIVPGTQM DADALARIRQ IAAGNRGMQV VPFVPDLVPL MAAARRIVAM GGYNTTCEIL
ALKKPALIVP RVAPRTEQLI RARALSDRGL VDICHPRGLS PTALSEWMAR PIPRAASHGI
RTDGLASVAA LAQSALYPDY QQIAAE