Gene Dshi_1971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1971 
Symbol 
ID5712965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp2084030 
End bp2085007 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content64% 
IMG OID641267894 
Productputative TRAP-transporter extracellular solute-binding protein 
Protein accessionYP_001533311 
Protein GI159044517 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.418895 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.696077 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCTGA AGACCCCGAT TGCGTTCTCG ACGGCCCTGC CGGGGCTGGG CACGCCGATC 
CCGCGGGTGG CCGATGCGCT GGCGACCATG TCCGGCGGAA CGCTGAAGAT GAAGGTGTAC
GAGCCCGGCA AGCTGGTCCC GGCCTTCGAG ATCCTGGATG CGGTGTCCTC GGGCAAGATC
AACTCCGGCT ACACCACCGC CGGGTATTGG GCGGGCAAGA TCCCGGCGGC CCCCCTGTTC
TCGGCTGTGC CCTTTGGCCC CGAGGCGGGC GAGTACATGG CATGGCTCTA TTACGGCAAC
GGCATGGATC TCTATCAGGA GATGTATGAC CAGGCCGGCT ACAACGTGCA TGTGCTGCCC
TGCGCGATCC TGGCGCCCGA AACCTCGGGC TGGTTCGCCA AGGAGATCAC GTCGGCCGAA
GATCTGAACG GGCTGAAGAT GCGGTTCTTC GGGTTGGGCG GCAAGGTGAT GCAGAAGTTG
GGCGTGGCCA CATCGCTGCT GCCCGGCGGC GAGATCTTCC CGGCGCTGGA GAAGGGCGCC
ATCGACGCGA CCGAGTTCTC GATGCCCGCC ATCGATGCAC GGCTCGGTTT CCACAAGCTG
GTGAAGTTCA ACTACTTCCC CGGCTGGCAC CAGCAGGCGA CCGTGTTCGA GTTGATGATC
AACAAGGACG TCTGGAACGA CGCCAGCGAG CAGCACAAGG CGATCATCGA GAGCGCCTGC
AAGGCGTCCA TGGCCGACAG CTTCGCCGAG GGTGAGGCGA TCCAGCACGC GGCGCTGATC
GACAACGTGG AAAAGAACGG TGTCGAGATG AAGCAGTGGT CGCCGGAGAT GCTAGAGCTG
TTCCGGGCGA CTTGGGACGA GGTGGCCGCA GAAGAAGCCG CGAACGATGA ATTCTTCGCC
AAGGTACTGG CGGACATGAC CACGTTCCGC GACGGCTACG CTCTGTGGAA GCGCAACGCC
TTCCTGCCGC GGGACTGA
 
Protein sequence
MLLKTPIAFS TALPGLGTPI PRVADALATM SGGTLKMKVY EPGKLVPAFE ILDAVSSGKI 
NSGYTTAGYW AGKIPAAPLF SAVPFGPEAG EYMAWLYYGN GMDLYQEMYD QAGYNVHVLP
CAILAPETSG WFAKEITSAE DLNGLKMRFF GLGGKVMQKL GVATSLLPGG EIFPALEKGA
IDATEFSMPA IDARLGFHKL VKFNYFPGWH QQATVFELMI NKDVWNDASE QHKAIIESAC
KASMADSFAE GEAIQHAALI DNVEKNGVEM KQWSPEMLEL FRATWDEVAA EEAANDEFFA
KVLADMTTFR DGYALWKRNA FLPRD