Gene Dshi_1490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_1490 
Symbol 
ID5712669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp1548941 
End bp1550608 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content67% 
IMG OID641267405 
Productextracellular solute-binding protein family 5 
Protein accessionYP_001532833 
Protein GI159044039 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.758744 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGT CCACCATCAA CGGCAAACCC ATGCATCCGG CGGTCGCAAC CTACGCCCAG 
GACTGCCGCA CCGGCACGCT CAGCCGCCGC GAATTCCTGT CCTACGCCAC TGCCCTCGGC
GCCACCTCCG TCGCCGCCTA CGGCATGATC GGGGCCAAAC CGGCCCGCGC GATGACCGCC
ACACCGGCCC AGGGCGGCAC GCTGCGCATC CAGCACCTGG TCAAGGCGAT GAAGGAGCCG
CGGACCTATG ACTGGTCCGA GCTTGGCAAC CACTCCCGCG GGTTCCTCGA ATATCTCGTG
GAATACAATG CCGACGGAAC CTTCCGCGGC ATGCTGCTGG AAAGCTGGGA AGTGAACGAC
GACGCCACCG TCTATACCCT CAACGTCCGG CCCGGCGTGA CCTGGAACAA CGGCGACGCC
TTTACCGCCG AGGACGTGGC GCGCAACATT ACCGGCTGGT GCGACAGCTC GCTCGAAGGC
AACTCCATGG CCACCCGCGT GCAGGGGCTG ATCGACGAAG CCACCGGCCA AGCCCGGGAG
GGCGCGATCG AGATCGTCGA TGACATGACC GTGCGCCTGA CCCTCAGTGC GCCCGACATC
ACCCTGATCG CCACCATGTC CGACTACCCC GCCGCGATCA CCCATGCCAG CTACGAGGGC
GGCAACCCGT TCGACCACGG CATCGGCACC GGCCCCTATC GCCCGGTCAG TTTCGAGGCG
AACGTGCGCG CCGTGCTGGA ACGCGCCACC GACCACACCT GGTGGGGCAC CGAGGTCTAT
GGCGGCCCCT ATGTCGACCG GATCGAGTTC GTCGATTTCG GCACCGACCC GGCCACCTGG
CTGGCGGGCG CCGAGGCCGA GGAATTCGAC CTGACCTACG AGACCACCGG CGAGTTCGTC
GACATCTTCA GCGCCATCGG TTGGTCCGTC ACCGAGGCCG TGACCGGCGC CACCGTCGTG
TGCCGTCCCA ACCAGGCCGC CGAGATCGAC GGCGTGACGC CCTATGCGGA CGTGAACGTG
CGCCGGGCGC TTGCGATGGC CGTGGATAAC AGCGTGCTGC TGGAGCTGGG CTACAACAAC
CAGGGCACCG CGGCGGAAAA TCACCATGTC TGCCCGATCC ATCCGGAATA CGCCGATATC
GGCGCGCCGG AAACGGACCC GGCCAAGGCC AAGGAGATGA TCGACGCCGC CGGCCTGGGC
GATTTCACCC ACACCTTCAT CACCCCCGAT GAAGAGTGGC TCGCCAATAC TGGCGCCGCC
CTGACCGCGC AGCTGCGCGA CGCGGGCATC CAGGTCGATC ACCGCATCCA GCCCGGCGCC
ACCTTCTGGG GCGACTGGAC CAAGCATGCG TTCTCGGCCA CCTCCTGGAA CCACCGGCCC
CTGGGCGTGC AGATCCTGGC ACTGGCCTAC CGCTCGGGCG AGGCATGGAA CGAATCCGCC
TATGCCAACC CGGAGTTCGA CGCGGCCCTC GCCGAGGCCC TGGCCATCGC CGATGCCGAC
AAGCGCCGCG AAGTCATGGG CAAGGTGCAG CAAATCCTGC GCGACGACGG CGTGATCATC
CAGCCCTACT GGCGGTCGCT CTACAACCAC CACCGGGGCG ACGTGGTCAA TGCCGAGAAG
CATCCGAGCC ACGAGATCCA CGTCTACAAG CTCGGCTTCG CGGCCTGA
 
Protein sequence
MTMSTINGKP MHPAVATYAQ DCRTGTLSRR EFLSYATALG ATSVAAYGMI GAKPARAMTA 
TPAQGGTLRI QHLVKAMKEP RTYDWSELGN HSRGFLEYLV EYNADGTFRG MLLESWEVND
DATVYTLNVR PGVTWNNGDA FTAEDVARNI TGWCDSSLEG NSMATRVQGL IDEATGQARE
GAIEIVDDMT VRLTLSAPDI TLIATMSDYP AAITHASYEG GNPFDHGIGT GPYRPVSFEA
NVRAVLERAT DHTWWGTEVY GGPYVDRIEF VDFGTDPATW LAGAEAEEFD LTYETTGEFV
DIFSAIGWSV TEAVTGATVV CRPNQAAEID GVTPYADVNV RRALAMAVDN SVLLELGYNN
QGTAAENHHV CPIHPEYADI GAPETDPAKA KEMIDAAGLG DFTHTFITPD EEWLANTGAA
LTAQLRDAGI QVDHRIQPGA TFWGDWTKHA FSATSWNHRP LGVQILALAY RSGEAWNESA
YANPEFDAAL AEALAIADAD KRREVMGKVQ QILRDDGVII QPYWRSLYNH HRGDVVNAEK
HPSHEIHVYK LGFAA