Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dshi_1545 |
Symbol | |
ID | 5713202 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1606867 |
End bp | 1608687 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641267460 |
Product | putative extracellular solute-binding protein |
Protein accession | YP_001532888 |
Protein GI | 159044094 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4166] ABC-type oligopeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.134516 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0971522 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACAAC TCCGCCGCGC AATCCGCATT GGCCACGCCG CCCTCATCGC CGGTTTCCTC GCCGTCCCCG GCGCGCTCGC CGCGCCCTCC CATGGCATCG CTATGTATGG CGACCCGGCC CTTCCACCGG ATTTTGTGTC TCTCCCCTAC GCCAACCCCG ATGCGCCCAA GGGCGGCCGG ATCGTTCTGG GCGAGGTCGG CGGCTTCGAC AGCCTCAATC CCCACATCCT CAAGGGGCGC GTCCCGTGGC AGCTCAGATT TCTCGCCCAT GAGAGCCTGA TGGCCCGAAA TTACGACGAG CCCTTCGCAC TTTACGGGCT TCTGGCCGAA TCGGTGGAGG TCGACCCGGA CGGGCTCTGG GTCGCCTTCA CCCTGCGCCC CGAGGCGCGA TTCTCCGATG GCAGCCCGGT GACGGTCGAG GATGTGCTGT GGTCGTTCGA AACCCTCGGC ACCGTCGGCC ACCCGCGCTA TCTCGGCGCC TGGGCGCAGG TGCAATCCGC CGAAGCCACC GGACCCCGCT CCCTGCGCAT CACCTTCACC GAACCCAACC GCGAACTGGC GCTTATCATG GGCATGCGTC CGATCCTGAA AAAGGCCCAG TGGGAGGGCA AGGATTTCGC CGAAAGCGGG CTGGACGAAG CCCCGATCTC CTCCGCGCCT TACGTGATCG CGGATTTCGA GCCGGGCCGC TTCGTCACCC TGCGCCGCAA TCCCGATTAC TGGGCCGCCG ACCTGCCCAT CCGGCGGGGG GTCCACAATC TCGACGAGAT CCGGATGGAA TTCTTCGGCG ATGCCGGTGT GATGTTCGAG GCGTTCAAGG CCGGCATCCT CACCTCGATC CGCGAGACCA ACACCGCCAA ATGGAACCGC GACTACGACT TTCCGGCCAT GCAGGCGGGC GAGGTAGTCA AATCCGTCAT CCCGCATGAA CGCCCCTCGG GCATCACCGG CTTTGCCATG AACACCCGCC GCGCCGATTT CGCCGATTGG CGCGTGCGCG ATGCCCTGAT CCACGCCTTC AATTTCGAGC TGATCAACCG CACCCTCAAT GGTGCCGAGG TGCCGCGCAT CACCTCCTAT TTCTCCAACT CCGTGCTGGC GATGCAGGAC GGCCCCGCCA CGGGCCGGGT GGCCGAGCTG CTCGCCCCCT ACGCCGACAC ACTGCCCCCC GGCGCGCTGG AGGGCTACAC CCTGCCGGTC TCCGACGGCT CCGAGGCCAA CCGCCGCAAC ATCCGCGCCG CCCTGCGCCT GATGGACGAG GCCGGTTACA CCATCGAGGA AGGCGTGATG ACCCGCCCCG ACGGCACCCC CTTCACCTTC GAGATCCTCC TGTCCCAGGG CAGCTCCGAA GTGCAGTCCA TGATCAACAT CTACGCCAAG TCCCTCGAAC GGCTGGGCGT TTCCGTGGAC ATCACCACCG TCGACAGCGC TCAGTACCGC GAGCGCACCG ATGCCTACGA TTTCGACATG ACCTACTACA CCCGCGGCCT GTCGCTCAGC CCCGGCAACG AGCAGCGGCT CTATTGGGGC TCGGAGGGCG TGGACATCCC CGGCAGCCGC AACTGGCCCG GCATCGACAG TGCGGCGGTG GACGGGCTGA TCGACGCCAT GCTGAGTGCC AAGAGCCAGG AGGACTACAT CGCCACCGTG CGCGCGCTCG ACCGGGTGCT GACCGCGGGA CGCTATGTCA TCCCGATCTG GTTCAACCCG GTCTCCCGCA TCGCGCACGC GGCCGATCTG ACCTACCCGG AAGCCTTGCC CGCCTATGGC GACTGGATCT CGTTTCACCC GGATGTGTGG TGGTCGAAAT CCGCCGAATG A
|
Protein sequence | MKQLRRAIRI GHAALIAGFL AVPGALAAPS HGIAMYGDPA LPPDFVSLPY ANPDAPKGGR IVLGEVGGFD SLNPHILKGR VPWQLRFLAH ESLMARNYDE PFALYGLLAE SVEVDPDGLW VAFTLRPEAR FSDGSPVTVE DVLWSFETLG TVGHPRYLGA WAQVQSAEAT GPRSLRITFT EPNRELALIM GMRPILKKAQ WEGKDFAESG LDEAPISSAP YVIADFEPGR FVTLRRNPDY WAADLPIRRG VHNLDEIRME FFGDAGVMFE AFKAGILTSI RETNTAKWNR DYDFPAMQAG EVVKSVIPHE RPSGITGFAM NTRRADFADW RVRDALIHAF NFELINRTLN GAEVPRITSY FSNSVLAMQD GPATGRVAEL LAPYADTLPP GALEGYTLPV SDGSEANRRN IRAALRLMDE AGYTIEEGVM TRPDGTPFTF EILLSQGSSE VQSMINIYAK SLERLGVSVD ITTVDSAQYR ERTDAYDFDM TYYTRGLSLS PGNEQRLYWG SEGVDIPGSR NWPGIDSAAV DGLIDAMLSA KSQEDYIATV RALDRVLTAG RYVIPIWFNP VSRIAHAADL TYPEALPAYG DWISFHPDVW WSKSAE
|
| |