Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtur_0214 |
Symbol | |
ID | 7082399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dictyoglomus turgidum DSM 6724 |
Kingdom | Bacteria |
Replicon accession | NC_011661 |
Strand | + |
Start bp | 209703 |
End bp | 211532 |
Gene Length | 1830 bp |
Protein Length | 609 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643457330 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002352157 |
Protein GI | 217966651 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000434403 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGGGTA AAGTTTTAGG GGTGTTTTTA ACAGTTTTAC TTTTGACCCT TCTACTTGTA CCTTCCACAT TAGGTCAGGT TTCTTTACCA CGTAATGAAA CAGTTTATAC CGTTGGAGCA CTTTGGTCTG CAACCACCTA CAGTTTATAT GCACCTTCTT CCACCTATGG TACTGAGCAT TTCTTGTACA TGCCACTTTT CATCTACAGC AATATGAAAG ATGGATGGCT ACCAGTACTT GCCGAATCCT TCACAGCAGT GAATAAGAGA ACACTAAGAG TAAAAATCAG AGATATTGCA AAATGGAGCG ATGGTACTCC AATTACTGCT GATGATGTAG TGTTCACCTT CAACTGCACA AAAGAGGTAG GATTTGGACC TGGTAATGGA TGGTGGGATT ACATCCAAAC AGTAAGAGCA GTAGACAATA AGACAGTAGA ATTTGTAATG AGACCAGATG CACAAAACTA TGCCTCTTTC TTAGGATATG CCTTCACTAC AAGGATTGTT CCAAAACATG TTTATGAGCC TCTATTAAAG CAAGGCGTAC AAGCAGTAAA AGATTTCCAG AACAATGATC CTGCAAAACA GGTAGTTTCT GGACCATATA AACTCTACTA CACAGATCCC AACATTGTAG TATATGGAAG AATTGATGAT TGGTGGGGCA AAGCAGTATT TGGGCTTCCT GTTCCTAAAT ACATTGCTAA TGTAATTTAT AGAGATAATG CTGTAGCAAA CTTAGACTTT GAGAAAGGAA ATGCTGACTG GGCAGGAGTC TTTATACCCG ATGTAAGTTC CCTCTGGACT CAGAAGAAGC TCCCCATTGG AACTTGGTTC AAGAACAAAC CATACTACAT GCCAGATGGT CTTGATCTTC TCTACATCAG CTACTACAAT CCATTACTCA AGGATCCTGC AGTGAGAAAA GCTATAGCTT ATGCAATACC ATATAAAGAG ATGCTTGATA AAGCATACTT TGGCTATGGT AACCAAGCTC ATCCATCAAT GGTAATAGAC GACTTTGAAG CATACAGAGA ATACATTGAT CAAGCTTATG CAAAATATGT ATGGGGTTCT CCCGATGGAA AACCTAAGAC AGATCTTAAG AAAGCAAATG AAATACTTGA CAAGGCTGGA TATAAGAGAG GAAAAGATGG TATAAGAATC AGTCCAGATG GAAAGAGAAT GGGTACTTTC ACTATTCAAG TTCCAAATGG ATGGACCGAC TGGATGATGA TGTGTGAAAT GATGGCAGCA AACATGAGAG AAATTGGGCT CGACGTAAAG ACCGAATTCC CAGACTTCTC TGTATGGTGG ACCAGATGGA CTCAAGGAAC CTTTGACTTC ATCCTTGGAT GGTCTGCAGG CCCTGGTTTT GATCATCCAT GGAACGTCTA CAGATTAGTA TTAGATCCAG CCCTTTACAA ACCATTTGGT CAAGATCAGT ATGGTAACTT TGAAAGATAT AACAATCCAG AGGTTGGCAA ACTCTTAGAT AAGATTGCTG CAACCCTTGA TCCAAAGGTC AAGAAGACAT TATTCTATCA ATTACAAAGA ATAATCTACA GAGATCTCCC AGCAATTCCA CTCTTCTATG GAGCTCACTG GTATGAATAC AACGAAACTG TATGGACTGG ATGGCCCAAC GAAAGCAGAC CATGGTGGTA CCCAGCTGCT CCTTGGAGCA ACATGGCACT ACCAATACTC TTTGGCATAG CTCCAAAAGG ACAAACACCT AAGGTGCCAG CTTGGGTAGA GTTCAAAGCA AAAGGCGGTC TCCTTATACC AACCAACGAC GTCCTCAATG CATTAGCAAA GGCAAAATAA
|
Protein sequence | MRGKVLGVFL TVLLLTLLLV PSTLGQVSLP RNETVYTVGA LWSATTYSLY APSSTYGTEH FLYMPLFIYS NMKDGWLPVL AESFTAVNKR TLRVKIRDIA KWSDGTPITA DDVVFTFNCT KEVGFGPGNG WWDYIQTVRA VDNKTVEFVM RPDAQNYASF LGYAFTTRIV PKHVYEPLLK QGVQAVKDFQ NNDPAKQVVS GPYKLYYTDP NIVVYGRIDD WWGKAVFGLP VPKYIANVIY RDNAVANLDF EKGNADWAGV FIPDVSSLWT QKKLPIGTWF KNKPYYMPDG LDLLYISYYN PLLKDPAVRK AIAYAIPYKE MLDKAYFGYG NQAHPSMVID DFEAYREYID QAYAKYVWGS PDGKPKTDLK KANEILDKAG YKRGKDGIRI SPDGKRMGTF TIQVPNGWTD WMMMCEMMAA NMREIGLDVK TEFPDFSVWW TRWTQGTFDF ILGWSAGPGF DHPWNVYRLV LDPALYKPFG QDQYGNFERY NNPEVGKLLD KIAATLDPKV KKTLFYQLQR IIYRDLPAIP LFYGAHWYEY NETVWTGWPN ESRPWWYPAA PWSNMALPIL FGIAPKGQTP KVPAWVEFKA KGGLLIPTND VLNALAKAK
|
| |