Gene Dtur_0214 details


Gene Information

Locus tag: Dtur_0214
Symbol:
ID: 7082399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dictyoglomus turgidum DSM 6724
Kingdom: Bacteria
Replicon accession: NC_011661
Strand:
Start bp: 209703
End bp: 211532
Gene Length: 1830 bp
Protein Length: 609 aa
Translation table: 11
GC content: 41%
IMG OID: 643457330
Product: extracellular solute-binding protein family 5
Protein accession: YP_002352157
Protein GI: 217966651
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
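
The coordinates and lengths listed above agree with each other, which a short Python sketch can show. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record, but both are consistent with the values shown.

```python
# Values copied from the record above.
start_bp = 209703
end_bp = 211532

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the sketch reproduces the listed gene length of 1830 bp and protein length of 609 aa.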


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000434403
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGGGTA AAGTTTTAGG GGTGTTTTTA ACAGTTTTAC TTTTGACCCT TCTACTTGTA 
CCTTCCACAT TAGGTCAGGT TTCTTTACCA CGTAATGAAA CAGTTTATAC CGTTGGAGCA
CTTTGGTCTG CAACCACCTA CAGTTTATAT GCACCTTCTT CCACCTATGG TACTGAGCAT
TTCTTGTACA TGCCACTTTT CATCTACAGC AATATGAAAG ATGGATGGCT ACCAGTACTT
GCCGAATCCT TCACAGCAGT GAATAAGAGA ACACTAAGAG TAAAAATCAG AGATATTGCA
AAATGGAGCG ATGGTACTCC AATTACTGCT GATGATGTAG TGTTCACCTT CAACTGCACA
AAAGAGGTAG GATTTGGACC TGGTAATGGA TGGTGGGATT ACATCCAAAC AGTAAGAGCA
GTAGACAATA AGACAGTAGA ATTTGTAATG AGACCAGATG CACAAAACTA TGCCTCTTTC
TTAGGATATG CCTTCACTAC AAGGATTGTT CCAAAACATG TTTATGAGCC TCTATTAAAG
CAAGGCGTAC AAGCAGTAAA AGATTTCCAG AACAATGATC CTGCAAAACA GGTAGTTTCT
GGACCATATA AACTCTACTA CACAGATCCC AACATTGTAG TATATGGAAG AATTGATGAT
TGGTGGGGCA AAGCAGTATT TGGGCTTCCT GTTCCTAAAT ACATTGCTAA TGTAATTTAT
AGAGATAATG CTGTAGCAAA CTTAGACTTT GAGAAAGGAA ATGCTGACTG GGCAGGAGTC
TTTATACCCG ATGTAAGTTC CCTCTGGACT CAGAAGAAGC TCCCCATTGG AACTTGGTTC
AAGAACAAAC CATACTACAT GCCAGATGGT CTTGATCTTC TCTACATCAG CTACTACAAT
CCATTACTCA AGGATCCTGC AGTGAGAAAA GCTATAGCTT ATGCAATACC ATATAAAGAG
ATGCTTGATA AAGCATACTT TGGCTATGGT AACCAAGCTC ATCCATCAAT GGTAATAGAC
GACTTTGAAG CATACAGAGA ATACATTGAT CAAGCTTATG CAAAATATGT ATGGGGTTCT
CCCGATGGAA AACCTAAGAC AGATCTTAAG AAAGCAAATG AAATACTTGA CAAGGCTGGA
TATAAGAGAG GAAAAGATGG TATAAGAATC AGTCCAGATG GAAAGAGAAT GGGTACTTTC
ACTATTCAAG TTCCAAATGG ATGGACCGAC TGGATGATGA TGTGTGAAAT GATGGCAGCA
AACATGAGAG AAATTGGGCT CGACGTAAAG ACCGAATTCC CAGACTTCTC TGTATGGTGG
ACCAGATGGA CTCAAGGAAC CTTTGACTTC ATCCTTGGAT GGTCTGCAGG CCCTGGTTTT
GATCATCCAT GGAACGTCTA CAGATTAGTA TTAGATCCAG CCCTTTACAA ACCATTTGGT
CAAGATCAGT ATGGTAACTT TGAAAGATAT AACAATCCAG AGGTTGGCAA ACTCTTAGAT
AAGATTGCTG CAACCCTTGA TCCAAAGGTC AAGAAGACAT TATTCTATCA ATTACAAAGA
ATAATCTACA GAGATCTCCC AGCAATTCCA CTCTTCTATG GAGCTCACTG GTATGAATAC
AACGAAACTG TATGGACTGG ATGGCCCAAC GAAAGCAGAC CATGGTGGTA CCCAGCTGCT
CCTTGGAGCA ACATGGCACT ACCAATACTC TTTGGCATAG CTCCAAAAGG ACAAACACCT
AAGGTGCCAG CTTGGGTAGA GTTCAAAGCA AAAGGCGGTC TCCTTATACC AACCAACGAC
GTCCTCAATG CATTAGCAAA GGCAAAATAA
 
Protein sequence
MRGKVLGVFL TVLLLTLLLV PSTLGQVSLP RNETVYTVGA LWSATTYSLY APSSTYGTEH 
FLYMPLFIYS NMKDGWLPVL AESFTAVNKR TLRVKIRDIA KWSDGTPITA DDVVFTFNCT
KEVGFGPGNG WWDYIQTVRA VDNKTVEFVM RPDAQNYASF LGYAFTTRIV PKHVYEPLLK
QGVQAVKDFQ NNDPAKQVVS GPYKLYYTDP NIVVYGRIDD WWGKAVFGLP VPKYIANVIY
RDNAVANLDF EKGNADWAGV FIPDVSSLWT QKKLPIGTWF KNKPYYMPDG LDLLYISYYN
PLLKDPAVRK AIAYAIPYKE MLDKAYFGYG NQAHPSMVID DFEAYREYID QAYAKYVWGS
PDGKPKTDLK KANEILDKAG YKRGKDGIRI SPDGKRMGTF TIQVPNGWTD WMMMCEMMAA
NMREIGLDVK TEFPDFSVWW TRWTQGTFDF ILGWSAGPGF DHPWNVYRLV LDPALYKPFG
QDQYGNFERY NNPEVGKLLD KIAATLDPKV KKTLFYQLQR IIYRDLPAIP LFYGAHWYEY
NETVWTGWPN ESRPWWYPAA PWSNMALPIL FGIAPKGQTP KVPAWVEFKA KGGLLIPTND
VLNALAKAK
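
Both sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before they can be used in most tools. The following is a minimal Python sketch of that step; `join_blocks` is a hypothetical helper name, and only the first and last lines of the gene sequence are used here for brevity.

```python
def join_blocks(text: str) -> str:
    """Remove every space and line break from a blocked sequence."""
    return "".join(text.split())

# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGAGGGGTA AAGTTTTAGG GGTGTTTTTA ACAGTTTTAC TTTTGACCCT TCTACTTGTA"
last_line = "GTCCTCAATG CATTAGCAAA GGCAAAATAA"

first = join_blocks(first_line)
last = join_blocks(last_line)

print(len(first), len(last))  # characters per line once spaces are removed
print(first[:3], last[-3:])   # first and last codon of the gene
```

Applied to the full gene sequence, the same helper would give one continuous string whose length can be compared with the Gene Length field.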