Gene Dtur_0603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtur_0603 
Symbol 
ID7081643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDictyoglomus turgidum DSM 6724 
KingdomBacteria 
Replicon accessionNC_011661 
Strand
Start bp601667 
End bp604741 
Gene Length3075 bp 
Protein Length1024 aa 
Translation table11 
GC content37% 
IMG OID643457679 
ProductM6 family metalloprotease domain protein 
Protein accessionYP_002352505 
Protein GI217966999 
COG category[S] Function unknown 
COG ID[COG4412] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03296] M6 family metalloprotease domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.460158 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTAAAA AACTTGTACT AACCCTTAGT CTTTTTATCC TTTTAGTCCT TATAATAAAC 
ATTTCTTTTG CCCAAGTCTT CCCATTAACC AAAGTACATA TAATGCCACC CCACGAAAAG
CTTATTGAAA AAAAGGTAAA ACTTCCAAAA CTCCCTGAAA TTACTCCTCA AAATATACCC
CAAGGAACAA AAATCTATGG AACTATTGAA AAAGCCCAAG GTATAGGTAA AGCAATTGTA
ATACCTGTAG AATTTACCGA TAAACCAAGA CAGCCCGAAG ATATTATACC TTCAAACTAT
TTTGACATAC TTTTTAATAG TGTAAAGGCA GATTGGAGCA ATATAAACCC ATACAACGTA
GGTAGTGTAA GAGAATTTTA TTTAGAAAAT TCCTATGGAC AATTTGATAT AACTGCAACA
GTACTTCCCT GGTATACTGC CCAAAATACC TACTCCTACT ATATAAACGA TGGTAATAAT
GGATTTAATG GAGGGGTATT TGTATTAGTA AAAGAAGTAC TACAGCATGC AGTAGATATT
GGATATGATC TAAGAAATTA TGATGTAGTT TTTGTAATTC ATAGTGGACA AGGTGCCGAG
TGGACAGGAT ATCCAAATGA TATTTGGTCC CATGCCTCTA CAGTATATGT TAATATTGGT
GGAAAAAATG TGCCTATAAG ATATTCCATT GAACCAGAAT ATATGGAAGA TTATGATACT
CAAGGAAATC CTGTAATAAT GCCTCAAACT GTAGGTGTAT TTGTGCATGA GATGGGACAT
TCTTTTGGAC ACCTTCCAGA TCTTTATGAT AGAGACTATT CATCTTTGGG GCTTGGTAGG
TGGAGCTTAA TGGCAGCAGG ATCTTGGAAT GGGCCTCAAG GTCCAGGTGG ATATTCCATA
GGAGGAGGAC CATCTCATTT TGATGCTTGG AGTAAGATTC AACTTGGATG GATAACCCCA
ACTGTTCCTA CCAATGATCT AACTAATGTA AATATTCCAC CGGTAGAAAC AAATCCTGTA
GTATATAAAC TTTGGACCGA TGGGGCAGAA GGTCCTCAAT ATTTTTTAAT TGAAAATAGA
CAACCCATTG GATTTGATAA GTATTTAAGA GGTTTTGGCT TGCTTATTTA TCATGTAGAT
GAAAATATGA GAAACTTTCA AAACGATAAT GAATGGTATC CAGGACTTGA TCCATCAAGG
CACTATTTGG TAGCATTAGA GCAAGCCGAT GGAAAATGGG ATTTAGAAAA AAGAAGAAAT
AGTGGAGATG CGGGAGATCC ATATCCGGGA AGTACTAATA ATACAACCTT TGATGAAAAT
AGCACTCCCA ATAGCAATGC TTATGGAGAA ATTCCAACAG GAGTCGCAGT CAAGAATATA
ACCGCCTCAG GAGAAAATAT TATATGTGAT ATTTATGTAA AATCTCTCAG TGCACCACAA
ACCACATCCT TTGTAAAGCA TTTGGCATGG ACAGGAGAAA ACCCTCTCAC TATAAGACCT
CTCTTCAAAT GGAAGCCTAT ACCTTACGCA GTGAATTACA CTTTGCAGGT TGCAACAGAT
AGCAACTTCA ATAGTATCAT AATAAATGTA AGCACAGAAA AGAACGAATA TAGACCTACC
CTTGAGGAAT TCCTTGAACC TGGCAAAACT TACTTTGCAA GGGTTAGAGC AGAAAATGCA
AGTGGTGTAT CTCCATGGAC TACCATATCT TTTACCACAC CAAATACCTT TGAAGCTCTT
CTTGTCTCCG ATGATGGTGG AGAGTTTGGA ATAGCCCCGT ACTTTGAAAA AGCTCTACAA
GATATAAATG TATCTTACTT TATCGTGGAT GTATTTTATG ACAATGCAGT TCCACCTGCT
TCTTTTATGA GCAATTTTGA TTGGGTGATT TGGGGCGGAG ATTGGGGAGC AATCTATGAT
CCATCAGTTC AAAATGAGAT CATGAATTAT CTTGATAATG GTGGAAAACT CTTCATCTCA
AGCCAGGATT TAGGCTGGGG ATATTCAGCA GGGTATATAA GTAGTACTTT CTATAACACT
TACTTAAGAG CTGAGTTTGT ACAGGATGAT GTAGGAATCT ATTCCATAAA AGGTGCAAAC
GGAAGTGAAT TTGAGAGTTT GTCCCTCTCG TTAAATACCG AGGATTCTGC TCAAAACCAA
GGATATCCTG ACGAAATAGA TCCCTTAGAG GGAGCAAGAG CAATCTTAGT ATATACTCCA
TCTGGAATTT CCCCCATAAC TCCAAATATT AAGCTTCCCG AAAAGATAAA AGAGCAAAAA
TCCATAATTA TAAATAAAGA GATTGCATCC TCTGGGACAG CTGGAATATT CTATGCAGAT
CCTACAAAAC ACTATGGAGT AGTATATTTT GCCTTTGGGC TTGAAGGACT ATCTCCCAAT
ATAAGTGGAG AGGTTTTAAG CAGAGTAAGA TATGCACTAC TTTCATCTCC AGAAATTACC
ACTACAGTAA CCCAAAGCTT CAACCCTAAC AACAATGAAA ATTGTAGTAT TAAGTTAACA
GTAAAAGACA ATATAGGCTA TTCATATCTC ACAGTAAAAA TCTACAGCAC TTTGAATAAT
CAAAAGAAAG ATCTTGTGAA GACTCTTGCA GATAATACAA AAGTTGACAA TGGTACATAT
GAACTTACCT GGGATGGAAA GGACGAAAAT GGAAAAGTCA ATCCAGGAAG ATACATGGTG
GAAGTCTTTG CAAAAGATGA ACTAAATAAC AGCATTACAA AAACATATTA CACCACAATT
CCCTATGGGG TTCCATTATC TTTCATAGAT GTGACTAAAT TCTCAAAAGC CTTTAATCCT
AACAAAGAAT TTGCACAAAT TATGTTTACT TTAACCCAAG ATGCCCATGT AAAATTCATA
GTATACAGCC TTGCTGGAGT AAAACTCTAT GAAAGAGATC TTGGATATCT TCCAGCAGGA
GAATACAGTA TAGTATGGGA AGGAGTCAAT CTAAAGGGCG AAATTCTGAA AAATGGGCTT
TATGTATTCC AGCTTGTAGC AACATCCTCT CAAGGCGAAG CAAGAATAAA TAGATTCATA
GGAATTTTGA AGTGA
 
Protein sequence
MSKKLVLTLS LFILLVLIIN ISFAQVFPLT KVHIMPPHEK LIEKKVKLPK LPEITPQNIP 
QGTKIYGTIE KAQGIGKAIV IPVEFTDKPR QPEDIIPSNY FDILFNSVKA DWSNINPYNV
GSVREFYLEN SYGQFDITAT VLPWYTAQNT YSYYINDGNN GFNGGVFVLV KEVLQHAVDI
GYDLRNYDVV FVIHSGQGAE WTGYPNDIWS HASTVYVNIG GKNVPIRYSI EPEYMEDYDT
QGNPVIMPQT VGVFVHEMGH SFGHLPDLYD RDYSSLGLGR WSLMAAGSWN GPQGPGGYSI
GGGPSHFDAW SKIQLGWITP TVPTNDLTNV NIPPVETNPV VYKLWTDGAE GPQYFLIENR
QPIGFDKYLR GFGLLIYHVD ENMRNFQNDN EWYPGLDPSR HYLVALEQAD GKWDLEKRRN
SGDAGDPYPG STNNTTFDEN STPNSNAYGE IPTGVAVKNI TASGENIICD IYVKSLSAPQ
TTSFVKHLAW TGENPLTIRP LFKWKPIPYA VNYTLQVATD SNFNSIIINV STEKNEYRPT
LEEFLEPGKT YFARVRAENA SGVSPWTTIS FTTPNTFEAL LVSDDGGEFG IAPYFEKALQ
DINVSYFIVD VFYDNAVPPA SFMSNFDWVI WGGDWGAIYD PSVQNEIMNY LDNGGKLFIS
SQDLGWGYSA GYISSTFYNT YLRAEFVQDD VGIYSIKGAN GSEFESLSLS LNTEDSAQNQ
GYPDEIDPLE GARAILVYTP SGISPITPNI KLPEKIKEQK SIIINKEIAS SGTAGIFYAD
PTKHYGVVYF AFGLEGLSPN ISGEVLSRVR YALLSSPEIT TTVTQSFNPN NNENCSIKLT
VKDNIGYSYL TVKIYSTLNN QKKDLVKTLA DNTKVDNGTY ELTWDGKDEN GKVNPGRYMV
EVFAKDELNN SITKTYYTTI PYGVPLSFID VTKFSKAFNP NKEFAQIMFT LTQDAHVKFI
VYSLAGVKLY ERDLGYLPAG EYSIVWEGVN LKGEILKNGL YVFQLVATSS QGEARINRFI
GILK