Gene DET1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDET1041 
Symbol 
ID3229666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDehalococcoides ethenogenes 195 
KingdomBacteria 
Replicon accessionNC_002936 
Strand
Start bp946995 
End bp948110 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content47% 
IMG OID637120605 
ProductPQQ repeat-containing protein 
Protein accessionYP_181757 
Protein GI57234199 
COG category[S] Function unknown 
COG ID[COG1520] FOG: WD40-like repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.331576 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTTCAAAA CCAATCGCAT ACCCAAAAAA GTTATTTTGC TGTTTAGCCT TATCGGCATT 
CTAGTTGTTT TAAGCGGCTG TATAGGTTCC GGACAGAAAC CTTTAGGCTG GTCAGGCATA
CTTATCTCTG ATAATGATGC CATTTTTGGT TCTATGACTG GCAAACTTAT AGCTCTTAAC
AAGGATGCCA GCACTCCCCG GTATCAAATA TCACTTGAGA CCACTACCAA TGGCGGCGGC
TGTGCCGGTG CTGTTACTAC TGTAGGTATA TACGGTACAC CGGTAGTCTC TGACGGGGTT
ATTTACGTTT CAACCTACGG CGGCAAAGTT TATGCCTACA ATCAGGCTGA CGGCAGTCTG
AAATGGGTTT ACCCCGGAAC CGGGTCAGTA TCTGCCATTG TCAGCGGTAT TGCTCCTAAC
GGAGATAAAA TCTTTTTTGC GGATACTAAC GGGGTTGTTT ACGCCCTGAG CAAGGCAGAC
GGTCAAAAGG TTTGGGAATA TCAGACCGGT GCCAAGATTT GGGCTTCACC GGCTGTTGCC
GGAGATCTGG TAATAGTCCC CGGGTTTGAC AAGAAAGTAT ATGCCCTGGA TATCAACACC
GGCAGCCCTG TTTGGACTTT TGAAACCCAA AGCCCCTTTG CCAGCGCCCC GGTAGTTGAT
AACGGCGTTG TTTATGTGGG CTGTTTTGAC CGCAATCTGT ACGCCCTGGA TTTGGAAGAT
GGCAGCCAGA AATGGGTCTT TGAATCACCC AACTGGTTCT GGGCGACCCC GGTTGTGGCT
GACGGCAAAG TTTTTGCCCC CAATCTGGAC GGCAACACCT ATGTACTTGA TGCTTTGACT
GGTTCTCAGA TTAAAGTAAT TGATATGACT GACGAGGTGG CTTCTTCCCC GGCCCTGCTG
GGTGACAATG TGATTGTAGC CACTAAGACC GGCAAGATTT TTTCCATAAA TGTATCTTCA
ATGGAAATGA AGGCTGTTAC CGACCTTGCC TTGAAAGTGA TTGCACCGGT TACGGTGTCA
GGTGAAGTTG TATACATACA TACCCAGGAA GATGAAACTG TCTATGCTAT AAATCCGGAG
AGCCGGACTA TAATCTGGAC TTTTGTAGTA AGTTAG
 
Protein sequence
MFKTNRIPKK VILLFSLIGI LVVLSGCIGS GQKPLGWSGI LISDNDAIFG SMTGKLIALN 
KDASTPRYQI SLETTTNGGG CAGAVTTVGI YGTPVVSDGV IYVSTYGGKV YAYNQADGSL
KWVYPGTGSV SAIVSGIAPN GDKIFFADTN GVVYALSKAD GQKVWEYQTG AKIWASPAVA
GDLVIVPGFD KKVYALDINT GSPVWTFETQ SPFASAPVVD NGVVYVGCFD RNLYALDLED
GSQKWVFESP NWFWATPVVA DGKVFAPNLD GNTYVLDALT GSQIKVIDMT DEVASSPALL
GDNVIVATKT GKIFSINVSS MEMKAVTDLA LKVIAPVTVS GEVVYIHTQE DETVYAINPE
SRTIIWTFVV S