Gene DET1088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDET1088 
Symbol 
ID3229623 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDehalococcoides ethenogenes 195 
KingdomBacteria 
Replicon accessionNC_002936 
Strand
Start bp989902 
End bp991509 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content56% 
IMG OID637120652 
Productterminase, large subunit, putative 
Protein accessionYP_181803 
Protein GI57234156 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAATGC GAAAACTGAA AAACTACAAG CCGACCCGCT TCATGGCCGA GTCCTCTCAC 
TACAGCAAGC AGATGGCGGA TTTCGCTGTG ATGTTCATCG AGCAGCTCTG CCATACCAAA
GGCACATGGG CGGGAAAGCC CTTCGAGCTT ATCGACTGGC AGGAAAGAAT CATCCGCGAC
CTGTTCGGAA CGCTGAAACC AAACGGCTAC CGCCAGTTCA ACACGGCCTA CATCGAGATA
CCAAAGAAGA TGGGCAAGTC AGAGCTTGCC GCTGCGGTCG CCCTGCTTCT TTGCTGCGGC
GACGGAGAGG AACGCGCCGA GGTCTACGGC TGCGCTGCCG ACCGCCAGCA GGCCACCATC
GTTTTTGATG TGGCTGCGGA TATGGTCAGG ATGTGCCCTG CCTTAAACCG ACGCGTGAAG
ATACTGGCCT CCCAGAAGCG GATCATCTAC GAGCCGACGA ACAGCTTCTA TCAGGTGCTG
TCCGCCGAGG CCTATTCGAA GCACGGTTTC AATATCCACG GCGTGGTCTT TGATGAGCTG
CATACCCAGC CCAACCGAAA GCTCTTTGAT GTTATGACCA AGGGCTCCGG CGACGCCAGA
ATGCAGCCGC TCTACTTCCT GATCACGACT GCCGGAAACG ATACGAACAC CATCTGCTAC
GAAGTCCACC AGAAAGCGCA GGACATCCTT GACGGCAGAA AGGTTGATCC GACCTTCTAT
CCGGTCATCT ACGGTGCGGA CACTTCCGAG GACTGGACAG ACCCGGAGGT CTGGAAGAAG
GCAAATCCCT CGCTCGGTAT CACGGTCGGC ATCGACAAGG TGGAAGCCGC CTGCGAGTCG
GCAAAACAAA ATCCCGGCGA GGAGAACTCC TTTAGACAGC TCCGCTTAAA TCAATGGGTA
AAGCAGGCGA TTCGCTGGAT GCCAATGGAG AAATGGGACG CCTGTGCTTT CCCGGTAAAT
GAGGACGACC TCGAAGGCCG TGTCTGTTAC GGCGGCCTTG ACCTCTCCTC CACCACAGAT
ATCACTTCCT TTGTGCTGGT CTTCCCGCCA AGGGATGAGG ATGACAAGTA TGTGATCCTT
CCGTACTTCT GGGTGCCGGA GGATACGCTG GATCAGCGTG TCCGGCGTGA CCATGTGCCT
TACGACACTT GGGAAAAGGA AGGATACCTC GAAACCACGG AGGGCAACGT CATCCACTAC
GGCTACATCG AGAAATTCAT CGAGCGGCTG GGCGAGCGGT TCAACATCCG TGAGATTGCC
TTCGACCGCT GGGGAGCCGT CCAGATGGTA CAAAACCTTG AGAACATGGG CTTCACTGTC
GTTCCCTTCG GTCAGGGCTT CAAGGATATG AGCCCGCCCA CGAAAGAGCT GATGAAGCTG
ACACTGGAAA AGAAACTCGC CCACGGCGGC CACCCGGTGC TCCGCTGGAA TATGGACAAC
ATCTTCATCC GTACTGATCC TGCCGGAAAC ATCAAGGCCG ACAAGGAGAA GTCCACGGAG
AAGATCGACG GTGCCATCGC AACCATCATG GCACTTGACC GGGCGATCCG CTGCGGCAAC
GACAACGGTG CTTCTGTGTA TGACGGCAGA GGCATCCTTT TCATATAG
 
Protein sequence
MPMRKLKNYK PTRFMAESSH YSKQMADFAV MFIEQLCHTK GTWAGKPFEL IDWQERIIRD 
LFGTLKPNGY RQFNTAYIEI PKKMGKSELA AAVALLLCCG DGEERAEVYG CAADRQQATI
VFDVAADMVR MCPALNRRVK ILASQKRIIY EPTNSFYQVL SAEAYSKHGF NIHGVVFDEL
HTQPNRKLFD VMTKGSGDAR MQPLYFLITT AGNDTNTICY EVHQKAQDIL DGRKVDPTFY
PVIYGADTSE DWTDPEVWKK ANPSLGITVG IDKVEAACES AKQNPGEENS FRQLRLNQWV
KQAIRWMPME KWDACAFPVN EDDLEGRVCY GGLDLSSTTD ITSFVLVFPP RDEDDKYVIL
PYFWVPEDTL DQRVRRDHVP YDTWEKEGYL ETTEGNVIHY GYIEKFIERL GERFNIREIA
FDRWGAVQMV QNLENMGFTV VPFGQGFKDM SPPTKELMKL TLEKKLAHGG HPVLRWNMDN
IFIRTDPAGN IKADKEKSTE KIDGAIATIM ALDRAIRCGN DNGASVYDGR GILFI