Gene DET1087 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDET1087 
Symbol 
ID3229624 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDehalococcoides ethenogenes 195 
KingdomBacteria 
Replicon accessionNC_002936 
Strand
Start bp988488 
End bp989822 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content56% 
IMG OID637120651 
ProductHK97 family portal protein , putative 
Protein accessionYP_181802 
Protein GI57234157 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTATAT TTTCAGGATT ATTTCGGAGC CGGGATAAGC CCAAGGACGC GACCAGCGGA 
AGCTCCTACC GCTTCTTCTT CGGCGGCACG ACCTCCGGCA AAGCTGTAAC GGAACGCTCC
GCCATGCAGA TGACGGCGGT CTACTCCTGC GTTCGGATTC TATCCGAGGC GATTGCTGGC
CTGCCCGTTC ACCTGTACCG GTACGACGGC AGCGGCGGCA AGGAAAAAGC GACCACTCAT
CCGCTCTACT TCCTATTGCA TGATGAGCCA AACCCGGAAA TGACATCCTT TGTCTTTCGG
GAAACGCTGA TGACGCACCT TTTGCTGTGG GGAAACGCCT ACGCGCAGAT CATCCGAAAT
GGCAAGGGCG AGGTCGTGGC TCTCTATCCG CTTATGCCAA ACCGCATGAC GGTTGACCGC
GACGCAGACG GTCACCTCTA CTACGAATAT CAGACCTCGC AGGATGAGGC GCACACGATG
GATGGCAGCC GCGTCAGGCT CTCTCCAAGC GATGTGCTCC ATATTCCCGG CCTTGGCTTT
GACGGCCTGA TGGGCTACAG CCCGATTGCG ATGGCAAAGA ACGCTATCGG CATGGCGATT
GCCTGTGAGG AATACGGAGC TAAGTTCTTC GCTAACGGCG CGACGCCCGG CGGCATCTTG
GAGCATCCCG GTGTGATAAA AGACCCGGAG CGTGTCAGGG AAAGCTGGAA CTCAGCCTTC
GGCGGCAGCG CCAATGCAAA CAAGGTGGCG GTTCTTGAGG AGGGCATGAA ATACACGCCC
ATCTCCATTT CACCGGAGCA GGCGCAGTTC TTGGAGACGC GGAAGTTCCA GATCAATGAG
ATCGCTCGTA TCTTCCGCAT CCCGCCTCAT ATGATCGGCG ACCTTGAGAA ATCGAGCTTT
TCCAACATCG AGCAGCAGTC GCTGGAGTTC GTGAAATACA CGCTCGACCC GTGGGTCTGC
CGCTGGGAAC AGTCCATGCA GCGGGCGCTT TTGTCTATGG ACGAGAAGAA GGAATACTTC
TTCAAGTTCA ATGTGGACGG CCTGCTTCGC GGAGATTACC AGAGCCGCAT GAACGGCTAT
GCGACCGGAC GCCAGAACGG CTGGATGAGC GCTAACGATA TCAGGGAGCT GGAAAATCTC
GACCGTATCC CGGAGGAGGA AGGCGGCGAC CTGTATCTTA TAAACGGCAA CATGACCAAG
CTCAAGGACG CAGGCATTTT TGCAGCCTCG TCTCAGGGAC AGGAGGAGCC AGATGAAACA
GAAGAATCAA AACAAGAGCC GGAACAGCCA CAGCAAAGTG AGCGCACCCG GCCACGAAAG
AAGGAGGCAC TATGA
 
Protein sequence
MSIFSGLFRS RDKPKDATSG SSYRFFFGGT TSGKAVTERS AMQMTAVYSC VRILSEAIAG 
LPVHLYRYDG SGGKEKATTH PLYFLLHDEP NPEMTSFVFR ETLMTHLLLW GNAYAQIIRN
GKGEVVALYP LMPNRMTVDR DADGHLYYEY QTSQDEAHTM DGSRVRLSPS DVLHIPGLGF
DGLMGYSPIA MAKNAIGMAI ACEEYGAKFF ANGATPGGIL EHPGVIKDPE RVRESWNSAF
GGSANANKVA VLEEGMKYTP ISISPEQAQF LETRKFQINE IARIFRIPPH MIGDLEKSSF
SNIEQQSLEF VKYTLDPWVC RWEQSMQRAL LSMDEKKEYF FKFNVDGLLR GDYQSRMNGY
ATGRQNGWMS ANDIRELENL DRIPEEEGGD LYLINGNMTK LKDAGIFAAS SQGQEEPDET
EESKQEPEQP QQSERTRPRK KEAL