Gene Dhaf_0586 details

Gene Information

Locus tag: Dhaf_0586
Symbol:
ID: 7257553
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 647625
End bp: 649445
Gene Length: 1821 bp
Protein Length: 606 aa
Translation table: 11
GC content: 55%
IMG OID: 643560504
Product: Collagen triple helix repeat protein
Protein accession: YP_002457088
Protein GI: 219666653
COG category:
COG ID:
TIGRFAM ID:
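
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both are assumptions about how this record counts, not statements taken from the page. The variable names are illustrative.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 647625
end_bp = 649445

gene_length = end_bp - start_bp + 1   # bases spanned by the CDS
codons = gene_length // 3             # codon count, stop codon included
protein_length = codons - 1           # assumption: the stop codon is not counted

assert gene_length % 3 == 0           # a complete CDS is a whole number of codons
print(gene_length, protein_length)    # 1821 606
```

Under these assumptions the computed values match the Gene Length (1821 bp) and Protein Length (606 aa) fields.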


Plasmid Coverage information

Num covering plasmid clones: 55
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCTAT TAACCACGGG ATTGATTGAA AACTCACCCG TCTCCGGAAT AAGAGCAACT 
ACTGTGCTTT CTGTCAGAGT AGTGAATGAT GATTCCGCAT CTGCTTCTGT GCAAATATCT
GGATCTTATG TATCTGGAAC CACTACTACG GCGTATGTTT TGGAGGTATT CATTCTGGAA
CCTGGAGAGG TGGCAACACG AAGCTACTCA GCCAATTTTG ATGCGTTTCA GTTTCAATTT
ACCACCAGTT TGGAAAATGT TGAGATTTCG GCTTGGGGAA AAGATAACGA GGGAAATTTG
ATCGCAGCTC ATCGTGTGCT GCCTGCAGAG CTTGATCCCT TAGATCAGAA TGGAGGGGCA
GGGCCAACAG GAGCCACGGG TGCGACAGGT CCAGCAGGGC CAACAGGAGC CACGGGTGCG
ACAGGTCCAG CAGGAGAAAC AGGAGCCACG GGTGCGACAG GTCCAGCAGG AGAAACAGGA
GCCACGGGTG CGACAGGTCC AGCAGGAGAA ACAGGAGCCA CCGGTGCGAC AGGCCCGGCA
GGAGAAACAG GAGCGACCGG TGCCACAGGT CCAGCGGGAG AAACGGGAGT AACCGGCGCG
ACAGGCCCGG CAGGAGAAAC AGGAGCGACC GGTGCCACAG GTCCAGCAGG AGAAACAGGA
GCAACAGGCG CGACTGGCCC CGCCGGAGAA ACAGGAGCGA CCGGTGCCAC AGGTCCAGCG
GGAGAAACGG GAGTAACCGG CGCGACAGGC CCGGCAGGAG AAACAGGAGC GACCGGTGCG
ACGGGTCCAG CAGGAGAAAC AGGAGCGACC GGTGCGACGG GTCCAGCAGG AGAAACAGGA
GCGACCGGTG CGACAGGTCC AGCAGGAGAA ACAGGAGCGA CCGGTGCGAC AGGTCCAGCA
GGAGAAACAG GAGCAACAGG CGCGACTGGC CCCGCCGGAG AACCTGGCGG TCCTACCGGT
CCTACTGGCG CAACGGGTGC AACTGGACCT GCGGGAGAAT CCGGTGGTCC TACGGGGCCG
ACTGGCGAAA CCGGGCCGAC AGGTGCAACA GGCGAAACTG GCCCTACCGG ACCGACCGGA
GCAACCGGTG AAACCGGTGC TACCGGCACA TTCGAGCCTA ATCCATTTGC AGTCTATGTG
CAAGCTGGGG CAGTCGGTGG GGACGGCACA CAGGCCAGTC CATTTGAAAC GATACAACAG
GGCGTCACAG CAGTATCACC GACAGGAACC GTTCATATTC TGGGCGGGAC TTATCCGATT
ACAGCGACAA TATCGGTGAA TAAAGCAGGA GTAACCCTGA AAGGGTACCC TAATACGCTG
ATTGAACTTC AGGCCGCGGT AATTCCTTTC GCTGTCACGG GTAGTGGAGT GACCATAGAT
GGCTTGACGA TCACCAGCGA TAATCCGTAT GCGGTTCCAT TTATTCAGCT TGGAGGAAGC
AATCATAAGC TTATAAACAA CTATTTTTAT GGACCGCCTC AAGCGGGGCC GTCAGATACC
TGGGTTGTAA ATCGCGGATT CGTGACGCAA GCAAATAATA TGCAGAATTT GACGGTTCAA
AATAATATTT TCTATTTTCT GAGACAGCCG GCTTATTTGA ATCCCAATAC GACAGGGTAT
ATCATCGATA ATGTGGTGTA TAATACCCGT GGATTTGTTG TGGACAGAGC AGTTGTCGTT
TTATCGGGTA ACTCATGGGG CAGCCCGGAA AATGCAGTGG ATATAGCCTT GCTGGTTGGT
ACGATCGCCG GTCCTCCTTA TGATCCCTTA GCTGACCTGG CTGCGAACAA CAGCGATGCC
AGTATCAGTG ACCAAAGATA G
 
Protein sequence
MALLTTGLIE NSPVSGIRAT TVLSVRVVND DSASASVQIS GSYVSGTTTT AYVLEVFILE 
PGEVATRSYS ANFDAFQFQF TTSLENVEIS AWGKDNEGNL IAAHRVLPAE LDPLDQNGGA
GPTGATGATG PAGPTGATGA TGPAGETGAT GATGPAGETG ATGATGPAGE TGATGATGPA
GETGATGATG PAGETGVTGA TGPAGETGAT GATGPAGETG ATGATGPAGE TGATGATGPA
GETGVTGATG PAGETGATGA TGPAGETGAT GATGPAGETG ATGATGPAGE TGATGATGPA
GETGATGATG PAGEPGGPTG PTGATGATGP AGESGGPTGP TGETGPTGAT GETGPTGPTG
ATGETGATGT FEPNPFAVYV QAGAVGGDGT QASPFETIQQ GVTAVSPTGT VHILGGTYPI
TATISVNKAG VTLKGYPNTL IELQAAVIPF AVTGSGVTID GLTITSDNPY AVPFIQLGGS
NHKLINNYFY GPPQAGPSDT WVVNRGFVTQ ANNMQNLTVQ NNIFYFLRQP AYLNPNTTGY
IIDNVVYNTR GFVVDRAVVV LSGNSWGSPE NAVDIALLVG TIAGPPYDPL ADLAANNSDA
SISDQR
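
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It is not the database's own tooling. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons. Table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG. The sketch assumes the input contains only A, C, G and T; an ambiguous base would raise a `KeyError`.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last 30 and 21 bases of the gene sequence, copied from the record.
first_codons = "ATGGCGCTAT TAACCACGGG ATTGATTGAA"
last_codons = "AGTATCAGTG ACCAAAGATA G"
print(translate(first_codons))  # MALLTTGLIE
print(translate(last_codons))   # SISDQR
```

The two printed fragments match the start and end of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.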