Gene Dhaf_1007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_1007 
Symbol 
ID7257975 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp1094502 
End bp1095839 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content51% 
IMG OID643560921 
ProductCollagen triple helix repeat protein 
Protein accessionYP_002457503 
Protein GI219667068 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGAGT TAACCACTGG TTTGATAGAT AATTTTCCGG TTGATGGAGC GCGGCCCTCG 
GTATCGGTAG CCCTTAGAAT TACCAATGAT GGTGATTCTA CGGAGACGGT TACCATAACC
GGCTACTATC TCAATGGGAC TATCAAAGAA GTATATGTTT TAGAACTAGT TACTGTTAAT
CCAAATGAGG TAATAATCCG GGAGTATTAT GCTAATTTGG ATGCTTTTGA ATTTGTATTC
TCGACAAGTT CTGAAACGGT AGTTATATCG GTATGGGGTA AGGACGCTGA AGGGAATTTA
GTAGATGCCC ATCGGGTGCT CCCTGCTGAG TTGGATTCAC TGGAACCGAT TATAGTGCCG
ACGGGGCCCA CAGGAGAGAC CGGAGCCACA GGAGCAACCG GGGCAACAGG GCCCACAGGA
GCAACCGGAA CAACGGGAGC TACAGGAGAG GCCGGAGCAA CGGGAGCCAC AGGGGAGACC
GGGGCAACGG GAGCCACAGG GGAGACCGGG GCCACAGGGG CCACAGGAGA GACCGGGGCC
ACAGGGGCCA CAGGAGAGAC CGGAGCAACG GGGGCTACAG GAGAGACTGG TGCAACGGGA
GCTACAGGAG AGACTGGTGC AACGGGAGCT ACAGGAGAGA CTGGTGCAAC AGGGCCCGCA
GGAGAGACCG GGGCCACAGG GGCTACAGGA GAGACTGGTG CAACGGGGGC CACAGGAGAA
ATCGGAGCGA CAGGGGCTAC AGGAGAGACT GGAGCGACAG GGGCTACAGG AGAGACTGGA
GCGACAGGGG CTACAGGAGA GACTGGGGCG ACAGGGGCCA CAGGAGAAAC TGGAGCGACA
GGGGCTACAG GAGAGACCGG AGCAACGGGA GCCACAGGAG AGACCGGGGA GACTGGTCCG
ACTGGTCCTA CCGGAGAAGT CGTTTTGGCT TTTGGATCTT TAAGAGGAAG TAGTGCAGAG
GCACCTGGGG CAACATTCAC ACCCGTACCG TTTAGTATAG TTGGACCTTT ATCAGATACT
ATCACAGTTA GTCTATCGGG CAATGAATTA GTGGTAGGGG AAAGCGGAAT TTATCAAATA
ACAATATCTA TTAACGCTCA AGCCACTACT GATCCAGATC CTGATGACCC ATATCTGGAG
GCTATTATCA CTGTCAATGG TTCGCCAATT TTTGGCGATA CAACCACTTT CTTTAAAATA
TTTAATAGAA GTAGTTCAAC GTTTGTAGTT CAAGCATCTT TAACAGCAGG AGATGAAGTA
GGAGTGAGTG CTAGTACGGA TTTCCCTATT TTAGGTTATA TAAATCGCTC CTTAACTGTT
GTTCAATTAA GTAATTAA
 
Protein sequence
MAELTTGLID NFPVDGARPS VSVALRITND GDSTETVTIT GYYLNGTIKE VYVLELVTVN 
PNEVIIREYY ANLDAFEFVF STSSETVVIS VWGKDAEGNL VDAHRVLPAE LDSLEPIIVP
TGPTGETGAT GATGATGPTG ATGTTGATGE AGATGATGET GATGATGETG ATGATGETGA
TGATGETGAT GATGETGATG ATGETGATGA TGETGATGPA GETGATGATG ETGATGATGE
IGATGATGET GATGATGETG ATGATGETGA TGATGETGAT GATGETGATG ATGETGETGP
TGPTGEVVLA FGSLRGSSAE APGATFTPVP FSIVGPLSDT ITVSLSGNEL VVGESGIYQI
TISINAQATT DPDPDDPYLE AIITVNGSPI FGDTTTFFKI FNRSSSTFVV QASLTAGDEV
GVSASTDFPI LGYINRSLTV VQLSN