Gene Dhaf_1008 details

Gene Information

Locus tag: Dhaf_1008
Symbol:
ID: 7257976
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Desulfitobacterium hafniense DCB-2
Kingdom: Bacteria
Replicon accession: NC_011830
Strand:
Start bp: 1096123
End bp: 1097265
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 52%
IMG OID: 643560922
Product: Collagen triple helix repeat protein
Protein accession: YP_002457504
Protein GI: 219667069
COG category:
COG ID:
TIGRFAM ID:
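
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1096123, 1097265)
print(length)                  # 1143
print(protein_length(length))  # 380
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.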


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAGAGT TAACCACCGG TTTGATAGAT AATTTTCCGG TTAATGGGGC ACGCCCCTCA 
GTATCGGTAG CCCTTAGAAT TACCAACGAT GGTGATTCCA CGGAGACGGT TAGAATAACC
GGCTACTATC CCAATGGGTC TTTCAAAGAA GTATATGTTT TAGAAATAGT TAATGTTAAT
CCAAATGAGG TGATAACTCG AGAGTATTTC GCTGATCTGG ATGCTTTTGA ATTTGTATTC
TCGACAAGTT CTGAAACGGT AGCTATATCG GTATGGGGTA AAGACGCTGA AGGGAACTTA
GTGGATGCCC ATCGGGTGCT CCCTGCTGAA TTGGATTCAC TGGAACCAAT AATAGGACCC
ACAGGGGAGA CCGGAGCCAC GGGAGCTACA GGAGAGACCG GGGCAACGGG GCCCACAGGA
GAGACCGGCG CAACAGGGGC CACAGGGGAA ACTGGTGCAA CAGGAGCTAC AGGAGAGACC
GGCGCAACAG GGGCCACAGG GGAAACTGGT GCAACAGGAG CCACAGGAGA GACCGGCGCA
ACAGGGGCTA CAGGGGAAAC TGGTGCAACA GGAGCCACAG GGGAGACCGG CGCAACAGGG
GCCACAGGGG AAACTGGTGC AACAGGAGCC ACAGGAGAGA CCGGCGCAAC AGGGGCTACA
GGAGAGACCG GCCCGACCGG ACCTACAGGT TCCACAGGAC CAACCGGTGG AGCTGGATCA
TTATCTGGGC TCCAGGTTCA GTTGCAGGGA AGCAGTGGAG GTACGGTCGC CAATAATGCC
AATGTCCTGT TTGACACTAC AATCAACGCT CCTTCCGCAA ACATCACTTA TAATGCCGGA
ACTGGAACCT TTTTTATCAA TCAGCCGGGA AATTACTATA TTTCCTGGTG GGTTAACACA
GATGGGGCCG AAGCAGAGCC TACGGTGTCT TTTGGCATCC GGGTTATTAG CGGCGGTTCG
CAGACCATTT TATCGTCTTC CCCTTCGCCG ATGGTGACAT TACAGTTAAA TGGAAATGCT
TTGCTTACGG TGACAACGAC TCCGCTGGTC TTTAACCTGT TTAATAACAG CGGCGCGACG
GTCTCCTATG GCACGTCGGC TGTCCAGGCA GACTTGACTA TTGTTGAAGT AGCATCACTG
TAA
 
Protein sequence
MAELTTGLID NFPVNGARPS VSVALRITND GDSTETVRIT GYYPNGSFKE VYVLEIVNVN 
PNEVITREYF ADLDAFEFVF STSSETVAIS VWGKDAEGNL VDAHRVLPAE LDSLEPIIGP
TGETGATGAT GETGATGPTG ETGATGATGE TGATGATGET GATGATGETG ATGATGETGA
TGATGETGAT GATGETGATG ATGETGATGA TGETGATGAT GETGPTGPTG STGPTGGAGS
LSGLQVQLQG SSGGTVANNA NVLFDTTINA PSANITYNAG TGTFFINQPG NYYISWWVNT
DGAEAEPTVS FGIRVISGGS QTILSSSPSP MVTLQLNGNA LLTVTTTPLV FNLFNNSGAT
VSYGTSAVQA DLTIVEVASL
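
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be stripped before they can be used. The following is a minimal sketch of how the listed sequence could be cleaned, translated, and summarised. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted); the helper names `clean`, `gc_percent`, and `translate` are hypothetical and not taken from the source.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order.
# Assumption: table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1; stop codons are shown as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence as printed in the record.
first_line = "ATGGCAGAGT TAACCACCGG TTTGATAGAT AATTTTCCGG TTAATGGGGC ACGCCCCTCA"
print(translate(first_line))  # MAELTTGLIDNFPVNGARPS
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.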