Gene Dhaf_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDhaf_1038 
Symbol 
ID7258006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfitobacterium hafniense DCB-2 
KingdomBacteria 
Replicon accessionNC_011830 
Strand
Start bp1134236 
End bp1135384 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content51% 
IMG OID643560952 
Productcysteine desulfurase family protein 
Protein accessionYP_002457534 
Protein GI219667099 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01977] cysteine desulfurase family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.000028217 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTACT TTGATAATGC GGCAACAACT TGGCCCAAAC CGGAATGCGT TTATGAGGCA 
GTGGATCAAT GCTTGCGCAA CAAAGGAGCG AATCCCAGCC GCTCCGGTCA TTTTATGGCC
TTGCTTGCCG GGCAGATTGT CCTCAATGCC CGAGTGCAAA TCGCCGAGTT TTTTAATATC
TCTGACCCCT TGCAAGTGGT GTTTACTCCC AATGCCACCG AAGCTCTTAA TATAGGCCTG
AAAGGGCTAT TAAAACCGGG AGACCATGTG CTTACCAGCT CTCTTGAACA TAATGCTGTA
ACCCGCCCCC TGGAAAAGCT GCGCAGCCAG GGTGTAGAGG TAACCAAGCT GCCGACATCT
GTGCAGGAGG GATTGTATCC TGAGCAGGTG GCAGCAGCTA TTCAGAACAA TACCAAGCTC
ATCGTCCTCA GCCATGCTTC CAATGTGATG GGGTTAATTC AGCCTATTGG TGAGATCGGC
AGAATTGCCG GGGAAAAGGG TGTCCTTTTT ATGGTGGATT CTGCCCAGAC TGCAGGTTCC
ATGCCCATCG ATGTTCAGGC TATGGGCATC GACCTTTTGG TATTTGCCGG GCATAAAGGG
TTATTGGGGC CTCAAGGGAC AGGCGGCTTG TATCTTCGTG AAGATTTGCG TCTCGATACC
CTGAAGGAAG GAGGAACCGG AGCGAATTCA GAGGAACCCT TCCAGCCGGA AGAGAGCCCG
GAGCGCTATG AGAGCGGAAC CCTCAATACA CCGGGAATTG CAGGTCTGGG AGCAGGTATA
GAATTCATCA AGCAGGAAGG AATAGAGAAG ATACGGGAAA AGGAAAGAAC CCTCACCCGC
CAGTTGATGC TGGGCTTAAG CGCAATACCC GGCGTTATTC TTTATGGCCC TGACCCTTCT
GTAGAAAGGG CGCCTGTCGT GTCTATTAAC CTGGAGGGAA GGGAACCTTC GGAAGTTTCC
TATCTCTTGG ATAAGCTTTA TGGAATCGCG TCAAGACCGG GCTTGCATTG TGCCCCCGAT
GCCCACAAAA CCCTTGGTAC CTTCCAACAA GGAACAGTTC GTTTAAGCTT AGGGTACTTT
AATACCAGCC AAGAGGTGGA GGAGTGTCTG GATGCGGTTG CCGGACTCAG TTCCCCGAAC
AAGAAATAA
 
Protein sequence
MIYFDNAATT WPKPECVYEA VDQCLRNKGA NPSRSGHFMA LLAGQIVLNA RVQIAEFFNI 
SDPLQVVFTP NATEALNIGL KGLLKPGDHV LTSSLEHNAV TRPLEKLRSQ GVEVTKLPTS
VQEGLYPEQV AAAIQNNTKL IVLSHASNVM GLIQPIGEIG RIAGEKGVLF MVDSAQTAGS
MPIDVQAMGI DLLVFAGHKG LLGPQGTGGL YLREDLRLDT LKEGGTGANS EEPFQPEESP
ERYESGTLNT PGIAGLGAGI EFIKQEGIEK IREKERTLTR QLMLGLSAIP GVILYGPDPS
VERAPVVSIN LEGREPSEVS YLLDKLYGIA SRPGLHCAPD AHKTLGTFQQ GTVRLSLGYF
NTSQEVEECL DAVAGLSSPN KK