Gene Daro_3989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3989 
Symbol 
ID3567278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp4285145 
End bp4286236 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content63% 
IMG OID637682462 
ProductNi-Fe hydrogenase, small subunit:twin-arginine translocation pathway signal 
Protein accessionYP_287186 
Protein GI71909599 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones60 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGAGA CCTTTTATGA AGTATTGCGC CGCCAGGGCA TCACCCGGCG CAGTTTCCTG 
AAATTCTGCA GCCTGACGGC GACCTCGCTC GGGCTGGGTT CTGCCGCTGC GCCGCGCATC
GCCCATGCGC TGGAAACCAA GGCCCGGACG CCGGTGATCT GGCTGCACGG CCTGGAATGT
ACCTGCTGCT CTGAATCCTT CATCCGCTCG GCGCATCCGC TGACCAAGGA TGTCGTGCTT
TCGATGCTCT CGCTCGATTA CGACGACACG CTAATGGCCG CTGCCGGTCA TCAGGCCGAG
GCGATCATTG AAGAGGTCAA GAAGAAGTAC AAAGGCAACT ACATCGTCGC CGTCGAGGGC
AATCCGCCCT TGAACGAAGA CGGCATGTTC TGCATCCACG GCGGCCGTCC TTTCGTCGAA
GTGCTGAAAG AAACCTGCGC CGATGCCAAG GCCATCATCA GTTGGGGTGC CTGCGCTTCC
TACGGTTGCG TGCAGGCTGC CAAGCCGAAT CCGACCCGCG CCACACCGGT ACACAAGGTC
ATTTCCGGCA AGCCGATCAT CAACGTGCCG GGCTGCCCGC CGATCGCCGA AGTCATGACC
GGCGTCGTCA CCTACATGCT GACCTTCGAC CGCATTCCCG AACTCGACCG TCAGGGTCGC
CCGAAGATGT TCTACGGCCA GCGCATCCAC GACAAGTGCT ATCGCCGTGG CCACTTCGAT
GCCGGCCAGT TCGCCGAGGC CTGGGACGAT GAAGGTTCGC GCAAGGGCTA CTGCCTGTAC
AAGATGGGCT GCAAGGGCCC GACCACCTAC AACGCCTGCT CGTCGATGCG CTGGAATGGC
GGCGTCAGCT GGCCGGTGCA ATCCGGCCAC GGCTGTATCG GCTGTTCCGA AGAGGGGTTC
TGGGACAAGG GCAGCTTCTA CGACCGCGTG ACCGACATCA AGGCCTTCGG CGTCGAAGCC
AATGCCGACA CCATCGGCAA GGCCGCGGCG GGCACCGTCG GCGCCGCAAT CGCCGCCCAT
GCCGCGGTAA CCGCTTTGGC CCGTGCCCGC CAGAAGGCCG GCGAGACTGA AGAACAGAAG
GGGGAAAAAT AA
 
Protein sequence
MTETFYEVLR RQGITRRSFL KFCSLTATSL GLGSAAAPRI AHALETKART PVIWLHGLEC 
TCCSESFIRS AHPLTKDVVL SMLSLDYDDT LMAAAGHQAE AIIEEVKKKY KGNYIVAVEG
NPPLNEDGMF CIHGGRPFVE VLKETCADAK AIISWGACAS YGCVQAAKPN PTRATPVHKV
ISGKPIINVP GCPPIAEVMT GVVTYMLTFD RIPELDRQGR PKMFYGQRIH DKCYRRGHFD
AGQFAEAWDD EGSRKGYCLY KMGCKGPTTY NACSSMRWNG GVSWPVQSGH GCIGCSEEGF
WDKGSFYDRV TDIKAFGVEA NADTIGKAAA GTVGAAIAAH AAVTALARAR QKAGETEEQK
GEK