Gene Daro_2134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2134 
Symbol 
ID3567550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp2303454 
End bp2304449 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content61% 
IMG OID637680605 
Productthiosulphate-binding protein 
Protein accessionYP_285345 
Protein GI71907758 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1613] ABC-type sulfate transport system, periplasmic component 
TIGRFAM ID[TIGR00971] sulfate/thiosulfate-binding protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value3.73195e-21 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000103932 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGCGCAAGA TTGGCCTTTT CGCTGCACTG ACTCTCGCTT TCGGCCTCGG TAGCGCCGCT 
GCCCAGACCA CGCTGCTCAA TGTCTCCTAC GACCCGACGC GTGAGTTGTA CAAGGACTTC
AACGCTGCCT TTGCCAAGCA ATGGCAGGCC AAGACCGGCC AGATCGTTAA CGTCCGCCAG
TCACACGGCG GTTCCGGCAA GCAGGCCCGT TCGGTGGCCG ACGGTCTGGA AGCCGATGTG
GTCACCCTGG CCCTTGGCTA CGACATCGAT GCCCTGGCCG AGCGGAAGCT GATCCCCGCC
GATTGGCAGA AGCGCTTCCC GAACAACTCC TCGCCCTATA CCTCGACCAT TGTCTTCCTG
GTCCGCAAGG GCAACCCGAA AGCGATCAAG GACTGGGGTG ATCTGGCCAA GCCAGGCGTC
GCGGTCATCA CACCGAACCC GAAGACTTCG GGCGGTGCCC GCTGGAACTA TCTGGCGGCC
TGGGCCTGGG CGTTGAAACA ACCCGGTGGC AATGAGCAAA AGGCCAAGGA TCTGGTCAGC
GCAATATTCA AGAACGTGCC GGTCCTCGAT TCCGGCGCCC GTGGTTCGAC CACCACCTTC
GTCGAGCGAG GCCTGGGTGA TGTGCTGATC GCCTGGGAGA ACGAAGCCAT TCTGGCGGTG
ACGGAACTGG GCAAGGACAA GTTCGAGATC GTCGCGCCGA GCCTGTCCAT CCTGGCCGAA
CCACCGGTCG CGGTCGTCGA CAAGGTCGTC GAGAAGCGCG GCACGCGGCT GACAGCGCAG
GCCTATCTCG ATTACCTGTA TTCCGAAGAA GGCCAGCAGA TCGCTGCCAA GCACTACTAC
CGGCCGAGCA ACGCCAAGGT GGCGGCCAAG TACGCGGCCA TTTTCCCGAA ACTGAAACTG
GTCACCATCA ACGACAGCTT CGGCGGTTGG CAGAAAGCGC AGAAAACGCA CTTCGCCGAT
GGTGGCACCT TCGACCAGAT CTATCTGAAG AAATAA
 
Protein sequence
MRKIGLFAAL TLAFGLGSAA AQTTLLNVSY DPTRELYKDF NAAFAKQWQA KTGQIVNVRQ 
SHGGSGKQAR SVADGLEADV VTLALGYDID ALAERKLIPA DWQKRFPNNS SPYTSTIVFL
VRKGNPKAIK DWGDLAKPGV AVITPNPKTS GGARWNYLAA WAWALKQPGG NEQKAKDLVS
AIFKNVPVLD SGARGSTTTF VERGLGDVLI AWENEAILAV TELGKDKFEI VAPSLSILAE
PPVAVVDKVV EKRGTRLTAQ AYLDYLYSEE GQQIAAKHYY RPSNAKVAAK YAAIFPKLKL
VTINDSFGGW QKAQKTHFAD GGTFDQIYLK K