Gene Daud_0833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0833 
Symbol 
ID6026881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp878741 
End bp879778 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content66% 
IMG OID641593643 
Productperiplasmic solute binding protein 
Protein accessionYP_001716980 
Protein GI169830998 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCAAG TATCTGCGTT ATCAGAGGGA GGGAGCATAG TGTTTTACCG GACAAGAAAG 
ACATTGGCGG GGTTGGTCTG CCTGCTGTTC GTTTTCGCGG CCGGTTGCGG GGGTCCCGCC
GAGACACCCC AATCACCGGG ACCCGGCAAA ACGGTGGTGA TCGCCACCAT TTTTCCGCTG
GCCGATATTG CCAAGAACAT CGGCGGCGAC CGGGTAGCGG TGTCGACGCT TTTGCCCCGT
GGGGCCAGTC CCCACACCTT TGAGCCGACA CCGCGGCAGA TGGAGCAGGT GACCGGCGCG
CGGGTGCTCA TCCAGGTGGG GGCCGGGCTC GACGACTGGG CCGGAAAGCT CGCCGGGGTG
GCCGGCAAGG ACCTCCTCCG GGTTGCAGTC ACCGAGGGGC TGTCCCTGCG CGCGGGCGGG
CACCAGCACG CGGAATGTGC GGGGCATGCC CATGAGCACG ACCATAACGC CTGCCCGGCG
GCGGAGACCG CTGCGGCAGA CCCGGCGTAC GGGTTCGCCG ACCCGCACGT GTGGCTTGAC
CCGGTGCTGG TCCGGGACGA GATCGCCCCC CGCATTTTCT CCGCCCTTTG CCAGGCCGCG
CCGGAGGATG CGGACTACTT CGCCGCCAAT CTGGAATCCT ACCAGGCTGA GCTGACCGTG
CTGCACGCGG ACCTGACCGC ATTGACCGCC GGATTTGAGC GCCGCAGTTT CATCGCCTAC
CATTCGGCCT GGGGATACTT CGCCGACCGG TACGGGCTGG TGGAAGCGGC TACGGTCGAG
GAAGCACCGG GCAAGGAGCC TTCTCCGGGC TGGATCATGA AGGTGGTGGA GACGGCCCGG
GCCCACCAGG CCGGGGCGAT TTTCGCCGAG CCCCAGTTCA GCACCAAGGC GGCCGAGGTG
ATCGCCGCCG AGTACGGCGC CCGGGTGTTG GTTTTGGACC CGCTCGGCGG GGAGGATATC
CCCGGTTATG ACAGCTACGT CAACCTCATG CGTTCGAATG CCGCCGTATT GGCCGAAGGT
CTTTCCCGGG TGGACTAA
 
Protein sequence
MDQVSALSEG GSIVFYRTRK TLAGLVCLLF VFAAGCGGPA ETPQSPGPGK TVVIATIFPL 
ADIAKNIGGD RVAVSTLLPR GASPHTFEPT PRQMEQVTGA RVLIQVGAGL DDWAGKLAGV
AGKDLLRVAV TEGLSLRAGG HQHAECAGHA HEHDHNACPA AETAAADPAY GFADPHVWLD
PVLVRDEIAP RIFSALCQAA PEDADYFAAN LESYQAELTV LHADLTALTA GFERRSFIAY
HSAWGYFADR YGLVEAATVE EAPGKEPSPG WIMKVVETAR AHQAGAIFAE PQFSTKAAEV
IAAEYGARVL VLDPLGGEDI PGYDSYVNLM RSNAAVLAEG LSRVD