Gene Information |
Locus tag | Daud_0833 |
Symbol | |
ID | 6026881 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | + |
Start bp | 878741 |
End bp | 879778 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641593643 |
Product | periplasmic solute binding protein |
Protein accession | YP_001716980 |
Protein GI | 169830998 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin |
TIGRFAM ID | |
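The length fields in this table follow from the coordinates. With 1-based inclusive positions, gene length is End - Start + 1, and for an unspliced CDS the protein length is the codon count minus the stop codon. The sketch below checks that arithmetic in Python. It assumes inclusive coordinates and a single terminal stop codon, and the function names are illustrative, not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp copied from the record above.
length = gene_length(878741, 879778)
print(length, protein_length(length))  # prints: 1038 345
```

Both results agree with the Gene Length (1038 bp) and Protein Length (345 aa) rows.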
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCAAG TATCTGCGTT ATCAGAGGGA GGGAGCATAG TGTTTTACCG GACAAGAAAG ACATTGGCGG GGTTGGTCTG CCTGCTGTTC GTTTTCGCGG CCGGTTGCGG GGGTCCCGCC GAGACACCCC AATCACCGGG ACCCGGCAAA ACGGTGGTGA TCGCCACCAT TTTTCCGCTG GCCGATATTG CCAAGAACAT CGGCGGCGAC CGGGTAGCGG TGTCGACGCT TTTGCCCCGT GGGGCCAGTC CCCACACCTT TGAGCCGACA CCGCGGCAGA TGGAGCAGGT GACCGGCGCG CGGGTGCTCA TCCAGGTGGG GGCCGGGCTC GACGACTGGG CCGGAAAGCT CGCCGGGGTG GCCGGCAAGG ACCTCCTCCG GGTTGCAGTC ACCGAGGGGC TGTCCCTGCG CGCGGGCGGG CACCAGCACG CGGAATGTGC GGGGCATGCC CATGAGCACG ACCATAACGC CTGCCCGGCG GCGGAGACCG CTGCGGCAGA CCCGGCGTAC GGGTTCGCCG ACCCGCACGT GTGGCTTGAC CCGGTGCTGG TCCGGGACGA GATCGCCCCC CGCATTTTCT CCGCCCTTTG CCAGGCCGCG CCGGAGGATG CGGACTACTT CGCCGCCAAT CTGGAATCCT ACCAGGCTGA GCTGACCGTG CTGCACGCGG ACCTGACCGC ATTGACCGCC GGATTTGAGC GCCGCAGTTT CATCGCCTAC CATTCGGCCT GGGGATACTT CGCCGACCGG TACGGGCTGG TGGAAGCGGC TACGGTCGAG GAAGCACCGG GCAAGGAGCC TTCTCCGGGC TGGATCATGA AGGTGGTGGA GACGGCCCGG GCCCACCAGG CCGGGGCGAT TTTCGCCGAG CCCCAGTTCA GCACCAAGGC GGCCGAGGTG ATCGCCGCCG AGTACGGCGC CCGGGTGTTG GTTTTGGACC CGCTCGGCGG GGAGGATATC CCCGGTTATG ACAGCTACGT CAACCTCATG CGTTCGAATG CCGCCGTATT GGCCGAAGGT CTTTCCCGGG TGGACTAA
|
Protein sequence | MDQVSALSEG GSIVFYRTRK TLAGLVCLLF VFAAGCGGPA ETPQSPGPGK TVVIATIFPL ADIAKNIGGD RVAVSTLLPR GASPHTFEPT PRQMEQVTGA RVLIQVGAGL DDWAGKLAGV AGKDLLRVAV TEGLSLRAGG HQHAECAGHA HEHDHNACPA AETAAADPAY GFADPHVWLD PVLVRDEIAP RIFSALCQAA PEDADYFAAN LESYQAELTV LHADLTALTA GFERRSFIAY HSAWGYFADR YGLVEAATVE EAPGKEPSPG WIMKVVETAR AHQAGAIFAE PQFSTKAAEV IAAEYGARVL VLDPLGGEDI PGYDSYVNLM RSNAAVLAEG LSRVD
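The sequences above are printed in space-separated blocks of ten, so the spacing has to be stripped before any analysis. As a rough sketch, the Python below removes the spacing, computes GC content, and translates a coding sequence. It assumes the standard codon assignments, which translation table 11 shares with table 1 (the tables differ in their permitted start codons, and this gene starts with ATG). The helper names are hypothetical, and only the first two blocks of the gene sequence are used in the demonstration.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Drop the block spacing and any line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate frame 1; a trailing partial codon is ignored.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = clean_sequence(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*" and to_stop:
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence from the record above.
fragment = "ATGGATCAAG TATCTGCGTT"
print(translate(fragment))  # prints: MDQVSA
```

The translated fragment matches the first six residues of the listed protein sequence (MDQVSA). Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content row.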