Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daud_1474 |
Symbol | |
ID | 6027540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Desulforudis audaxviator MP104C |
Kingdom | Bacteria |
Replicon accession | NC_010424 |
Strand | + |
Start bp | 1557022 |
End bp | 1558020 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641594292 |
Product | hypothetical protein |
Protein accession | YP_001717612 |
Protein GI | 169831630 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000000729086 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACAGAA TACTTTCAAT CGCCGCCGCC ATAACCGTCC TGGCTGCCGG AGCAGCGGCC GCCTGGTTCT TCACACATCA CCCGGAACCG GAGGTGCACC GCCTCTCAGT GCGGGAAACA GCACGCGAGA TAGCCTACCT GCCCCACTAC CTGGCGGCGG CCTTGGGCTA CTTCGAGGAC AACGGCCTGC ACCTGAGCCT GACCACCGCA CCCCGGGGGA TAGTGGACCC GGCCCAGGAC AAAGCCGAAC TCTACCTCAC CCCTCTGGAC CGGTTGATGG TCGACGGGCC GGTGGCCTTC GCGGCCCTGA CCGGAAAAGA CCCTAACTTC TTACTTGGCC GGGAAGCAAA ACCGGACTTT AAGTGGGAGA ATCTAAAGGA GTACACCGTG ATCGGCGACC CGCCCGACAC CGTCGGGGAG GTGGCGCTGG AGGAAATCCT GCGGCGCCTC GAACTGGTGC CCCAGACCCA CGTCACCATC ATCCAGCACC TGCCGGTGCA TCTCCGGGTG GGCGCTTTTC TGTCCGGGAC AGGAAGTTTC ATCATCCTGC CGGATCCCAT GGCCGCCTAC CTGGAGAAGT CCGAAACCGG CTACGTCCTG GTTTCGCTCG CGGAGGCCGG CGAGATTCCG TCCCGGGTCT GCGCGGCCAC GCCAGAGTTT TTGCAGACCC ACCCCGACGT CGCCCTGGGG TACTGCTTAG CCGTCTTCCA GGCCCAAGAA TGGATCGCGC AAAAGAGTCC CGAGGAAATC GCCGTCGTCG CAGCGCCTTT CTTCCCCTAC CTCGACCTCG AAACTCTGAC CCGGGTAATC GCCCGGAACA AGGAGACCAG GCTCTGGGCC CCAAACCCGC TGGTTGAGGA AAACGCCTAC CAGAACCTGC AGAACTGGTT GATCCAGTCC GGGGAGTTGC CCCAGGCCGT GCCCTACCAC CAGGCCGTGG AACCTTCGTT CGCCCGCGAG GCGGTGGAGG GGGGCGCACT CCCAGTCCCC CAGCAATGA
|
Protein sequence | MHRILSIAAA ITVLAAGAAA AWFFTHHPEP EVHRLSVRET AREIAYLPHY LAAALGYFED NGLHLSLTTA PRGIVDPAQD KAELYLTPLD RLMVDGPVAF AALTGKDPNF LLGREAKPDF KWENLKEYTV IGDPPDTVGE VALEEILRRL ELVPQTHVTI IQHLPVHLRV GAFLSGTGSF IILPDPMAAY LEKSETGYVL VSLAEAGEIP SRVCAATPEF LQTHPDVALG YCLAVFQAQE WIAQKSPEEI AVVAAPFFPY LDLETLTRVI ARNKETRLWA PNPLVEENAY QNLQNWLIQS GELPQAVPYH QAVEPSFARE AVEGGALPVP QQ
|
| |