Gene Daud_1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1474 
Symbol 
ID6027540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1557022 
End bp1558020 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content64% 
IMG OID641594292 
Producthypothetical protein 
Protein accessionYP_001717612 
Protein GI169831630 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000729086 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACAGAA TACTTTCAAT CGCCGCCGCC ATAACCGTCC TGGCTGCCGG AGCAGCGGCC 
GCCTGGTTCT TCACACATCA CCCGGAACCG GAGGTGCACC GCCTCTCAGT GCGGGAAACA
GCACGCGAGA TAGCCTACCT GCCCCACTAC CTGGCGGCGG CCTTGGGCTA CTTCGAGGAC
AACGGCCTGC ACCTGAGCCT GACCACCGCA CCCCGGGGGA TAGTGGACCC GGCCCAGGAC
AAAGCCGAAC TCTACCTCAC CCCTCTGGAC CGGTTGATGG TCGACGGGCC GGTGGCCTTC
GCGGCCCTGA CCGGAAAAGA CCCTAACTTC TTACTTGGCC GGGAAGCAAA ACCGGACTTT
AAGTGGGAGA ATCTAAAGGA GTACACCGTG ATCGGCGACC CGCCCGACAC CGTCGGGGAG
GTGGCGCTGG AGGAAATCCT GCGGCGCCTC GAACTGGTGC CCCAGACCCA CGTCACCATC
ATCCAGCACC TGCCGGTGCA TCTCCGGGTG GGCGCTTTTC TGTCCGGGAC AGGAAGTTTC
ATCATCCTGC CGGATCCCAT GGCCGCCTAC CTGGAGAAGT CCGAAACCGG CTACGTCCTG
GTTTCGCTCG CGGAGGCCGG CGAGATTCCG TCCCGGGTCT GCGCGGCCAC GCCAGAGTTT
TTGCAGACCC ACCCCGACGT CGCCCTGGGG TACTGCTTAG CCGTCTTCCA GGCCCAAGAA
TGGATCGCGC AAAAGAGTCC CGAGGAAATC GCCGTCGTCG CAGCGCCTTT CTTCCCCTAC
CTCGACCTCG AAACTCTGAC CCGGGTAATC GCCCGGAACA AGGAGACCAG GCTCTGGGCC
CCAAACCCGC TGGTTGAGGA AAACGCCTAC CAGAACCTGC AGAACTGGTT GATCCAGTCC
GGGGAGTTGC CCCAGGCCGT GCCCTACCAC CAGGCCGTGG AACCTTCGTT CGCCCGCGAG
GCGGTGGAGG GGGGCGCACT CCCAGTCCCC CAGCAATGA
 
Protein sequence
MHRILSIAAA ITVLAAGAAA AWFFTHHPEP EVHRLSVRET AREIAYLPHY LAAALGYFED 
NGLHLSLTTA PRGIVDPAQD KAELYLTPLD RLMVDGPVAF AALTGKDPNF LLGREAKPDF
KWENLKEYTV IGDPPDTVGE VALEEILRRL ELVPQTHVTI IQHLPVHLRV GAFLSGTGSF
IILPDPMAAY LEKSETGYVL VSLAEAGEIP SRVCAATPEF LQTHPDVALG YCLAVFQAQE
WIAQKSPEEI AVVAAPFFPY LDLETLTRVI ARNKETRLWA PNPLVEENAY QNLQNWLIQS
GELPQAVPYH QAVEPSFARE AVEGGALPVP QQ