Gene Daud_2034 details


Gene Information

Locus tag: Daud_2034
Symbol:
ID: 6025846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 2141960
End bp: 2143120
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 63%
IMG OID: 641594855
Product: major facilitator transporter
Protein accession: YP_001718156
Protein GI: 169832174
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
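
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2141960, 2143120)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1161 bp) and Protein Length (386 aa) fields.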


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCGGTCGA CTTGGCTGGA CCAGGGCCTG ATTCTGTTAT TCACCGGGTC CTTCATGGTT 
TTTGTGAACC TGCACCTGGC GTTCATCATA ATGCCCCTGT ATGTGCTCGA ACTGGGGGGC
GGCGACTGGA CGGCGGCCTG GTACAATACC CTTCTGGCCG GAGCGGCGGT GCTGTTCCGG
TTTCTTTTCG CTTCCTGGGT GGACCGGTTC GGGCGCAAGT TTTCCCTGCT GGTGAGCGGA
TCGGCCCTGG TGACCGCGCC GCTGTTTATT CTGCTGGCCG GTTCGCCGGC CTATTTGTCG
TTCATTCGTG TGTTCCAGGC CCTGGGCCTG GCGCTCTACC CGCTGGCCGC GAACACCCTG
ATCGCCGACC TAAGTCCGGT GGCGCGGCGT GGGACCGTCC TGGGCCTGCA GCGGTTGATC
ATTATCACCG CCCTCATCAC TGGGCCGCCC GTGGCGGTCC TGATTGTCGA GCAGTACGGA
TTCCAGACCC TGTTTTGGTT GCTTACCATT CTGGGGCTCG CCGGGATGGC GCCGCTCCTG
GCCATTCGCG AACCGGTGCG CGCCGGAACC GGGACTCCGG TTCTGAACGG GTTTCAGTTC
GTTCTTGCCT CCCGTCCGCT GCGCGTGTTG ATCTCGTCGA CGGCCGCCTG CGGCCTGGCC
TACGGTGTAC TGCTCACCTT TCTCCCTTTG TACGCGGTGC GCGTGGGAAT CGACAATTTC
GGCCTCTATT TCACTGTGTT TGCTTTCAGC GGCCTTGTTT CGGGGGTGGT TGCCGGGCGC
CTGTCCGACG CCTTCGGCCG CCGCAAAGTG CTGGTGCCGT CGCTGGCCCT TTTCGGCATG
GGCATTCTGT ATCTCGGCCT CCCGGCGCCG GGAGCGGTAA TGATGGTCAG CGCCGTGGTG
GCCGGTATCG GCTACTCGGC CTCACTCACC CTGCTGGTCG CCTGGGTGGT GGATGCGGCC
GGCCGTAAGC TGCGCGCGGC TTCCCTGGGT CTTTTTGAAA ACGGGATCGA CGTGGGGATT
ACCGCGGGCT CTTTTGCGTT TGGGAGTGTG GTCGCCCTGC TGGGTTTCGG GTTCGCTTTT
TCCACCGCCG GCGCGCTGCT GCTGATATTC GCGGTCCTGA TCGCCACCCT GGACCGGGGC
CCCGTTTCCC AAATCCGTTA G
 
Protein sequence
MRSTWLDQGL ILLFTGSFMV FVNLHLAFII MPLYVLELGG GDWTAAWYNT LLAGAAVLFR 
FLFASWVDRF GRKFSLLVSG SALVTAPLFI LLAGSPAYLS FIRVFQALGL ALYPLAANTL
IADLSPVARR GTVLGLQRLI IITALITGPP VAVLIVEQYG FQTLFWLLTI LGLAGMAPLL
AIREPVRAGT GTPVLNGFQF VLASRPLRVL ISSTAACGLA YGVLLTFLPL YAVRVGIDNF
GLYFTVFAFS GLVSGVVAGR LSDAFGRRKV LVPSLALFGM GILYLGLPAP GAVMMVSAVV
AGIGYSASLT LLVAWVVDAA GRKLRAASLG LFENGIDVGI TAGSFAFGSV VALLGFGFAF
STAGALLLIF AVLIATLDRG PVSQIR
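
The gene and protein sequences above are printed in blocks of ten characters. The sketch below shows one way to strip that layout, compute a GC fraction, and translate a nucleotide string with translation table 11. It is a minimal illustration, not the database's own method: the helper names are hypothetical, it assumes the sequence is given on the coding strand and in frame, and it assumes the first codon is read as methionine when it is one of the table 11 initiation codons.

```python
from itertools import product

# Genetic code for translation table 11, with codons ordered by T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for table 11; as a first codon these give Met.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is still read as Met
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence shown above.
first_line = (
    "TTGCGGTCGA CTTGGCTGGA CCAGGGCCTG ATTCTGTTAT TCACCGGGTC CTTCATGGTT"
)
print(translate(first_line))
print(round(gc_fraction(first_line), 2))
```

Applied to the first line of the gene sequence, the translation matches the first 20 residues of the protein sequence, MRSTWLDQGLILLFTGSFMV. The gene begins with TTG, so the leading methionine depends on the start-codon assumption above.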