Gene Daud_1663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1663 
Symbol 
ID6026011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp1752133 
End bp1753470 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content64% 
IMG OID641594485 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_001717796 
Protein GI169831814 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.861892 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAGC AGCAGAAGAA CGGACCGCAA AGGCCGGCAT TCACCGAGGA CGTGATTGAC 
CTGCGGGCTT ATTTCAAGGT CCTGCACAAG TGGCGCAAGG TGATCGCGCT GGGGACCTTT
CTGGCCGTGC TCACCAGCGG CATCCTGAGT TTCTTTATTC TGCAGCCGGT GTACGAGGCC
AAAACGCTCT TGATGGTCAC CCAGGCCGCG GACCGGCAGC GGGTGGTGGA ACAGGACGGC
CTGGAGGGCG TGGTCGGGAC CCTCTCGCGG ATTCCGGTGA TGACCATGAA CACCTACCTG
GGGCAGATCA AGAGCGAGGC CCTGATGGAC CGGGTGATCG CAAAGCTGGG CCTGGACCGG
GCGCTGTATG AGCCCAGGCA CCTGCTGGAG ATGGTGGACG CCTCGGTGAT CAAGGACGCC
AACCTGATCG AAGTGCGAGT GCGGCACACC GACCCGGTGC TGGCGCGGGA TATCGCGAAC
GCCATCAATA CCGAGTATCT GTTGATGCTG TCCGACAAGA ACGAGGAGCA GATGGCCCGG
TCGGTGGACT TCCTGGAACG CCAGCGGGAC GAAGCCCTAG CGCGGCTGGA CGAGGCGCAG
GGAAAACTGA AGGAGTTCGA AGCGCAGCCG CGGAGCGTGG CCGTGTTGGA GGCGGAATTC
ACCCAGAAGT CAGAAGATCT GGCCCGGTAC AAGTCGCGGC TGAACACGGC CGTGATCGAG
CTGCAGCAGA TCAGCGCCGG CGTGGCCCGG ATGCAGGAGG AATTGAACGC CACGCCGCAG
ACCATCAGTG TGGAACGAAG CACTGATGAA GGAGTGGTCA CCGCCCGGGA GCCCAACCCG
GTTTACGCGA CCCTGTCCGA ACGTTTGAGC GAGCGCAAGG CCGCCCAGGC GGAGAAGGAA
GCCGAGGTAC AGGCACTGAC CCAGCTGATC GCCGGCCTGG AGGGAGAGAC CAGCGTGCTG
CTGGCCGAAC TGACGACCAA GCGGGCCGGC CTGGACCGGC TGCGGGACGA GGTGAACCGT
CTGGATGCCA CGGCGAATCT GCTGGCGCAG AAAGTGACGG AAACCCAGAT TGCGCGATCC
ATTGACCTGG GCCAGACCTC GGTGGTGGTG GTGTCGCCGG CGAACACGCC GACCACGCCG
GTGAAGCCGA ACAAGAAACT GAACATGGCG GTGGCGTTGG TGCTGGGACT GATGGTGTTC
GTGGGATTGG CCTTCGTGCT GGAACACCTG GACTACACGA TCAAGAACCC GGAGGATGTG
GAGCGGCACC TGGAGCTGCC GGTGCTGGGA GTGGTGCCCG CGGTGGACGC GAGGGCGGCG
CAGCGGTCCA CCTACTGA
 
Protein sequence
MNEQQKNGPQ RPAFTEDVID LRAYFKVLHK WRKVIALGTF LAVLTSGILS FFILQPVYEA 
KTLLMVTQAA DRQRVVEQDG LEGVVGTLSR IPVMTMNTYL GQIKSEALMD RVIAKLGLDR
ALYEPRHLLE MVDASVIKDA NLIEVRVRHT DPVLARDIAN AINTEYLLML SDKNEEQMAR
SVDFLERQRD EALARLDEAQ GKLKEFEAQP RSVAVLEAEF TQKSEDLARY KSRLNTAVIE
LQQISAGVAR MQEELNATPQ TISVERSTDE GVVTAREPNP VYATLSERLS ERKAAQAEKE
AEVQALTQLI AGLEGETSVL LAELTTKRAG LDRLRDEVNR LDATANLLAQ KVTETQIARS
IDLGQTSVVV VSPANTPTTP VKPNKKLNMA VALVLGLMVF VGLAFVLEHL DYTIKNPEDV
ERHLELPVLG VVPAVDARAA QRSTY