Gene Daud_0201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0201 
Symbol 
ID6026445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp232909 
End bp234480 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content58% 
IMG OID641593056 
Producttransposase IS66 
Protein accessionYP_001716395 
Protein GI169830413 
COG category[L] Replication, recombination and repair 
COG ID[COG3436] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.228467 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCATGTG TCGAATTGGT AGACATGGAT AATGCAGCGA AAACCATCGA AGAACTCCAG 
ATAAAATGTG CTTTACAGCA ACAGCAAATC GCTGAACTAA CGGCCAAACT TAACTGGTTT
GAAGAACAGT TCCGTCTCAG CCAACAGCGT CAATTCGGCC GCTCCAGTGA GCAGACCCAA
AACCAAGTGG AGCTTTTCAA CGAGGCAGAG GCCGAAGCCA GAGCGTCTTT CGAACCAACG
ATCGAGGAAA TCACCTACCG CCGCCGCAAA AAGCAGGGCC GGCGCCAGGA ACAGCTAAAG
GATCTGCCGG AGGAGATCAT TGAATACCGG CTCGCTCCGG AAGAGCAGAG ATGTGCCTGC
GGCGGCGCCC TGCACGAAAT GAGCACCGAG GTCAGGCAGG AACTCAAAAT CATTCCGGCC
CAGGTCAGTG TCGTCAAGCA CGTCCGCTAT GTCTACGCCT GCCGCCGCTG TGAACGGGAA
GACATCAAAA CCCCCATCGT CACCGCCCCG ATGCCGGCGG CGGTACTGCC GGGAAGCCTG
GTTTCCCCCT CGGCCATGGC CTACATTATG ACCCAAAAGT ACGTGGAGGG CATGCCGCTT
TACCGCCAGG AACAACACCT GGCCCGTCGG GGCGTGGAAC TCTCCCGCCA AACCCTGGCC
AACTGGATGA TCCAGGGTGC GGATCGCTGG CTAAGCCTCC TGTATGCCCG GATGCACAAG
CATCTACTGG CGCAAGACAT CCTGCACGCC GATGAGACGA CCTTGCAGGT ACTCAATGAA
CCGGGCCGGT CGGCGCAAAG CACTTCCTAC CTCTGGCTTT ACCGCACCGG GCGGGCCGGA
CCGCCAATAA TCCTTTATGA CTACCAGACC ACCCGGGCCA GTAAACATCC CCGCCGGTTC
TTGTCCGGCT TTAAGGGTTA CCTGCATGTC GACGGCTATA CCGGCTACAA CGAACTGCCG
GATGTGACCC TGGTCGGGTG TTGGGCGTAT GCCCGGCGCA AGTTCGACGA AGCGCTAAAA
GCACTGCCCA ACGCCCAGCG CGGTGCGGCG GTGGCCGCCA AAGAAGGGTT GGAGTTCTGC
AACCGTCTCT TTGCCATCGA ACGGGAGTTC CGTGAAGTCA CTCCCCAGGA GCGTCATACG
CGTCGCCAGG AACTCAGTCG GCCGGTGGTG GAGGCTTTTT CAGCCTGGCT GAAATACCAG
AGCCCCAGAG TTCTGCCGAA AAGCGCCTTC GGCCAAGCCA TCAAGTATTG CCGCAACCAG
TGGGACAGGC TTACCGTTTT TCTGGAAGAC GGCCGCCTGG AGTTGGACAA CAACCGCAGT
GAGCGCTCCA TTAAACCATT TGTCATCGGC CGTAAGAACT GGCTATTCGC GAACACCGCC
CGTGGGGCAA GCGCCAGTGC CATCATTTAT AGTGTTGTGG AAACAGCGAA GGAAAACGGC
CTCAACCCCT TCAGTTACCT GCAGTATCTT TTTGTAAAGC TGCCGAACAT GGATATTCAG
GATGAACAGG CCTTAGAAGA GTTGCTTCCC TGGTCGGCAA CACTGCCACC GATCTGTCGG
GGTGGCAAGT AG
 
Protein sequence
MPCVELVDMD NAAKTIEELQ IKCALQQQQI AELTAKLNWF EEQFRLSQQR QFGRSSEQTQ 
NQVELFNEAE AEARASFEPT IEEITYRRRK KQGRRQEQLK DLPEEIIEYR LAPEEQRCAC
GGALHEMSTE VRQELKIIPA QVSVVKHVRY VYACRRCERE DIKTPIVTAP MPAAVLPGSL
VSPSAMAYIM TQKYVEGMPL YRQEQHLARR GVELSRQTLA NWMIQGADRW LSLLYARMHK
HLLAQDILHA DETTLQVLNE PGRSAQSTSY LWLYRTGRAG PPIILYDYQT TRASKHPRRF
LSGFKGYLHV DGYTGYNELP DVTLVGCWAY ARRKFDEALK ALPNAQRGAA VAAKEGLEFC
NRLFAIEREF REVTPQERHT RRQELSRPVV EAFSAWLKYQ SPRVLPKSAF GQAIKYCRNQ
WDRLTVFLED GRLELDNNRS ERSIKPFVIG RKNWLFANTA RGASASAIIY SVVETAKENG
LNPFSYLQYL FVKLPNMDIQ DEQALEELLP WSATLPPICR GGK