Gene Daud_0391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0391 
Symbol 
ID6025691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp422029 
End bp423597 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content56% 
IMG OID641593239 
Productextracellular solute-binding protein 
Protein accessionYP_001716577 
Protein GI169830595 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGGCATA TAAAGCAAAG TATATCTATC CTGTTGATTA TTATGTTGAT TGCCGGGTTA 
GCCGGGTGTG GTGAGCAACA AAAAGAAGTA CAGGAAGCCA GGCAAGAGCT TGTGGTCGGG
TTGGCGGCCG ACGCCAATGA ACTCAAGTTC AAAGAAGTGG GTATCGGGTC GCCGAATGCC
AATATCTACG AAAGTCTCGT CAAGCTGGAC GCCGACTACC AGGTTCAACC GCTCTTAGCC
ACCAGGTGGG AGTACAGAGG CGACAACACC TGGAGGTTTT ACCTAAGGGA AGGCGTCAAG
TTTCATAACG GAGAAGAGTT GACGGCGGAG GCGGTCAAGC GGTCCCTGGA AGAGGAAATA
CGTCCCAGCG GTAGGGCCGT CCTGAAAATC GAGAAGGGCT CCGTCAAGGT TGTCGATAAA
TATACGGTCG ATATCGTCAC CACAGAGCCC AACATGCGCG TCCCGGAGAT CCTGGCGCAT
CCCCTCAACG GAATCCGAGC GCCGGGTGTC GATCCTGTTG CCAATCCGAC CGGCACAGGC
CCGTTCCGTT TCGTCCGCTA CGAAAAAGAT AAAGAACTCG TCGTGGAGCG CAATCCCGAC
TACTGGGGTA CACCGCCCAA GCTGGATAAG GTGACTTTTA AATACATTCC CGATCACAAC
ACGCGCCTGA TGGCCCTTCA AGCCGGAGAA ATAGACGTGG CGAGGGAGAT CCCCCGCGAA
ATGCTCGGGC AGGTCAAGGC GATGGACGGT GTCAGATTGG CAACCGCGCC GCAGGGCCCG
TATGTGGCAC TGTCCCTCAT GGTCAACGGG AAACCGCCGC ACGACATTTT GCGGGATAAG
ACAGTACGGC AGGCCATCGG GTGGGCTATC GACCGCGAAG CCATTATTCA AAAGGTGTGG
GAAGGCAACG CGGACGAGAG TCAAACGGTT ATTCCCGCGG GGATTCTGGG AGAGCACAAA
AACCTGGTAC AAGGATTTGG CTATGACCCG GCAAAAGCCG GACAGCTGCT TGATGACGCC
GGCTGGAAGC CCGGGCCCGA CGGTATCAGG ACCAAAAACG GCAGGCGGCT TGAGCTTACG
CTGGTTTCCG GTTTTCCTTC GGCCAGCGTG TTGAAACCTT TGCCCGAAGT CCTGCAGCAG
CAGCTTCGTG ACGTCGGCAT TGATGTCCGT ATTGTGGAGG TGGCTGATGA CGGCCTGTAC
TACGGCAGGC TGGAAAAGGG TGAAGGCGAC CTCTGGTTGG AGCGGGGGAA TCAGAATAAC
GGCGACCCGA CCTTCCTTCC GGAACTCCTA TACCACAGCC GGGGATACCA GGGGTCTACT
TACAACAAGC CGTTCTGGCC GGGGGAAGAG TTCGACCGCC TTATTGACGA GGCCAGGAAC
ACCCCGGATA TCCGCGAGGC CACCCGGCTT GTGGCGGAGG CTATGCACAT TCTGATTGAT
AAGGAAAGCA CCGTCGTGCC TATTGCCGCC CTTTACAATG TCTATGCCGT AAAAGAAAAG
GTCCAGGGAC TCAGCCCTCA TCCCGCGCCT ATCCACACCA GATGGGATAC CGCTTACATC
AAAGGATAG
 
Protein sequence
MRHIKQSISI LLIIMLIAGL AGCGEQQKEV QEARQELVVG LAADANELKF KEVGIGSPNA 
NIYESLVKLD ADYQVQPLLA TRWEYRGDNT WRFYLREGVK FHNGEELTAE AVKRSLEEEI
RPSGRAVLKI EKGSVKVVDK YTVDIVTTEP NMRVPEILAH PLNGIRAPGV DPVANPTGTG
PFRFVRYEKD KELVVERNPD YWGTPPKLDK VTFKYIPDHN TRLMALQAGE IDVAREIPRE
MLGQVKAMDG VRLATAPQGP YVALSLMVNG KPPHDILRDK TVRQAIGWAI DREAIIQKVW
EGNADESQTV IPAGILGEHK NLVQGFGYDP AKAGQLLDDA GWKPGPDGIR TKNGRRLELT
LVSGFPSASV LKPLPEVLQQ QLRDVGIDVR IVEVADDGLY YGRLEKGEGD LWLERGNQNN
GDPTFLPELL YHSRGYQGST YNKPFWPGEE FDRLIDEARN TPDIREATRL VAEAMHILID
KESTVVPIAA LYNVYAVKEK VQGLSPHPAP IHTRWDTAYI KG