Gene Daud_0180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_0180 
Symbol 
ID6026943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp205091 
End bp206236 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content63% 
IMG OID641593033 
ProductATP:guanido phosphotransferase 
Protein accessionYP_001716376 
Protein GI169830394 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3869] Arginine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAGTTTGA AGAACACGTT ACAGAGCCCG TACAGCCAGT GGATGGAGGA CGACGCTCCC 
GAGTCGGACG TGGTCATCTC CTCGCGGGCT CGCCTGGCCC GAAGTCTGGC CGGTTACCCC
TTTCCCCACC GGCTTTCTCC GGAACAGGCC GAACAGGTGA TCCAGGCAGT CAGCCTGGCC
GTCCGCAACG CGGAGTTCCG GACACGTTTC GGCGACGTGG AGTTGGTCCG GATGACCGAA
TTGTCTCCGG TCGACCGGTG GATACTGGTG GAAAAGCACT TGATCAGCCC CGGTTTTCTC
AAGAACACGG GGAGCAGTTT CGAATACAAG GGGCTGGTGC TGACCCCGGA CGAGCAGCTG
AGCATCATGG TGAACGAAGA AGACCACCTG CGCATCCAGT GCCTGTTCCC CGGGCTGCAG
CTTGAGGCCG CGGCCCGCAC GGCGGACGAG GCCGACAGCC TGCTGGAAAA GACGCTTGAT
TTCGCGTTTT CGGACCGGAT CGGCTATCTA ACCGCCTGTC CGACCAACGT GGGGACCGGG
CTCCGCGCTT CGGTGATGGT GCACCTCCCG GGACTGGTAC TGCTCGGACA GGTCAAGGAG
GTGCTGACGA CCGTTTCCAG GCTGGGGCTG ACGGTACGCG GCCTGTTCGG CGAGGGGACC
GATGCGGTGG GCAACCTTTT CCAGGTCTCG AACCAGGTGA CCCTGGGTCA CCGGGAGTCT
GAAATTACGG GCAACCTGGC TTCGGTCACC CGGCAGGTGA TCGAGCAGGA GCGCTCAGCC
CGCCAACAGC TGGTCAGGCA GATGCCGGTG GTCATGCGTG ACCGGGTGGG CCGGGCGCTC
GGGATCCTTA AACACGCGCA CACCCTGGGT GTGGAAGAGG CCATGCGCCT GATTTCGGAC
GTACGCCTGG GGGTAATGGC GGGACTCCTG AAGGGGCCGC CGTCGCGGGT GCTTCTGGAA
CTTATGGTTA TCACCAGGCC GTCGTACCTG GTGAGAGTCA GCGGGCGGGA ACTATCCCCG
CCTGAATGGG ACGAATTGCG GGCCACACTG GTCCGGGAGC TTGTGAACGC GCATTCCGGA
GAACATCCCG GAGCGCAGGA TGGTCGTGAC ACCGGCCGCC GACCGGAAGA GAAGAAGAAG
ATATGA
 
Protein sequence
MSLKNTLQSP YSQWMEDDAP ESDVVISSRA RLARSLAGYP FPHRLSPEQA EQVIQAVSLA 
VRNAEFRTRF GDVELVRMTE LSPVDRWILV EKHLISPGFL KNTGSSFEYK GLVLTPDEQL
SIMVNEEDHL RIQCLFPGLQ LEAAARTADE ADSLLEKTLD FAFSDRIGYL TACPTNVGTG
LRASVMVHLP GLVLLGQVKE VLTTVSRLGL TVRGLFGEGT DAVGNLFQVS NQVTLGHRES
EITGNLASVT RQVIEQERSA RQQLVRQMPV VMRDRVGRAL GILKHAHTLG VEEAMRLISD
VRLGVMAGLL KGPPSRVLLE LMVITRPSYL VRVSGRELSP PEWDELRATL VRELVNAHSG
EHPGAQDGRD TGRRPEEKKK I