Gene Daud_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_2017 
Symbol 
ID6026257 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp2121518 
End bp2122531 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content69% 
IMG OID641594839 
Productthiamine-monophosphate kinase 
Protein accessionYP_001718140 
Protein GI169832158 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTGG CGGATGTCGG CGAGTCGGGG CTGGTGGAGC GGCTCCTCGG GCGGCTTGCC 
CGGGGCCCCG GTGTAGTCCG GGGCGCCGGA GACGACGCCG CCGTGCTCGA TCTGGGCGGT
AAAGAACTAT TGCTGTTCAC CGTGGACACC CTGGTGGAGG AAGTTCATTT TTCCAGGGCC
TACGGCTCGA TGCGGGATCT GGGCGCCAAG GCCATGGCTG TAAACCTGAG TGACGTCGCT
GCCATGGGCG GCCGGCCGGT GTATGCGGTC GTGAGCCTGG CGGCCCCGGC GGAAACCGCG
GTGGCGGACA TCGACGATTT GTATGCGGGA CTCGCCGGTA CAGCGGCCCG GTACGGCGTT
ACCCTGGTCG GAGGCGACAC CGTACGTCAC CCGCACGGGC TCGTGATTAC AGTGGCCCTT
TTGGGTCTCG CCGGGCGGGA GCGGGTGCTG TACCGCAAGG GCGCCGTGTC GGGAGACCTG
TTCTACGTCA CCGGCAGCCT GGGGGCGAGC GCTGCCGGGC TGTTCTTGTT TCAAAACCCG
CATCCGGCCT GCCCGCCGGA GGTGGAAGAC CGGTTGAAAA AAGCGCACTT GAGCCCGGAA
CCCCGGGTGG TGGCCGGCGG CTTGCTCGCC GCCAGCGGGG TGGTCAGCGC CGCCGAGGAC
ATCAGCGACG GCTTAGCCTT GACCGTGGCC CACATCTGTA CGGCCGGCGG CGTGGGTGCG
CGACTCCTGG CCGACCGGGT GCCGCTCTCC CCGGAGGTGC GGCGGTTGGG AATCCTTACC
GGCAAAGACC CCCTGGAGTG GGCGCTCTTC GGGGGCGAGG ACTACGAACT CCTGTTCACG
GTGCGCCCCG GAGCGGCCGC CGGCCTGGAA AGAGAAATGG CGGCGGCGGG CTGGCCGGTG
ACCTGGATCG GGGAAGTGCT CGGTCCCGGA GAGGGGCTGT GGCTCGAAGA CGCGGCGGGC
GCTGGGCGCC CCCTGGTTCC CGGGGGTTAC GACGCCTTCG GGACCGAACC GTGA
 
Protein sequence
MRLADVGESG LVERLLGRLA RGPGVVRGAG DDAAVLDLGG KELLLFTVDT LVEEVHFSRA 
YGSMRDLGAK AMAVNLSDVA AMGGRPVYAV VSLAAPAETA VADIDDLYAG LAGTAARYGV
TLVGGDTVRH PHGLVITVAL LGLAGRERVL YRKGAVSGDL FYVTGSLGAS AAGLFLFQNP
HPACPPEVED RLKKAHLSPE PRVVAGGLLA ASGVVSAAED ISDGLALTVA HICTAGGVGA
RLLADRVPLS PEVRRLGILT GKDPLEWALF GGEDYELLFT VRPGAAAGLE REMAAAGWPV
TWIGEVLGPG EGLWLEDAAG AGRPLVPGGY DAFGTEP