Gene Daud_1972 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_1972 
Symbol 
ID6026811 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp2075362 
End bp2076561 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content66% 
IMG OID641594790 
Productphosphoribosylaminoimidazole carboxylase, ATPase subunit 
Protein accessionYP_001718095 
Protein GI169832113 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0002154 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAACGG TGTTTCCGGG AGGCCGCGTC GGGGTGCTGG GAGGCGGCCA ACTGGGCCGG 
ATGCTGGCCC TGGAGGCGAA ACGGATGGGA TACGGGGTGG GCGTGCTGGA CCCCGTGGCT
GGTTGTCCGG CGGCTCAAGT TGCGGACTTT TTCCTGCAGG CCTCCCTGGA TGATGTCGAG
GCCGCTCTGA GGCTGGCCGC CCAAGTGGAC GTGGTTACCG TAGAAAACGA GTTCGTGCCG
GCCGCGCTTC TGGCTCGGCT CGAGAGAGCC GTGCCGGTCC ACCCAAGCGC GAGCGTGTTG
CGTACCATCC AGGACCGCCT GCTGCAGAAG GAGTTTCTCA AGACGGCCGG TTTTCCCCAG
GCACCCTTCG CGGCGGTTGA CGATCCCCGC TGTCTTGGCG AGGCCGTCCG TGCGGTTGAC
TTTCCCGCCG TTCTCAAGAG CCGGCAGGGC GGTTACGATG GAAAAGGCCA GGTGGTGGTG
ACTGAACCCG GTGCGCTGGA GAGCGCCTGG CGGGCCATCG GTGGCCGGCC GGCCGTACTG
GAAACCCTCG TTCCTTTCAA GATGGAGATC GCGGTCATTC TGGCCCGCGG CGTGCAGGGC
GAGACGCGGG TCTACCCGGT GGCCGAGAAC GTGCACGTGC GGCATATCCT CCACACCACC
AGGGTGCCGG CCGCGGTGTC GGAGCGGACC AGCCGGGAGG CGAAACGGAT GGCCTGCGAC
ATTGCCGAGT TGCTCGGACA CGTCGGGGTC ATGGCGGTTG AAATGTTCGT CTTGGGCGGC
GAGAGCGTGC TGGTCAATGA GATCGCTCCC CGAACGCACA ACAGCGGACA CTACACTTTC
GGCGCCTGCG TGACCTCTCA GTTCGAGCAA CATCTGCGGG CGGTCTGCGG TCTGCCGTTG
GGCGATCCCG CGCTGCTCTC CCCCGCGGTC ATGGTGAATC TGCTTGGAGA GCTTTGGGTT
GAGGGCACCC CGTGCTGGGA AACGGTGTTG TCGCGCCCAA ACGCCCGCCT GCACCTTTAC
GGCAAAAGGG AGGCAAAGGT GGGCCGGAAG ATGGGGCACG TCCTGATCGT TGATGCCGAC
ACCGACCGCG CCTTACGGGA GGCGGAGGAG ATCGTGGTGC TGCTTCGCCC CGGCGACAGC
GTCTCGGCTC CGATCAGCCG GGCGGGCGGC GGGGCGGACG GGAGATGGGC GCGATGCTGA
 
Protein sequence
MKTVFPGGRV GVLGGGQLGR MLALEAKRMG YGVGVLDPVA GCPAAQVADF FLQASLDDVE 
AALRLAAQVD VVTVENEFVP AALLARLERA VPVHPSASVL RTIQDRLLQK EFLKTAGFPQ
APFAAVDDPR CLGEAVRAVD FPAVLKSRQG GYDGKGQVVV TEPGALESAW RAIGGRPAVL
ETLVPFKMEI AVILARGVQG ETRVYPVAEN VHVRHILHTT RVPAAVSERT SREAKRMACD
IAELLGHVGV MAVEMFVLGG ESVLVNEIAP RTHNSGHYTF GACVTSQFEQ HLRAVCGLPL
GDPALLSPAV MVNLLGELWV EGTPCWETVL SRPNARLHLY GKREAKVGRK MGHVLIVDAD
TDRALREAEE IVVLLRPGDS VSAPISRAGG GADGRWARC