Gene Daud_2147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaud_2147 
Symbol 
ID6026186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Desulforudis audaxviator MP104C 
KingdomBacteria 
Replicon accessionNC_010424 
Strand
Start bp2263381 
End bp2264628 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content66% 
IMG OID641594965 
Productglycine hydroxymethyltransferase 
Protein accessionYP_001718266 
Protein GI169832284 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCTGGA ACCGGTCCCT GGCCGAAACC GACCCCGAAA TCGCCCGGGC CATCGCGCTG 
GAGATCACCC GTCAAGGCGC CAAGCTTGAG CTGATCGCCT CCGAGAACTT CGTCAGCCGC
GCCGTCCTGG AAGCCCAGGG TTCGGTGCTG ACGAACAAGT ACGCCGAGGG CTATCCCGGC
GCGCGCTACT ACGGCGGCTG CGAGTACGTG GACATCGTGG AGAGTGTGGC GATCAGGCGG
GCGAAGGAAA TCTTCGGCGC CGGGCACGCG AACGTGCAGC CCCACTCCGG GGCCCAGGCC
AACATGGCCG CCTATTTCGC CTTCCTCGAA CCGGGCGACA CGATCATGGG GATGCGTCTG
GCCCACGGGG GGCACCTGAC CCACGGCGCG AAGATCAATT TCTCGGGCCG GTACTTCCGG
TACGTGCCCT ACGGGGTGGA GGAGGAAACC GGCCGGATTG ACTACGACCG GATGCATGCC
ATCGCCCGCG AACACCGCCC GAAACTGATC GTCGGCGGGG CCAGCGCCTA CCCGCGCGAA
CTGGACTTCG CCCGGATGCG TGCCATTGCG GATGACGTCG GTGCGCTCTT GATGATCGAC
ATGGCGCACA TTGCCGGCCT GATCGCCGCC GGACTGCACA TGTCCCCGGT GCCGTACGCC
GACGTGGTGA CCACCACGAC CCACAAAACC CTGCGCGGCC CGCGGGGCGG GATGATCCTG
TGCCCGGAGG AGTACGCCGC CGCCATTGAC AAGGCGGTAT TCCCGGGAAT CCAGGGCGGC
CCTCTGATGC ACGTGATCGC GGCCAAGGCC GTGGCCCTGG GCGAGGCTCA GCGCCCCGAG
TTCAAGACCT ACCAGGAACA AATCGTGAAA AACGCCCGCG CCTTAGCCCA AGCCCTGCAG
GAGCGGGGTT TTGAGCTGGT GGCGGGCGGC ACCGACACCC ACCTGATCCT GGTCGACCTC
CGGAACAAGG GCCTCACCGG CGCCGTGGCC GAGGACCTTC TGGACCGGGT GGACGTCACC
GTGAACAAGA ACATGGTTCC GTTCGATCCC CAGCCGCCCC GGGTCACCAG CGGCATCCGC
ATCGGCACCC CGGCGGTCAC CACCCGCGGG ATGAAGGAGG ACAGCATGGT CCAGATCGCC
GAGGTGATCA GCCTGACTCT GGATCATCCG GAAGAAGGGG CCGTCCAGGC GCGGGCGAAA
GCCATTGTTG CCGAATTGTG CGCCGCCCAC CCGTTCCTGA AACTGTAG
 
Protein sequence
MVWNRSLAET DPEIARAIAL EITRQGAKLE LIASENFVSR AVLEAQGSVL TNKYAEGYPG 
ARYYGGCEYV DIVESVAIRR AKEIFGAGHA NVQPHSGAQA NMAAYFAFLE PGDTIMGMRL
AHGGHLTHGA KINFSGRYFR YVPYGVEEET GRIDYDRMHA IAREHRPKLI VGGASAYPRE
LDFARMRAIA DDVGALLMID MAHIAGLIAA GLHMSPVPYA DVVTTTTHKT LRGPRGGMIL
CPEEYAAAID KAVFPGIQGG PLMHVIAAKA VALGEAQRPE FKTYQEQIVK NARALAQALQ
ERGFELVAGG TDTHLILVDL RNKGLTGAVA EDLLDRVDVT VNKNMVPFDP QPPRVTSGIR
IGTPAVTTRG MKEDSMVQIA EVISLTLDHP EEGAVQARAK AIVAELCAAH PFLKL