Gene Shel_23860 details


Gene Information

Locus tag: Shel_23860
Symbol:
ID: 8396275
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Slackia heliotrinireducens DSM 20476
Kingdom: Bacteria
Replicon accession: NC_013165
Strand:
Start bp: 2646216
End bp: 2647394
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 61%
IMG OID: 644987133
Product: 5-aminoimidazole-4-carboxamide ribonucleotide transformylase
Protein accession: YP_003144744
Protein GI: 257065072
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID:
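
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Coordinates taken from the record above.
length = gene_length_bp(2646216, 2647394)
print(length)                     # 1179
print(protein_length_aa(length))  # 392
```

Under these assumptions both values agree with the Gene Length and Protein Length fields listed above.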


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0532002
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACGAGC TCGAACTCAA ATATGGCATG AACCCTAACC AGAAGCCTGC CCGCATCTTC 
ATGCGTGAAG GCGGCGATCT GCCGATCGAG GTGCTCAACG GCAAGCCCGG CTTCATCAAC
TTCCTCGACG CCTTCAACTC CTGGCAGCTG GTTAAAGAGC TGAAGGAAGC CACCGGCCTG
CCCGCGGCGG CATCCTTCAA GCACGTCAGC CCTGCCGGCG CTGCCGTGGG CCTGCCTCTG
ACCGACCTGG ACCGCAAGAT CTATTTCGTC GACCAGGAAG GCGAACTCAG CCCCATCGCC
TGCGCATACA TTCGCGCTCG CGGCGCCGAC CGCCTGTGCT CCTACGGCGA CTGGGCGGCC
CTGTCCGATG TGTGCGACGC CGACGTGGCC CGTTACCTGA AGCTTGAGGT GAGCGACGGC
ATCATCGCAC CCGGCTACAC CGATGAGGCG CTGGAAATCC TCACCACCAA GAAGGGCGGC
AAGTTCAACG TGGTGCAGAT CGACCCCGAA TACACTCCCG CCGAACTGGA GTTCAAGGAC
GTTTTCGGTA TCACGTTCCA GCAGGGCCAC AACAACTTCA AGATCGACCG CGAGCTGCTC
TCCAACATGG TGACCGAGAA CAAGGACCTG CCCGAGCAGG CGGTCATCGA CCTGATCATT
TCTCTCATCA CCCTCAAGTA CACGCAGTCC AACTCGGTCT GCTACGTCAA AGATGGCATG
GCCATCGGCG TCGGTGCGGG TCAGCAGAGC CGCATCCACT GCACGCGTCT GGCTGGCAGC
AAGGCTGACA ATTGGTACAT GCGCCAGCAT CCGAAGGTTC TGGGGCTGCA GTTCGTCGAC
GACATCCGCC GTCCGAACCG CGACAACGCC ATCGACGTCT ACACCAGCGA CGAGTGGGAA
GACGTCCTGC GCGAGGGCGA GTGGCAGCAG ATCTTCAAGG TGAAGCCCGA ACCGCTGACT
GCCGAGGAGA AGAAGGAGTG GATCGCCACC CAGACCGGCG TTTCCGTCGG ATCCGACGCG
TTCTTCCCCT TCGGCGACAA CGTCGAACGT GCTCGCAAGT CCGGCGTGTG CTACATCGCC
GAGCCGGGCG GCTCCATCCG CGATGACCAT GTGGTCGAAA CCGCCAACAA GTACGGCATC
GCCATGGCCT TCACGGGCAT GCGTCTGTTC CATCACTAA
 
Protein sequence
MNELELKYGM NPNQKPARIF MREGGDLPIE VLNGKPGFIN FLDAFNSWQL VKELKEATGL 
PAAASFKHVS PAGAAVGLPL TDLDRKIYFV DQEGELSPIA CAYIRARGAD RLCSYGDWAA
LSDVCDADVA RYLKLEVSDG IIAPGYTDEA LEILTTKKGG KFNVVQIDPE YTPAELEFKD
VFGITFQQGH NNFKIDRELL SNMVTENKDL PEQAVIDLII SLITLKYTQS NSVCYVKDGM
AIGVGAGQQS RIHCTRLAGS KADNWYMRQH PKVLGLQFVD DIRRPNRDNA IDVYTSDEWE
DVLREGEWQQ IFKVKPEPLT AEEKKEWIAT QTGVSVGSDA FFPFGDNVER ARKSGVCYIA
EPGGSIRDDH VVETANKYGI AMAFTGMRLF HH
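
The gene and protein sequences above can be related with a short translation sketch. It is illustrative only and is not how the record was generated. It assumes translation table 11 (the bacterial code named in the record), in which the amino acid assignments match the standard code and an alternative start codon such as GTG is read as methionine at the first position. The names `translate` and `gc_fraction` are hypothetical helpers, and only the first 30 bases of the gene are used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing and line breaks used in the record and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if pos == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
first_30 = "GTGAACGAGC TCGAACTCAA ATATGGCATG"
print(translate(first_30))  # MNELELKYGM
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.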