Gene Rcas_1844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1844 
Symbol 
ID5539322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2355945 
End bp2356970 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content63% 
IMG OID640893982 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_001431953 
Protein GI156741824 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.496619 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACAT ACCGCGATGC GGGCGTCGAT ATTGAGGCTG CGGCGCGCGC CAAACAGTTG 
ATGGCGGGAG CCGTGCACAG CACCTATTCC CCCGCAGTGC TGGCGGGCAT AGGCGCCTTT
GGCGGATGCT TCGATCTGCG CACCGCTCTG GGAGAAGCGC TCGGCGATGC CGTGCTGGTC
GCTTCTACCG ATAGCGTCGG CACGAAGACA CTGGTTGCTG CGGCGCTCGG TCGCTACGAA
ACACTCGGCT ACGATCTGGT CAACCACTGC GTCAACGATA TTCTGGTGCA GGGCGCACGA
CCACTGTTCC TGCTCGATTA TCTGGCGGTC GACCGGCTCA ATCCGCAGCG TGCTGCGACA
CTGGTAGCGA GTGTCGCAAC CGCGTGTCGC GCGGCCGGAT GCGTTCTGCT CGGCGGCGAA
ACGGCGCAGA TGCCCGATGT GTACCGCGAC GGCGCCTTCG AACTAGCGGG AACGATTGTC
GGCGTGGTGC AGCGCGAGCG GATGCTGCCG CAAAACGTTG CAATTGGCGA CGTTATCCTG
GCGCTGCCTT CGAGCGGATT GCACACCAAC GGCTATTCCC TGGCGCGGCG AGCGCTTGGA
CCGGATAGCG CCTTCGGGTA TGACGCTACA CCCGCCGAAT TAGGCGGCAG AAGCGTCGGC
GAGGCATTAC TCGAACCGCA CCGGTCGTAT CTTTCCGCGT TCGAGCAGTT GGCAGCAGCG
GAGATACCGG TACACGCGCT GGCGCACATC ACCGGCGGCG GCGTATATGA AAACCTGCCG
CGCGTCCTTC CAGAAGGGTA TGGCGCAGTC ATCCGGCGCG GCACATGGGA TGTCCCGCCG
ATCTGTGCGC TTGTGGTGCA TGCTGCCGGT CTCGATGAGC ACGAAGCCTA TCGCACACTG
AACATGGGTC TCGGCATGCT CGTAATCGTC CCATCCGAAG CCGCCGACGC CGCACTGCGC
ACCGTCCATG AAGCCCGGCT GGTCGGCGAA GTGATCGCCG GCGAAGGAGT GCATCTGATA
GCATAG
 
Protein sequence
MTTYRDAGVD IEAAARAKQL MAGAVHSTYS PAVLAGIGAF GGCFDLRTAL GEALGDAVLV 
ASTDSVGTKT LVAAALGRYE TLGYDLVNHC VNDILVQGAR PLFLLDYLAV DRLNPQRAAT
LVASVATACR AAGCVLLGGE TAQMPDVYRD GAFELAGTIV GVVQRERMLP QNVAIGDVIL
ALPSSGLHTN GYSLARRALG PDSAFGYDAT PAELGGRSVG EALLEPHRSY LSAFEQLAAA
EIPVHALAHI TGGGVYENLP RVLPEGYGAV IRRGTWDVPP ICALVVHAAG LDEHEAYRTL
NMGLGMLVIV PSEAADAALR TVHEARLVGE VIAGEGVHLI A