Gene Rcas_4094 details

Gene Information

Locus tag: Rcas_4094
Symbol:
ID: 5541605
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 5307035
End bp: 5308204
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 57%
IMG OID: 640896206
Product: biotin carboxylase-like protein
Protein accession: YP_001434144
Protein GI: 156744015
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase)
TIGRFAM ID:
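
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon. Both are assumptions, not something this record states. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(5307035, 5308204)
    print(length, protein_length(length))  # prints: 1170 389
```

Under these assumptions the computed values agree with the listed Gene Length (1170 bp) and Protein Length (389 aa).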


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.138669
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.680577
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATGA ACATCGTTTT TCTCTCACCG CACTTCCCGC CGAACTGGTA TCTGTTTTGC 
GTGCGCCTGC GCAATCTGGG CGCGAATGTT CTGGGCGTCG CCGACGAACC GTATGAACTG
CTGCACCCGG ACCTGCGCAC TGCATTGACC GAATACTACC GGGTATCGGA TCTGCACCAT
TACGACGAGG TGTTGCGTGC GCTCGGCTAT TTCACGCACC GCTATGGCAA AATCCAGCGG
GTCGACTCCC TGAACGAATA CTGGCTGGAA ACCGAAGCGC GGTTGCGCAC CGATTTCAAC
ATCGAAGGAC CGAAAATCAC CGATCTGCCC GGCATCAAGC GGAAATCGGA AATGAAGCGC
CTTTTCACCC GCGCACAGGT GGACGTAGCG CGTGGGATTC TGGCGCATTC ACCGGCGCAG
GTGCGCGCTT TCGCCGTTGA GGTCGGCTAC CCGCTCGTCG CCAAGCCGGA CGTTGGCGTC
GGCGCGAATC ACACGTACAA GATTACGAGC GACGCCGAAC TCGATGCCTT CCTGTCGCGC
CAGCCTGAGG GGTTTCTCAT CGAGGAGTAT GTGCATGGCG TCATTCAGAC TTTCGATGGA
CTTGCAGACC GTGATGGCGA ACCGGTCTTC TTTACGTCGA TGCAGTACAG CAACGGTGTC
CTGGAGGTTG TCAACAACGA CGACGATATT TACTATCTGA CCGAGCGTGA CATTCCGCCC
GATCTCGAAC AGGCAGGGCG GCGCATCCTC AAGATATTCA ATGTGCGCGA ACGATTCTTT
CACTTCGAGT TCTTCCGCAC TCCTAAAGGG CGATTGGTGG CGCTCGAAGT CAACATGCGT
CCTCCGGGCG GGCTGAGCAT CGATATGTTC AACTACGCCG GCAACATTGA CCTGTACAAT
GCATGGGCGA ATGTGTTGAT CAATCATCGC GTCAGCATCC CGCCGACGCG GCTGTACCAC
GTCTGTTACG CTGGACGCAA ACCGTTCCGT TCCTATGCTT TGACCCACGA AGAGGTGCTG
ATCCGCTTCG GCGATTGCAT CGTCCACCAC CAGCCGATGC ATCCGCTGTT TCATCGAGCG
ATGGGTGCGT ATGCATATCT GATCAGATCG CCGGATCGCG CGGAAGTAAT TGCAATTGCG
CAGGAGATTC AGCGGTTGAG CGTGTGTTGA
 
Protein sequence
MTMNIVFLSP HFPPNWYLFC VRLRNLGANV LGVADEPYEL LHPDLRTALT EYYRVSDLHH 
YDEVLRALGY FTHRYGKIQR VDSLNEYWLE TEARLRTDFN IEGPKITDLP GIKRKSEMKR
LFTRAQVDVA RGILAHSPAQ VRAFAVEVGY PLVAKPDVGV GANHTYKITS DAELDAFLSR
QPEGFLIEEY VHGVIQTFDG LADRDGEPVF FTSMQYSNGV LEVVNNDDDI YYLTERDIPP
DLEQAGRRIL KIFNVRERFF HFEFFRTPKG RLVALEVNMR PPGGLSIDMF NYAGNIDLYN
AWANVLINHR VSIPPTRLYH VCYAGRKPFR SYALTHEEVL IRFGDCIVHH QPMHPLFHRA
MGAYAYLIRS PDRAEVIAIA QEIQRLSVC
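
The gene sequence and the protein sequence above are related through translation table 11. The following is a minimal sketch of that translation, not the method used to produce this record. It assumes the sequence is given in the coding orientation, and it relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
# These assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGACCATGA ACATCGTTTT TCTCTCACCG"))  # prints: MTMNIVFLSP
```

This sketch does not handle alternative start codons such as GTG or TTG, which table 11 permits. It is unaffected here because the gene begins with ATG.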