Gene Rcas_3359 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3359 
Symbol 
ID5540858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4387729 
End bp4388889 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content62% 
IMG OID640895477 
Productcarbamoyl-phosphate synthase, small subunit 
Protein accessionYP_001433427 
Protein GI156743298 
COG category[E] Amino acid transport and metabolism
[F] Nucleotide transport and metabolism 
COG ID[COG0505] Carbamoylphosphate synthase small subunit 
TIGRFAM ID[TIGR01368] carbamoyl-phosphate synthase, small subunit 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.663891 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAC GAACATACGC AATGCTGGCG CTGGAGGACG GTACGATCTG GCATGGATAT 
GCGCTGGGGG CGATTGGCGA GCGCACCGGT GAGGTCGTGT TCAACACCTC GATGACCGGC
TACCAGGAAG TGTTGACCGA TCCATCCTAT TATGGCCAGA TTGTGGTGAT GACGGCGCCG
CACATCGGCA ACACCGGCGT TAACCGCGAA GATGAAGAAA GCCGTCATCC CTGGGTCGCC
GGTTTTGTGG TGCGCGCCGC AAGTCCGTAT GTCTCCAACT GGCGCGCTGC GCAGTCGCTC
CATGAGTACC TGGCGGAACA CGGTATCGTG GCAATGACCG GAGTCGATAC CCGCGCGCTG
GTGCGCCATA TTCGCACGCA AGGCGCAATG CGCGCCGTCA TCTCGTCGGA GAACCCGGAG
CCGGATCGCC TGATCGCCGC TGCGCGCGCC GCGCCGTCGA TGAATGGGCT TGACCTGGTG
CCGTATGTGA CCTGCGCTGA GCCGTACCAC TGGGTCGAGG GCAATCCAGG CGAGTGGGGA
CCGGGCGAAA CGCCAGCACA ACGTGGTGAA TCGACATTTC ACGTGGTCGC CTACGACTTT
GGGATCAAGC GCAATATTCT GCGGTTGCTG GCAGAGCACG GTTGTCGCGT GACGGTCGTG
CCCGCCACCA TGCCGGCTGC CGACGTCCTG GCGCTCGAAC CCGATGGTGT GTTTCTCTCG
AATGGACCGG GCGATCCGGC GGCGGTGACG TATGGCGTCC AGGCAGTGCG CGATCTGCTG
GGCAAAACTC CTGTGTTCGG CATCTGCCTG GGGCATCAGA TTCTTGGTCT GGCGCTCGGC
GGCACGACCT ATAAGTTGCA CTTCGGTCAT CGTGGCGGCA ACCAACCGGT GCGTTTCAGC
GATACTATGC GGGTGGAGAT TTCCAGCCAT AACCACGGCT TTGCGGTCGA TGCGTCGTCG
TTGCCGGAGG GAGTTGAGAT TACGCACATC AACCTGAACG ATGGGTGCGT CGAAGGGTTA
CGCGCGCCGG ATCAACGCGC TTTCAGCGTG CAGTATCATC CCGAAGCCGC GCCGGGACCG
CACGATGCGC GCTATCTGTT TCGCCGGTTT GTCGAACTGA TGGAGCAGGC GCGAAACCAG
CGTTCAATGT CGAGCGGTTG A
 
Protein sequence
MTTRTYAMLA LEDGTIWHGY ALGAIGERTG EVVFNTSMTG YQEVLTDPSY YGQIVVMTAP 
HIGNTGVNRE DEESRHPWVA GFVVRAASPY VSNWRAAQSL HEYLAEHGIV AMTGVDTRAL
VRHIRTQGAM RAVISSENPE PDRLIAAARA APSMNGLDLV PYVTCAEPYH WVEGNPGEWG
PGETPAQRGE STFHVVAYDF GIKRNILRLL AEHGCRVTVV PATMPAADVL ALEPDGVFLS
NGPGDPAAVT YGVQAVRDLL GKTPVFGICL GHQILGLALG GTTYKLHFGH RGGNQPVRFS
DTMRVEISSH NHGFAVDASS LPEGVEITHI NLNDGCVEGL RAPDQRAFSV QYHPEAAPGP
HDARYLFRRF VELMEQARNQ RSMSSG