Gene RoseRS_0168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_0168 
Symbol 
ID5207103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp211514 
End bp212743 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content61% 
IMG OID640593798 
Productaminotransferase, class V 
Protein accessionYP_001274554 
Protein GI148654349 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.925051 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCTAC AAAGCGAGGA GTATGTGACT GACCGATTGA TCTACCTTGA TCACGCGGCA 
ACGACGCCGG TCGATCCCGT GGTGCTCGAG GCGATGATCC CCTATTTCAC CACGCACTTT
GGCAATCCAT CGAGCATCTA TCGCATTGGA CGCGCTGCAC TGCATGCACT CGACGAAGCG
CGTGATCAGA TTGCGTCGAT CCTGGGAGCA TCGCCGCGTG AGATTATTTT TACCGGCAGT
GGTTCGGAGA GCGATAACCT TGCGATTCGT GGCGTCGCAC TGGCGCAACG CCAGGCAGGA
CGGGGATCGC ACATTATTAC CAGCGCCATT GAGCATCACG CGGTGCTGCA TACCGTCGAA
CATTTGCAAG CGTTTGGGTT CGAGGCGACC ATCCTGCCGG TCGATCCCAA AGGTCTGGTG
CAGCCGGAAT ACCTGCGCGC CGCGTTGCGT CCGGACACGG TGCTGGTATC GATCATGTAC
GCCAACAACG AAATCGGCAC GATCCAGCCG ATTGCGGAAC TGGGCGCCAT CTGTCGCGAA
CGCGGCATTC CTTTCCACAC CGATGCCGTC CAGGCGCCCG GTGCGTTACC GCTCGACGTG
CGTGCGCTCA ACGTCGATCT TATGACCATT GCTGCGCACA AGTTCTACGG TCCCAAAGGC
GTGGGTGCGC TCTACGTTCG CCGGGGCACA CCGCTCCTGC CGCAGATCAC CGGCGGTGGA
CAGGAACGAC GACGACGCGC TGGCACCGAG AATGTGCCCG GCATCGTCGG CATGGCGACG
GCGCTGCGCC TGGCGGAAGA ACGCCGCGCG CACGACAGCG CCCACTGTGC GCGTCTGCGC
GACCGGTTGG TGGCGGGCAT TCTGGAACGT GTGCCGGGAT CGCGTCTCAA TGGTCATCCG
ACCCAACGAC TGCCCAATAA TGCCAGTCTT TCGTTCGAGG GTGTCGAGGG TGAAAGCATC
CTTCTGTTGC TCGATCAGCA CGGGATTGCT GCTTCGAGCG GCTCTGCCTG CACCAGTGGA
TCACTGGAGC CATCGCACGT GTTGATCGCC CTGGCTCGCG CTTCCGGGAC GGAACGTGCG
CCGGAAATTG CGGCTGCGCT CCCCGGCGCG GTGCGATTCA CTTTTGGGCG TGAGAACACC
GATGCCGATG TCGATGCGGT TCTGGAGATG TTGCCGGGTA TTGTTGCACA ACTGCGGGAA
ATGACGGTAA CATCGCAGGG AGGCGTATGA
 
Protein sequence
MSLQSEEYVT DRLIYLDHAA TTPVDPVVLE AMIPYFTTHF GNPSSIYRIG RAALHALDEA 
RDQIASILGA SPREIIFTGS GSESDNLAIR GVALAQRQAG RGSHIITSAI EHHAVLHTVE
HLQAFGFEAT ILPVDPKGLV QPEYLRAALR PDTVLVSIMY ANNEIGTIQP IAELGAICRE
RGIPFHTDAV QAPGALPLDV RALNVDLMTI AAHKFYGPKG VGALYVRRGT PLLPQITGGG
QERRRRAGTE NVPGIVGMAT ALRLAEERRA HDSAHCARLR DRLVAGILER VPGSRLNGHP
TQRLPNNASL SFEGVEGESI LLLLDQHGIA ASSGSACTSG SLEPSHVLIA LARASGTERA
PEIAAALPGA VRFTFGRENT DADVDAVLEM LPGIVAQLRE MTVTSQGGV