Gene Rcas_1802 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1802 
Symbol 
ID5539280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2314131 
End bp2315270 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content59% 
IMG OID640893941 
Productcarboxylate-amine ligase 
Protein accessionYP_001431912 
Protein GI156741783 
COG category[S] Function unknown 
COG ID[COG2170] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02050] uncharacterized enzyme 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACATCGT ACAATCCTGC ATCGCCCGAT TTTCCGTTCA CGCTTGGCAT CGAGGAAGAG 
TATCAGGTCG TAGACCCGCA AACGCGCGAA CTGCGCTCGT ACATCACACA GATTCTCGAC
CGCGGGCGCA TGATCCTGCG CGAGCAGATC AAGCCGGAAC TGCACCAGAG CATGGTGGAA
GTCGGCACAC AACCATGCCG AACCATCCAG GAAGCGCGCG CCGAAGTAGT GCGGCTGCGC
GGCACGATTG CCGGTCTTGC CCGGCAACAC GGATTGACGA TTATCTCCGC CGGAACCCAT
CCGATCTCGT CGTGGATGAG CCAGGAGATC ACCCCGTTCG AGCGCTACAA AGGCGTCGTC
GAGGAGATGC AGCAACTGGC GCTGCAACTG CTGATCTTTG GCATGCACGT CCACGTCGGG
ATGCCGGATG ATGAGGTCGC CATTGAACTG ATGAACGTTG CACGCTATTT TCTGCCCCAT
ATTCTGGCGC TCTCAACATC ATCGCCATTC TGGATGGGGC GCAACACCGG CTTCAAGTCG
TACCGTTCCG CCCTCTTCTC GAACTTTCCG CGCACCGGCA TTCCGCCGAG TTTCCATTCT
GCCGCCGAGT TTCAGAACTA TGTGAAACTG CTGATCAAGA CGAACTGTAT CGACGATGCG
AAGAAGATCT ACTGGGACCT GCGCCCGCAC CCGTACTTTG GCACCCTCGA ATTTCGCGTG
TGCGACGCCG CGACGCGCGT GGACGAGTGC ATCGCGCTGG CGGCGCTCAT GCAGGCGCTG
GTCGTCAAAC TGCATCTGAT GTTTTCCGAA AACACCACCT TCCGCGTCTA TCGGCGCGCC
GTCATTATGG AGAACAAGTG GCGCGCTCAA CGCTGGGGGC TGGATGGCAA ACTGATCGAC
TTCGGCAAGC GCGCTGAAGT GGAAGCGAAG GCGCTCATGC ACGAACTGGT CGCCTTCGTC
GATGAGGTGG TCGATGAACT TGGCAGCCGC CACGAAGTCG AATACCTGCT CAACGTCGCC
GATGGCGGAT CGAGCGCCGA CCGGCAACTG GCGGTGTTTC GCGAAACGAA TGACTTGCAC
GCCGTGGTGG ACAATCTGAT CGTCGAGACC CTCGAAGGAG TGCCGGTGTA TCAGGGGTGA
 
Protein sequence
MTSYNPASPD FPFTLGIEEE YQVVDPQTRE LRSYITQILD RGRMILREQI KPELHQSMVE 
VGTQPCRTIQ EARAEVVRLR GTIAGLARQH GLTIISAGTH PISSWMSQEI TPFERYKGVV
EEMQQLALQL LIFGMHVHVG MPDDEVAIEL MNVARYFLPH ILALSTSSPF WMGRNTGFKS
YRSALFSNFP RTGIPPSFHS AAEFQNYVKL LIKTNCIDDA KKIYWDLRPH PYFGTLEFRV
CDAATRVDEC IALAALMQAL VVKLHLMFSE NTTFRVYRRA VIMENKWRAQ RWGLDGKLID
FGKRAEVEAK ALMHELVAFV DEVVDELGSR HEVEYLLNVA DGGSSADRQL AVFRETNDLH
AVVDNLIVET LEGVPVYQG