Gene Rcas_1543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1543 
Symbol 
ID5539019 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1969932 
End bp1971233 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content61% 
IMG OID640893681 
ProductPUCC protein 
Protein accessionYP_001431654 
Protein GI156741525 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000848404 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACGCTGA TCAAGAACAT TCGCCTGGGG TTGCTGCACG TGGCGATTGC TATGACCTTC 
GTGCTGATCA ATAGCGTGCT GAACCGGATT ATGATCCACG ATCTCGGCAT TCTGGCGAGC
GTTGTCGCTG TGCTGGTGGT GCTGCCGTAT ATCCTCTCGC CAGCGCAGGT CTGGATCGGG
CAATATTCCG ATACCCATCC GATGTTTGGG TACCGGCGCA CACCGTATAT CGCGTTGGGC
ACGCTGCTCG CGCTGGCCGG CGCAGCGCTG GCGCCGCACG CAGCCCTGGC GCTGGTGCGT
GATCCGCTGA TCGGTGTACC ACTGGCGATT CTGCTCTTCG GGATGTGGGG TGTCGGGTAT
AACCTGGCGG TCGTCGCATA CCTGGCGCTC GCCAGCGATA TGTCTACCGA GCAGCAGCGT
TCACGCACAG TGGCGATCAT GTGGTTCATG ATGATTGCCA GCGTCATTGT GACTGCGATT
GTCGTCGGGC GCGCGCTGGA GCCGTACAGT GAAGAGCGCC TCTTTACCGT CTTTCTGGAG
ACTGGCGGCG TGGCGCTGGC ATTGGCGCTC GTGGGGTTGA TCGGTCTCGA ACCGCGGCGC
GAACCTATTG CTGTGCAGCA GAGTCGCGCC GGACAGGTGG CGGCTATTCG CGCCGTACTC
GACAATCCAC AGGCGCGCAT CTTTTTCGTC TACCTGATCA TGATGCTGGC GGCGATCCTG
GGTCAGGATG TGCTGCTGGA GCCATTTGGC GCACAGGCGT TCGGAATGAA TGTCAAGGAA
ACCACCCAAT TGACCGCAAT GTGGGGCGGC GCAACACTCT CGGCGCTGCT GCTGTATGGC
GCTGTGCTCA GCCGCTGGAT GAGCAAGAAG CGCGGCGCGA TGATCGGCGG CTCGATTGCC
GCGACCGGCT TTCTGCTGAT TGCCCTCAGC GGCATGCTCG CTATCGAAGC CATGTTCCTT
CCCGGCATCG TGCTGCTTGG CTTTGGTACC GGCATTGCCA CAACCACCAA CCTGGCGCTC
ATGCTCGACA TGACGACGGC TGAACAGGTC GGATTGTTTA TCGGTGCGTG GGGCGTAGCG
GATGCATTGG CACGCGGGGT GGGCACGCTC CTTGGCGGCG TCATGCGCGA TGTGATTGCG
CACATGAGCG GAAGCGCCGT CAGCGGTTAT GTCAGCGTGT TCCTGATCGA GGCGTTACTG
TTAGGCATTT CTCTGGTATT ATTACAGCGC ATCGACGTAA CCGCCTTCCG CAGCCGCCAG
CCGTCGCTGA CCGAACTGGT TGCGCTCTCT GGCGACGCCT GA
 
Protein sequence
MTLIKNIRLG LLHVAIAMTF VLINSVLNRI MIHDLGILAS VVAVLVVLPY ILSPAQVWIG 
QYSDTHPMFG YRRTPYIALG TLLALAGAAL APHAALALVR DPLIGVPLAI LLFGMWGVGY
NLAVVAYLAL ASDMSTEQQR SRTVAIMWFM MIASVIVTAI VVGRALEPYS EERLFTVFLE
TGGVALALAL VGLIGLEPRR EPIAVQQSRA GQVAAIRAVL DNPQARIFFV YLIMMLAAIL
GQDVLLEPFG AQAFGMNVKE TTQLTAMWGG ATLSALLLYG AVLSRWMSKK RGAMIGGSIA
ATGFLLIALS GMLAIEAMFL PGIVLLGFGT GIATTTNLAL MLDMTTAEQV GLFIGAWGVA
DALARGVGTL LGGVMRDVIA HMSGSAVSGY VSVFLIEALL LGISLVLLQR IDVTAFRSRQ
PSLTELVALS GDA