Gene Rcas_2186 details


Gene Information

Locus tag: Rcas_2186
Symbol:
ID: 5539667
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2807631
End bp: 2808965
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 58%
IMG OID: 640894319
Product: hypothetical protein
Protein accession: YP_001432287
Protein GI: 156742158
COG category:
COG ID:
TIGRFAM ID:
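
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes 1-based inclusive coordinates and an unspliced CDS whose final codon is a stop codon.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2807631
END_BP = 2808965


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


print(gene_length(START_BP, END_BP))  # 1335
print(protein_length(1335))           # 444
```

Both results agree with the Gene Length (1335 bp) and Protein Length (444 aa) fields.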


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.131537
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0421946
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTAC CGCGCATATG TGTCAGCACG AATCGGCGTT TTCTCATGAC CGAGAGCGGT 
GAGCCATTCT TCTGGCTTGG CGATACCGCC TGGGAACTTT TTCATCGCCT GAGCCTGCCA
GAAGCCGAGC ACTATCTGAA CGCGCGGCGT CAGCAGGGAT TCAACCTGAT TCAGGCGGTG
GCGCTGGCTG AACTAGACGG CTTGCACACA CCTAATGCCC AGGGACATGT GCCGCTTCTT
GGTGATGATC CAACCAGACC GAACGAGTTC TACTTCCGCC ACGTGGATGC GATCATCCGC
ATGGCGTCTG ATAAGGGTCT CTACATCGGT CTGCTGCCGA CATGGGGCGA TAAAGTGCAT
GCGCGCCTGT GGGGCATTGG ACCGGTTATC TTCAACGCCG GCAATGCGCG TATCTACGGT
CGCTTCCTGG GTGAACGCTA CCGAAACGAC ACCAACATCA TCTGGATTCT GGGTGGAGAT
CGCCCTGCCC ACGGGTACGA AGAGGTATGG GCAGCCATGG CACAAGGCAT CACAGAAGGG
TCGGGGTTCA AGCCTTTTTT TACTTACCAT CCAATGGGTG GAACCAGTTC TTCCGCTGTG
CTCCACGATG CCGATTGGCT CGACATGAAC ATGCTCCAAT CCGGTCACTG TCTGATGGAC
GCGCCAAACT GGGAAATGAT CCGCGCCGAC TATGAGCGCA CACCGACCAA ACCGGCGCTG
GACGGAGAAC CGAACTATGA GCACCATCCC ATCGACCCCT ATCTACGTCC ATGGAAACCG
GAGTATGGTC GCTTTACCGA CTATGATGTG CGCAAGCAGG CGTATCGCGC AGTGTTTGCG
GGCGCATGCG GTCACACCTA TGGCAGTCAC TCAGTCTGGC AGATGTGGTC GTCCAGATAC
GCACCAACCA CCTTTCCGTC ACTTCCATGG GACGAAGCCC TATACGGACC GGGAGCGCAG
CAACTGGTGC ATCTGAAGAA CCTGATCCTC GCCCACCCGT ACTTCACCCG CATCCCCGCG
CCCGACCTGC TGCCAGATGT GGCGCCTATA CCGCTGCCTG TGGACCAGGA GGATCGGATC
AATCCCGTTC GCGCTGCGCA TCCCGTTGCG ACCCGCGATA GTGAAGGAAC GTATGGGTTG
GTCTACTTCC CGATGGCAGG TCAGTCGTTG CGGGTGGATT TGCGTGTGCT CAGAGGGGAA
GTCAGATCAG CATGGTTTGA TCCGCGGAGT GGTGCGACTC ATCCCATTGG AATGCACTCT
CAAGGGATTG TGACCTTCGT TTCGCCAATC GGCGGACCGG ACTGGGTGCT GGTCCTGAAG
GCAGAGCAAC CCTGA
 
Protein sequence
MSLPRICVST NRRFLMTESG EPFFWLGDTA WELFHRLSLP EAEHYLNARR QQGFNLIQAV 
ALAELDGLHT PNAQGHVPLL GDDPTRPNEF YFRHVDAIIR MASDKGLYIG LLPTWGDKVH
ARLWGIGPVI FNAGNARIYG RFLGERYRND TNIIWILGGD RPAHGYEEVW AAMAQGITEG
SGFKPFFTYH PMGGTSSSAV LHDADWLDMN MLQSGHCLMD APNWEMIRAD YERTPTKPAL
DGEPNYEHHP IDPYLRPWKP EYGRFTDYDV RKQAYRAVFA GACGHTYGSH SVWQMWSSRY
APTTFPSLPW DEALYGPGAQ QLVHLKNLIL AHPYFTRIPA PDLLPDVAPI PLPVDQEDRI
NPVRAAHPVA TRDSEGTYGL VYFPMAGQSL RVDLRVLRGE VRSAWFDPRS GATHPIGMHS
QGIVTFVSPI GGPDWVLVLK AEQP
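
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is a minimal illustration, not the database's own method: the names `CODON_TABLE` and `translate` are hypothetical, and it assumes the sequence is already in the coding orientation and in frame. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares for internal codons; table 11's alternative start codons are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Standard codon assignments, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in-frame DNA, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence, copied with the record's spacing.
first_line = (
    "ATGTCTTTAC CGCGCATATG TGTCAGCACG AATCGGCGTT TTCTCATGAC CGAGAGCGGT"
)
print(translate(first_line))  # MSLPRICVSTNRRFLMTESG
```

The output matches the first 20 residues of the protein sequence, and the last 15 bases of the gene (GCAGAGCAAC CCTGA) translate to AEQP followed by a stop, matching its end.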