Gene Rcas_2158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2158 
Symbol 
ID5539638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2771944 
End bp2773869 
Gene Length1926 bp 
Protein Length641 aa 
Translation table11 
GC content64% 
IMG OID640894291 
Producthypothetical protein 
Protein accessionYP_001432260 
Protein GI156742131 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000475979 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCATCAA AACATCTGTC TCGCCGCGCG GCTCTGTCTG CCGCTGCGAC ATTCGCCGCG 
CTTGCGTCGA TCCCTTCCGC GTTTGCGCAG GAAGGCGTAG CGCTTGTTGC CCGTCGCGCG
GTGGTTGCGC CGCTCCAACC GGTCGAGGTT GACGTGCGCA TTCCCGGATA CACCGGTCCT
GCTGCCGTTA TCCTGTTCGA CAGCCGCAAA CGGTTCGCCG GCGTTGCCGA AGGACACGTC
GAGAACGGCA TTGGAACCCT GATCGCCGTG CCGCGCGGCG CGCCCGGTGC GCAGTGGGTG
GCGTTGTTCG CCTCTGGTCG ATTCGTCGCC GCCAATCCGG CGCTGTTCTC ACTCGATGCG
CGCACCGAAA TCTGGACCGG GCAGGAACGC TTCGACCGGT TTGTGCCGAA TGCGGCTGCC
ATATTTGCTG CTGCCACGCT GTCGTACACC TTCAATGGCG CATTCTTTCA CGGTTACCGT
TCGCCCGACA GCCCGCTGAT CTGGTTGCGC GACCACGTGT ACGCACAACG CGGCGCGCGC
TACTTCGACG CCGATCTCAA AACCGCCTTT GACGACTTTC GCCGCTACCA GCAACCGGAT
GGCAGTTTCC CCGATTTCCT GCCGCGCCCT CCATGGACTG ATCGCGCGCT CCGGGTGCCG
GTCGAAGCCG ACGTTGAGTA CCTCTACGTC CAGGGAGTGT ACGAAGCCTG GCAGGCGACT
GGCGACGATG CCTGGATGCG TAGCCACCTG GAACCGATGC GGCGCGCCGT GACGTACTCG
TTGCAGCACC CGCTGCGCTG GGACGCCGAA CGTGGTCTGA TCAAGCGCCC TTTCACCATT
GACACCTGGG ATTTCGAGTA CGGTTCGACG ACGACCGACC CGGAAACGGG CAAGCCTGCG
CCGCGCCACT GGATCGACGA CAAGACGATC TGGGGCGTTT TTCACGGCGA CAACACCGGC
ATGGCACAGG CGCTGACGAT GCTGGCGCGG ATGGAAGAGC GCGTCGGCGA TGCAACCCTG
GCGCGTGTCT GGCGTGATGT TGCCGCCGGT CTGATACGCA ACCTGAATGC GCTCAGTTGG
AACGGGCGCT TCTTCCGCCA TCATGTCCCC TTTCAGTCCT TCGACATCCC CGGCGTTGAT
CGGGAGCGGC AGTTGAGCCT CTCGAACGCC TATGCGCTCA ACCGTGGCGT GCTCACCGTT
CAGCAGGGGC AGGCGATCAT CGACGAATAT ATCGAACGCT CGAAGACTAT GCGCGCATTC
GCGGAATGGT TTAGCATCGA TCCGCCCTTT CCACCGGGAA GTTTCGGGCT TGCGGGGCGC
AGCGGCGAAC TCCCCGGCGC GTATGTCAAC GGCGGCATCA TGCCGATCAC CGGCGGCGAA
CTGGCGCGCG GCGCATTTCG CTACGGCAAC GAAACCTATG GCTTCGCCAT TCTCGAACAC
TACTGGCTGC GCATGCTCAG TCGCGGGCGC ACCTTTCTCT GGTACCATCC CGACGGCGCA
GAAGGGGTCG GCTCCGATGA CACCATTCCG ACCGATGCGT GGGGGACGGC TGCGATGTTT
ACTGCGCTGA TCGAGGGCGC TGCCGGCATC GAGGATCAGG GCATCGCCAT GCGCGATGTA
ATCGTCAGCC CTCGCTGGGG CGCCGCTGGT CTGACCTCGG CGTATGTCTC GGCGCGCTAC
CCGGCGAGCG ACGGGTATCT GGCGTATGCC TGGCGTCAGC ATCCGCGCCG TATCGACCTC
GACCTGAGCG GGGTCTTTGA TCGCGCGCGA GTGAGGGTGC TGCTGCCGCA GGACACGCCG
GGATCGGTCG AAGCGCTGGT CAACGGTGTG CCCGTGCCGC ATACCATCGA AACCCTGCGC
GCCAGCCGGT ATGTCATCAT CGACGTAGCG GATATGGCAG TTGTTCAGGT ACAGGTGCGC
TGGTAG
 
Protein sequence
MPSKHLSRRA ALSAAATFAA LASIPSAFAQ EGVALVARRA VVAPLQPVEV DVRIPGYTGP 
AAVILFDSRK RFAGVAEGHV ENGIGTLIAV PRGAPGAQWV ALFASGRFVA ANPALFSLDA
RTEIWTGQER FDRFVPNAAA IFAAATLSYT FNGAFFHGYR SPDSPLIWLR DHVYAQRGAR
YFDADLKTAF DDFRRYQQPD GSFPDFLPRP PWTDRALRVP VEADVEYLYV QGVYEAWQAT
GDDAWMRSHL EPMRRAVTYS LQHPLRWDAE RGLIKRPFTI DTWDFEYGST TTDPETGKPA
PRHWIDDKTI WGVFHGDNTG MAQALTMLAR MEERVGDATL ARVWRDVAAG LIRNLNALSW
NGRFFRHHVP FQSFDIPGVD RERQLSLSNA YALNRGVLTV QQGQAIIDEY IERSKTMRAF
AEWFSIDPPF PPGSFGLAGR SGELPGAYVN GGIMPITGGE LARGAFRYGN ETYGFAILEH
YWLRMLSRGR TFLWYHPDGA EGVGSDDTIP TDAWGTAAMF TALIEGAAGI EDQGIAMRDV
IVSPRWGAAG LTSAYVSARY PASDGYLAYA WRQHPRRIDL DLSGVFDRAR VRVLLPQDTP
GSVEALVNGV PVPHTIETLR ASRYVIIDVA DMAVVQVQVR W