Gene Rcas_2478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2478 
Symbol 
ID5539959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3187446 
End bp3190577 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content61% 
IMG OID640894608 
Producthypothetical protein 
Protein accessionYP_001432576 
Protein GI156742447 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTCA TTGCCGAACT GAGCCTCATT CTGATTGCAG CAGCGGTGTT GTGTTGCTAC 
GTCGGATGGG GTGCGGCGCG GTTGGCGTTG CCACCGTCGC TTGCATCATT CCGCGCGCCA
TTGACGCCGC TGATCGGCTA TGTCGTCCTG CTCTGGAGCG GGTTCATGCT GGCGAGCCTG
GTGCTCAACT TGCGTTGGAC AGTGGCAGTC ATCCTGATCG GCGCCACCGT TCTGAACATT
CTCACCTGGC GTGCAGAGGG GCCGCCACAA CCGCTGGCAT GGTTGCGTGC CCAACCAGAG
GCGCTGATCC CACCATTGCT GGCGCTGCTG ACAGGCATCC TGCCGTTGCT CGAGTACGGC
TATCCAACGA TCATTGGGCG CGGATGGGAT ACCGAGGCTT ACCTGCCGAT GGCGCAGCAC
CTGATCGACT ATTCCCTGCC GCGCATTCCG GAAGCGCCAC AGAGTCTCCT GCGCGACCTT
GTGACGCATC CACCGCGAAT CGGGCTAACC CTTGGCTTTT CGATATTTCA TGGAATGACG
ATGATCTTCA GCGGCGCCAG CGCACTGGCA TCATTCGCGC CTGTCATTGC GTTTATGCGT
GCGCTGGCTG TACTGGCGAT GTATGTCTGG CTGCGCGCAA CGATGGACGC AGGGCGGGTC
GGATCATTTC TGGGAGCAAC GCTCACCGCG CTGACCTCGC TGATGCTCTG GATCGGCTAT
TTCAACTTCG GGATGCAGAT GTCCGCCTGG TGTCTACTGG CGCTGGCGCT CACCACCGGG
CTGGCAGCCG TTGATGATCT GGCACAGCGT CGTCTGGCGG CATGGCGTGG AGCGCTGCTG
GCGGCAATCG CTCTGGCTGC CATACCCATC GCCTACTATC CGGCGCTGGT TATCGCCGTT
CCGCTCATTT CAGCCGCAGG AGCGGCGCGT CTGTTCGAAA CATGGCGTCA TCCGCAGCCG
TACACGACGC CGATCTCGCT GGCGCTGGCG GCGCTGGCAC TTGCCGGATT GACCCTGGCT
GCGTCTGCGC TGGCGGTTCA GGATTACTTC GAGGGGTTCA GTTTTCGCTA CTCGCTGATC
GAGCCAAAAA TCGGACCGGA TCGGTTTATC GGCGTCGATG AAATCCTGGG ATTGACCGCA
TTTCGCCTGT CTAACGACGG TGATCAACCA CCTTCGTTGC TGATCGGAGT TGCGCTGCTG
GCGACGGTGT TGCCGGGATG TGCAGCGTTG GTTCTGCCGC ACCGGTGGCG CAATGACGCT
GATGCCGGTG AACGCACGCG ATTACGCTGG ACACTGACCA TTGCTGCCGT CGCAGCAGCG
TTGATCTGGC TACGCTTTGG CAGACCGTAT GAATATGGCT TTATGAAAGG CGCTGCGTAC
ACTTCGTTCG TCATCTGGGG GCTGACTGCG TCAGGGGTAG AACGAATCGC CCAATGGACA
AAGCGCACCG GGATGCTGCT GGCGTCCAGC GCTGCTCTGC TGATCCTTGC CTGCACCGGT
TGGTCGCAAT CGCTGACGGT CGCCGATCAT ATACGCGGAC CGGCAATCTT CACCCGTGAT
ATTGCTGCAT TCGACCGGGT AGCGGCGCAA CTGCCGCATG GCGCGACCGT GTTGTTGAGC
GGCGACGAGA CCCTGACCGG ACCGATCAAT GGTATGCTGG CGACAATGCT GTATGGCAAG
GAACTCTGGG GACGGGTTCC CGCCGCGTAT GCTGCGCAAT CGTTCTGGTC TCCTGGCGAA
ACGCCGAACT ATGTCGTGCT GGCAGCGCGC GAGGACCCCT GGCCCCTGGA CGTTGGCGCG
AAGGAGCGCT GGCGGAGTAG CGCGATTGCT CTCTACGAAA TGCCGCCGGA TGCCACCTTT
GTTCTGGGAC GCAGCGAGAG TTATGTCATT GCAGCAGTCG ATCCAAAATC GCCCGCATCG
CTGGCAATCT GGCGACGTGC CGGGCACAAT CGCGTCATTG CGCCCAACGA ACCCTTTACT
CTGGAGATGC CGCACGCAGC GACGTTGCGC CTGACGCTGG CAGCGCTGGA AGCGCAGACG
GTAATGTTGC GTCAGGGGCA TACCACCACA ACGCTCTCGC TAGAGGCAGG GGTTACAACG
ATCAAAACAG GGAGCAGTTC GACTGTACAG GTCATCCCCA CAGCGCCGCT GGCGCTGGTG
CATGCTGTTG TGTCCCCAAC CGATACGCCG ACGCCGGTCT CGACATCGCT CGACATAACG
CGCGTGGCAT GGAGCGCAAC GAGCGAACAA CAGGGCGATC AGATCGTTCT ATCGACAAGT
CTGGCAAATC CAGGCAATCA CGCCTTGCGT TACGAAGTGA TTATTATCGG CGATACGTTC
GATGCGCCGG TGCGCATCGC GCGGTTGCTG GCTGCTGCGC CTTTGGAAGG TGAATGGCGG
TTGGCGCTCG ATCTGGCACG CGGCGCTTCC GAAGCGCGAA TGAATGGCGC TCCTGCACCG
ATGCTGGCAG CCGATGTCGC CGTAAATCCT CCCGACGGTC GCTACTTTGG CGTTCTGGCG
ATCTATAGCG GCGGTGCGGT CGTTGCGCAG GCGCCGCTTT TTACCATGAC CATGAGCGAG
GGCGCCGTGG CGACCTTCGA GCCGGTCTTC TTCTCGGTCG AAACTGCCCG CGCCCGATCT
GACGCCTCGC CGCTCCCCGC GCATCAGCGC GCACTTCTCG CCGGAACGCC GCTGATGTGT
GACGAGTTGC GCCTGGCGCT GGAACAGATT GTTCTGGAGC GCCAATCACC CCCGCCTGGC
GTGACTCCTG TGACGCCACT CTCCCCCGGT GAACGTCTGA ACGTTCAGGT CTTCTGGCGT
GCAACCGGCG ACCGTGAGAA CCAGGATCGG TCACCAATGG TATCGTTCCA GGTGCTGGAT
GATGAAAACC GCAAATGGGC GCAGTGGGAC GGCGTACTCG GCGATTGGCT TCCTGTACCT
GCCTGGAAGC CCGGTGCAGC AGTGCGGCAG GACATCCCGT TGACGCTCGA TGCCGCCACG
CCGCCTGGCG ATTACCGCCT GTTGCTCATT GTGTACGACC CATCAACCGG TCGTCCCATT
CTGGTTGCCG GACAGGAAGC CGCAGTTGTC GGGAAGGTGA GGGTTGCGGC AAGCGGGGGG
ATAGATCCTT GA
 
Protein sequence
MSFIAELSLI LIAAAVLCCY VGWGAARLAL PPSLASFRAP LTPLIGYVVL LWSGFMLASL 
VLNLRWTVAV ILIGATVLNI LTWRAEGPPQ PLAWLRAQPE ALIPPLLALL TGILPLLEYG
YPTIIGRGWD TEAYLPMAQH LIDYSLPRIP EAPQSLLRDL VTHPPRIGLT LGFSIFHGMT
MIFSGASALA SFAPVIAFMR ALAVLAMYVW LRATMDAGRV GSFLGATLTA LTSLMLWIGY
FNFGMQMSAW CLLALALTTG LAAVDDLAQR RLAAWRGALL AAIALAAIPI AYYPALVIAV
PLISAAGAAR LFETWRHPQP YTTPISLALA ALALAGLTLA ASALAVQDYF EGFSFRYSLI
EPKIGPDRFI GVDEILGLTA FRLSNDGDQP PSLLIGVALL ATVLPGCAAL VLPHRWRNDA
DAGERTRLRW TLTIAAVAAA LIWLRFGRPY EYGFMKGAAY TSFVIWGLTA SGVERIAQWT
KRTGMLLASS AALLILACTG WSQSLTVADH IRGPAIFTRD IAAFDRVAAQ LPHGATVLLS
GDETLTGPIN GMLATMLYGK ELWGRVPAAY AAQSFWSPGE TPNYVVLAAR EDPWPLDVGA
KERWRSSAIA LYEMPPDATF VLGRSESYVI AAVDPKSPAS LAIWRRAGHN RVIAPNEPFT
LEMPHAATLR LTLAALEAQT VMLRQGHTTT TLSLEAGVTT IKTGSSSTVQ VIPTAPLALV
HAVVSPTDTP TPVSTSLDIT RVAWSATSEQ QGDQIVLSTS LANPGNHALR YEVIIIGDTF
DAPVRIARLL AAAPLEGEWR LALDLARGAS EARMNGAPAP MLAADVAVNP PDGRYFGVLA
IYSGGAVVAQ APLFTMTMSE GAVATFEPVF FSVETARARS DASPLPAHQR ALLAGTPLMC
DELRLALEQI VLERQSPPPG VTPVTPLSPG ERLNVQVFWR ATGDRENQDR SPMVSFQVLD
DENRKWAQWD GVLGDWLPVP AWKPGAAVRQ DIPLTLDAAT PPGDYRLLLI VYDPSTGRPI
LVAGQEAAVV GKVRVAASGG IDP