Gene Rcas_3069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3069 
Symbol 
ID5540565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3973705 
End bp3976053 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content61% 
IMG OID640895188 
Producthypothetical protein 
Protein accessionYP_001433141 
Protein GI156743012 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACACG ACAGCCATTC AATCTACACC GACGCCGATT TTATCAAGGA CGATCTGCGG 
CGCTGCGATC TGGTGATGAA AGGCGGCATC ACCAGCGGCG TAGTCTATCC ACCGGCGATC
ATCGAGCTTG CAACGCGCTA CCGGTTTGTC AACATCGGCG GCGCCTCGGC GGGCGCGATT
GCAGCGGCAG CGGCAGCGGC GGCAGAGTAT GGGCGCGCCG TCCCCTATGC CGGGCATCGC
AGCGGTTTTC AACGGCTCGA CCGATTGCGC GCCTGGCTTG GCGAGGGTGA AGGCAATCTG
GCGGGGCTAT TCCAGCCATT CTCCCGCATG AAACCTCTGT TTCACGCGCT GTTCGACCTG
GTGATCGCTT CCAGAGCAGC GCCCCCAAAA CGCCCATCTG CATCCGGGAG ATCGCCGTCT
GTGTTTCGTC TTCCCGTTGC CGCGATCCGG TTTGTCTGGC GAGCGCTTTC CTTCTTCGTC
CGCATTACTC GATTGCTGGC GCGCCATCAT CCATCGATAA CACTGGGGAT CGGCGTTGCT
ATCGCACTGA TCGGTGCGGC GATCTGGATT CTCCCACCCT TCCTGAGCGA CTCGCCGGTG
CAGCCGTTGG TCGTGATCGT GGGGGCAGTA CTGGCGATTC TTTCTGGTTT GATCGTCGGA
ACACTGGCGG GGGCGACGCA CCTCGCGTGG ATTGCCGTTA CTGAACTGCC GCGCCATCTG
TTTGGGTTGT GCAGCGGACA TACGGACGGC GCAACACCTG AAGGATGGCC GCCAACGCTG
GGAGATGCCA GGGGAGGCGG CAAACCGCCC GCATTGACCG ATTGGCTTCA TGCGGTCATC
AACGGACTGG CAGGTCGCGA CGCCGATCAA CCGCCGCTGA CCTTCGGCGA ACTGGCAAAC
ACACCAGACG GACGGACGAT TTCACTACGC ATGATGACCA GCGACTTGAG CGAACACATG
CCGTATGTGA TCCCTAAAGA CCTGGGGCGT TTTCTCTTCG ATCCGACCGA ATTCGCGCGC
CTGTTCCCGA AGGTTGTTGT TGAACATATG CGCGCGCGAA GCATAGCCGC AGCGTACCAG
GTTCTGGATG AGCGCGGCGC AGTGCGAATG CTGCTGCCCC TGCCCGCCTG GCGCGATCTG
CCGGTCGTCG TTGGTGCGCG TATGAGCCTG AGTTTTCCGC TCCTGATTGC CGCCGTGCCG
CTCTACACCA TCAGCGTTGC CGGGCAACGC GAAGCGAGCG CCGGAAGGAC GTTGCGCGTA
GAACACCTTC AGCGCCATAT TTTCAGCGAT GGCGGGATTG CCAGCAACTT CCCGATCCAT
TTTTTCGACC GCTGGCTGCC AACCCATCCG ACGTTTGGCA TCAATCTGGT GCAATTGCCC
ACCGATGAAG CCGACAACGA ACGGTTTCTC CAGGCGCTGC TGACAAACAA TAGCGCAGAG
TCGTCCGACG AAACGCTGAA ACCGGAGATT ATTGTCAAGC AGGCATACCT GGGACGCGCC
GACTTTCGCG CCGCAGCGCC GGAACCACCG CCTGCCCGCG CCGCGACACC GTTCAGCGAT
CCACAAGGAG AGGTCTATCT GCCGCCTGCC GGACCGGGTG AAGATTATGT CGAATGGCAG
ACGATTGACA CTCTTCCGGC ATTCTTCCGG TCGATCTTCG GCACAGCGCA GAGTTACCGC
GATACGGCAC AGGCGCGCCT GCCAGGGTAC AACGAGCGCG TGGTGCGGGT GCGCCTGCGA
CCGGAGGAAG GCGGTCTCAA TCTACGCATG TCGCCACGGA TCATCGCCAA AATCGAGCGC
AAAGGGCGAC TCGCCGGGCG CGCATTGCTG CCGCAGGAAC CCGAAACTGC CGGAAGAAGG
CGCGGCGGCT TCCGTTTCGA CGATCATCGA TGGGTGCGCC TCGTGACGCT CCTTTCGGAG
ATTGATCAGC AACTCCGCGA TATGCGCACC GCCTACGATG ATGTGCAGGC AGAGTACCCC
ACGTTTTTGC GTCATGCGCT GACGAACGAC CATCTGCCGG TTTGCCCGCC CTACTACGCC
GGCAGCGCCG AAGAGCGCGC ACTCCTTGCG CGGCGCATCG AGGCGCTGAT TGCGCTCTAC
ACGCTGTGGA GCGATCTGGA CGAAGACACA ACAGCGCGCC TTAATACAAT CCTGCTGCGC
ATGAACAAAC GGACGCTCCA GAACCTGGCG CAACTCCTCG AACAACCAGA CGATCAGGAA
GCGTTGAACA GCCTGCACGA CTGGCTTGGC GCCCTGCACG CCGAGAAAGT GCGCGTCGCA
AAAGTGGCAG AACGCGAGGC GTCTCCGGTT GAGGATATGA TGGACCTGCG GGTCATGCCG
ACGGTGTGA
 
Protein sequence
MTHDSHSIYT DADFIKDDLR RCDLVMKGGI TSGVVYPPAI IELATRYRFV NIGGASAGAI 
AAAAAAAAEY GRAVPYAGHR SGFQRLDRLR AWLGEGEGNL AGLFQPFSRM KPLFHALFDL
VIASRAAPPK RPSASGRSPS VFRLPVAAIR FVWRALSFFV RITRLLARHH PSITLGIGVA
IALIGAAIWI LPPFLSDSPV QPLVVIVGAV LAILSGLIVG TLAGATHLAW IAVTELPRHL
FGLCSGHTDG ATPEGWPPTL GDARGGGKPP ALTDWLHAVI NGLAGRDADQ PPLTFGELAN
TPDGRTISLR MMTSDLSEHM PYVIPKDLGR FLFDPTEFAR LFPKVVVEHM RARSIAAAYQ
VLDERGAVRM LLPLPAWRDL PVVVGARMSL SFPLLIAAVP LYTISVAGQR EASAGRTLRV
EHLQRHIFSD GGIASNFPIH FFDRWLPTHP TFGINLVQLP TDEADNERFL QALLTNNSAE
SSDETLKPEI IVKQAYLGRA DFRAAAPEPP PARAATPFSD PQGEVYLPPA GPGEDYVEWQ
TIDTLPAFFR SIFGTAQSYR DTAQARLPGY NERVVRVRLR PEEGGLNLRM SPRIIAKIER
KGRLAGRALL PQEPETAGRR RGGFRFDDHR WVRLVTLLSE IDQQLRDMRT AYDDVQAEYP
TFLRHALTND HLPVCPPYYA GSAEERALLA RRIEALIALY TLWSDLDEDT TARLNTILLR
MNKRTLQNLA QLLEQPDDQE ALNSLHDWLG ALHAEKVRVA KVAEREASPV EDMMDLRVMP
TV