Gene Rcas_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0471 
Symbol 
ID5537934 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp608494 
End bp611673 
Gene Length3180 bp 
Protein Length1059 aa 
Translation table11 
GC content61% 
IMG OID640892634 
Producthypothetical protein 
Protein accessionYP_001430620 
Protein GI156740491 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.128084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCGCA ATATTGCTGT TCTCTGGCTT CTTGGTTTCC TCGGTGGTCT TTTGGTATTC 
GCTCCCCCAA CAGGACGCAC ACCGGTCGCT CACGCGACAG TTCCGAACGA ACAGAACTGG
ACGGTGACGC CTGGCACATG CGATCCGTCG TTTCAAGGCG CCGTCTTTGC TGCGTGGAAT
CCGGCATCGG GCAATCCCAA CTGTGGCGTC TTCACCTCTG GCGCGCCCTC GCTTAATCAG
TTCGATACCG GTCAAGTTTT TCCTCCGAAT GTGCTCCGGG ATGAAGCGAG TGTCACTGCA
CCATGTGAAA ATGGTCGCAC CGCCGGGCTG TGTTACCGAA TGTGGTACGT CGGCACGCGC
TCCGGCGAGC CGTATCGCCG GATTGGATAC GCTGTCTCGC CGGATGGCGT CTCCTGGTAT
CGCGTGCAGG GTCCGCACAC CGGCGGCAGC GTCTTCGAGG GGTCGGGACA GCCGGGCAGT
TTCGATGAGA ATGGCGCGAC CACGTTTCAC GTGATCAAAG ATGGCGGGGA GTACCGTATG
TGGTACACCG GCGTCAACAG CAGTGGGACC TGGAGAGGTT TCGGCTATGC AACCTCGAAC
AATGGCATTA CCTGGACGCG ACAAAACGAC GGGCTGCCGG TGTTGACTCG GCGCCTGGGA
TCGGGTTTGT TCGACGATGA CCGCATTATC GGACCGTTTG TGTTAATTGA TGAGGCAAGC
GCCACTGCGC CGTGTGAAAG CGGTCGCGCG AACGGGCGCT GCTTCCGCAT GTGGTATGAG
GGATTCCGGG CGGATAATAA CTTTTACATC GGACATGCAT TGTCGCCTGA TGGCATTAAC
TGGACGATTG TTAATGGACC TGACGAACTC GGCTCGGTTC TCTCCAATTC GGGCGGATTT
ACCGCCTTTG ATTCTAATGA TGTCGGCTTG ACTGCGGTCA TCAAAGATGG CGCGATCTAC
CGCATGTGGT ACCAGGCAAA AGATTACAAC ACGCCGGACA CCTTCAGAAT CGGGCATGTC
ACTTCGGTGA ATGGGGTCAA CTGGGTGCGT CCCGATCCGA ATGATCCTGC GTTCTATGGC
GGCTTAGACA CGATCAATCT TCCCGGAACC AACGATGATG TGTGGGTCGT CCGTCTTTTG
AAGGAAGACC TGACCTACCG TATGTGGTAC GCCACGGCGG GCACGCCTAA CAGCACCCGT
TTTGGTCTGG TCGAGATGAC GCAGGGCGTG CCGATTACGC CAACGGTGCG GCGCAGCGGT
GATGAGTTCA GGATTGAGTT CAACACACAG CGAACGATAC CGGTTAGTGG CAGTGTGTTG
ATCACCCTGC CGCCGGGCGT TTCGCTCGAC CAGTTTTCGG TCATCGAATT GCAGGGATTT
GAACCTGGCG CGGTCCTTGC GCGCGAGCGC GGCGCGATTA CCGATGCGTA CAGCGGCTTC
TCGGCGCGCG ATGCGCTGCT GCTGCGCCTG CCAAATGGCG CGGTTCCAGG ACCAAAGGTG
ATCCGCTTTA GCCTTGGCGC AGAGGCGCCC AATCCTGCCT ATCTTCTGCT CCAGACCTTC
GACACCCACA AGGTGCTCGA ACGGGCGCGC GTCAATCTGG GAGATCTGCG AATCACACAG
AGTGTGGGAA CAGTCGTTGC GGGCGCATCG GTCGTTTACA CCGTCACGGT CAGTAATGTC
GGTCCGAATG CGGTCTCGAA CGCCTTGCTG AACAGCGTCT TCCCAACGCA ACTGACGGGA
ATCACCTGGA CCTGCGCTCC ATCGGGAGGC GCGTCGTGCG CTCCGGGAAG CGGCAGCGGC
AACTGGAGCA GTAAGCCGCT CGATCTCCCA TCGGGCAGCA GTGTCACGTT TACGGTGACC
GGCAATCTGC CGCCTGCGGA AACCGGTAAC CTGACGGATA CTGTCAGCGT TTCCACACCG
GCTGTGCTCA ATGAACTGAC GCCTGGCGAT AATGTCTCGA CTCTGGTGAC GCCGATTGAG
GTGCGCGGCG ATCTGTCGAT CACACGTGCC AGCAATCCGG TGGTTCCGCA GGCGGGTCAA
CCGATCACCT ATACGCTGAC CGTGGCGAAC AGCGGACCGA GCACCGTCGT CGGCGCGAAT
GTGGTCAATA TGTTCCCGAT CAGTGTGACG AACGTCATCT GGAATTGCAG CGCCACATCC
GGTTCTGTCT GCCCGGCGCC GGGCAGCGGA AACATGAGTG CAGCGGTCAC CCTGGCACCT
GGCGGCATTG CCACGTTCAC TGCAACCGGT ATGGTGTCGC CATCTGCCGT TGTGATGCCG
CCGCACTCGG CGATGGTGAC CGTGCCGGGG AATGTGACAG ATCCCAACCC GGACAATAAT
GTGTTCACCG ATGGCGGCGG ATTGGGGCGG TCCGCCGATC TGGCGATCAG CAAGGCGGTT
GCGCCCGCAA CGGTTGTTCC CGGTCTTCCG GTCACCTACA CGATCACCGT GACCAACACC
GGTGCTGCCG ACGCAGATGG CGCAACGGTG CTCGACCTGT TCCCGCCGAC GATCACGAAT
GTGACGTGGA CGTGCAGCGG CGCGGCCGGC GCAGGTTGTG CGCAGGCAAG CGGCAGCGGC
GATCTGATGG TGACCCTGTC ATCATTCCCC GTGGGCGGAT CGGCAACGAT CACGATGACC
GGCATCGTAG CGGCGCAGGC GACCGGGAAC CTGATCAACA CAGCGCAGGT GCTGCCGCCG
GTTGGGGTTG AAGACCCGGC GTTTGCCAAT AACAGCGCGT CGATTTCGAG TGTGCTCCAA
CCGCGCACCG ACCTGTCCAT TGCGCAGACG ACGCCATCCC ACGCAGTTGT GGGGCAGACC
ATCACCTATA CCATCACGGT GCAGAACAAC GGTCCAAGTG TTGCTGCCGG CGCGCGCGTC
AGCACGACGA CCCCGGCGCA TGTTGTGGTG ACGGGATGGG TGTGCGCGGC GTCGGCGGGG
TCGCAGTGTG GCGCAGCGAG CGGCGCCGCG CCGGTAGACG ACGTCGTAAC GCTGGCGCCC
GGCGGCGCGA TCACCTACAC GGTCACCGGC ACGGTTTTCA ATCGGGCTGT GGGACAACTT
CCCTTCTCCG GCGCCGTCGT TGCGCCGGCA AGCGCCGAAG ACCCGGTGCT GACGAACAAC
CAGGCGCAGA GTTCAACCCA GGCGCTGTAT GTGGTGACGC TGCCGGTAGT TGTGCGGTGA
 
Protein sequence
MRRNIAVLWL LGFLGGLLVF APPTGRTPVA HATVPNEQNW TVTPGTCDPS FQGAVFAAWN 
PASGNPNCGV FTSGAPSLNQ FDTGQVFPPN VLRDEASVTA PCENGRTAGL CYRMWYVGTR
SGEPYRRIGY AVSPDGVSWY RVQGPHTGGS VFEGSGQPGS FDENGATTFH VIKDGGEYRM
WYTGVNSSGT WRGFGYATSN NGITWTRQND GLPVLTRRLG SGLFDDDRII GPFVLIDEAS
ATAPCESGRA NGRCFRMWYE GFRADNNFYI GHALSPDGIN WTIVNGPDEL GSVLSNSGGF
TAFDSNDVGL TAVIKDGAIY RMWYQAKDYN TPDTFRIGHV TSVNGVNWVR PDPNDPAFYG
GLDTINLPGT NDDVWVVRLL KEDLTYRMWY ATAGTPNSTR FGLVEMTQGV PITPTVRRSG
DEFRIEFNTQ RTIPVSGSVL ITLPPGVSLD QFSVIELQGF EPGAVLARER GAITDAYSGF
SARDALLLRL PNGAVPGPKV IRFSLGAEAP NPAYLLLQTF DTHKVLERAR VNLGDLRITQ
SVGTVVAGAS VVYTVTVSNV GPNAVSNALL NSVFPTQLTG ITWTCAPSGG ASCAPGSGSG
NWSSKPLDLP SGSSVTFTVT GNLPPAETGN LTDTVSVSTP AVLNELTPGD NVSTLVTPIE
VRGDLSITRA SNPVVPQAGQ PITYTLTVAN SGPSTVVGAN VVNMFPISVT NVIWNCSATS
GSVCPAPGSG NMSAAVTLAP GGIATFTATG MVSPSAVVMP PHSAMVTVPG NVTDPNPDNN
VFTDGGGLGR SADLAISKAV APATVVPGLP VTYTITVTNT GAADADGATV LDLFPPTITN
VTWTCSGAAG AGCAQASGSG DLMVTLSSFP VGGSATITMT GIVAAQATGN LINTAQVLPP
VGVEDPAFAN NSASISSVLQ PRTDLSIAQT TPSHAVVGQT ITYTITVQNN GPSVAAGARV
STTTPAHVVV TGWVCAASAG SQCGAASGAA PVDDVVTLAP GGAITYTVTG TVFNRAVGQL
PFSGAVVAPA SAEDPVLTNN QAQSSTQALY VVTLPVVVR