Gene Rcas_4047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4047 
Symbol 
ID5541558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5250349 
End bp5252997 
Gene Length2649 bp 
Protein Length882 aa 
Translation table11 
GC content65% 
IMG OID640896160 
Producthypothetical protein 
Protein accessionYP_001434098 
Protein GI156743969 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.353342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0213567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGACAC GTTTTGCCCT CGTTCTTGCA CTGGTCATCG GTAGTGTGTG CGGTGCGGCG 
GCTTCTGCCG CTGCCGGTGC GATTGCCGTT GTTCGTGGTG GTGACGAAAC GCTGGTTGTC
CAGGCGACAA CGAGCGGAAC GCGGATCATC TGGCGTCCTC CGGCGTCCGA TCCGTCTCTG
CATGTCGAAC CGGTTCTGGT CGCGTTGCGC CTCACCGGTG ATGCGACTAT CGCGCCTCGC
CTGCTGGCGC TCGATGATAC GCCGTGGACG GGCGACTTCG ACGATCCGCC AGGTGCGCCG
GTATTTGTGC TGCGCGAGGC GCGTCAGCGC GGTGAGCGTC TGGCTGTGCT GGCACTCAGT
CCGGTCTATC TGCGCGAGGG GCAGGCGCGC GCGGTGCGCA CGCTCGAAGC GCTGGTCGAG
GGCGCGGCGC CGCTGAGTGA CTCACCTGTT CCCGTCGCTG CATCATCACT TGCAGCAGGA
GAGCCCGCTT CTGGGCGTCC GCCAGCGTTT CCGGCGCCTC CGGCGCTGCG GGTGCGGGTG
GACCGTGCAG GTGTGCAGGT CATCCCCGTC AGTTCTCTGA GTTCGGCGAT CGCCGGAGCG
CCGGAGCGCC TCAAACTGAC CCGCGCCGGG GTGGAAATCC CGCTCGAACT GCGCGACGCG
AGCGGTAATG GCGTCTGGGG CGACCCAGAC GACGAACTAC GCTTTTATGC GCCGCCGCCG
GGTAACCGCT GGAACCGCAG CGATACGTAT TGGATCACGC TCGAAGCAGG ATCGCGATTG
CGCATTGCGT CCCGCGCGGT GAGTGCGCCA TCGGGCGAAG CGCCATCCAC TGCGCTGGAA
CGGGGTGTTG TGCGCGGAAC GGCCTACTAC GACTCGCGGC GCCCTGGCAG TGATGGCGAT
CACTGGTTTG CGAAGCTGCT GCGCGCTGAG GCGGGACAAC CGGCGGATGA CCAGGCGATG
CTGTCCGTTC CGCTCACGAC GACCCTTCCA ACCGCAACTG GAACGGTGAC GCTGACGGTT
GCGGTCCACG CGCAATCGGA TGGTGCGCGT CGCCTGACCG CCGCCATCGA GTCGAGCAGC
GGATCGCCGG TTGAGTGGAG CGGCAGCGGA GATGCACTCC TGACGCTCAG TGTAGCCGGT
AGCCCTGCTG CCACGACGCA AGTGCGCCTG ACGCTGACCG CTGTCGTCGG GTATGCACAG
GTGGCCGTGG ATACGGTCGA ATGGATGCGA CCGGTGCAGT TGCAGTTCGG CGGGAAAGGC
GCTGTCTTTC AGGGCGCGCC GGATCAGCGC GCCTACTGGT TGACTGGTGC GCCTTCCGGG
TTCGATCTCT ACGATATTAC CGATCCTGTG ATGCCGACGC GGTTACAGAT GCCCGCCGGT
TCCGCATTCG AGGATAGTGC GCCGGGGAAA TTGTATCTGC TGACCGGCGT CGGCACACGA
CACACGCCGA CGGTCGAGCC GTTCACGCCG CCAACGCTGC CAACCGATGC CAGTGTGCTG
TACATCGCTC CCGCGCCGTT CCATGCTGCG CTGACGCCGC TCGTGGACCT GCGGCGCGCG
CAGGGGTACA GCGTGGCGGT TGTGGATGTG CAGCACCTCT ACGACGGATG GAGTGACGGT
CAGGTCGATC CTGATGCGAT CCGCGCCTTT CTGCAATTTG CGCGTCCCCA GGCAGTGACG
CTGGTTGGCG ACGGGAGTTC TGATCCGTTC GACTATACCG ACCGTGGTGC GAAGAATGTC
AACCTGATCC CGCCATATCT GGCGATGGTC GATCCGTGGC TGGGCGAAAC CGCCTGCGAA
ACGTGTTACG CGCAACTGGA CGGCGAGCGA CCGACCGATG ACCGGCTGCC GGATGTCTGG
CTTGGGCGGC TGCCGGCAAA GAGCGTTGCT GAAGTGCAGT TGCTGGTGGC CAAGATCATC
AGGTACGAAA CGTCTCCATC CGGCGGCGCA TGGCGCAGCC GCGCGCTCTA CCTGGCGGAT
GATGCGGACA CCAGCGGCGA TTTTGTGGCT CAGGCGGAAG CGAGCATTGC GCTGCACCCG
GTAGGCGTCC AGATCGGTCG GGTGTTTTTT GGCAACGGTG CGGGAGCGTT TCCAACCGCT
GCCGCAGCGC GGACTGCTAC GCGGACACAG TTCGACAACG GCGCGGCGGC GGTGGTCTAC
ATCGGGCACG CGCATCAACA GCAGTGGGCG GTGACGGAGT TGAGCGCGCC GGAGAACTGG
CTACTCCATC GAAATGATGT CGCGGCGCTG ACCAATGGCG AGCGCCTGCC AGTGGTACTG
GCGCTCACCT GCCTGAGCAG CGCCTTTCAG TGGCCCTCAT ACGTGGGCAT GACTGTTGAT
GAGGCGCTGC TGCTGCACGA GAAGGGCGGC GCTGTGGCAG TCTGGGGACC GACGGGGCTG
GGCGTGTCCT ACGGGCACGA CAAACTGCAA CGGGGATTCT TCCGCGCCCT CTGGTCACCC
GCGCCAGATG TGGGGATTGA ACGCGCCGTG CCGCTTGGCG CGTTGACCAG CGCCGGGTTC
CGTGACCTCT TCACCGGAAG CGCCTGTTGT CAGGAAACGA TATTCACCTA TGCGCTGCTG
GGCGACCCGC TCACGCCGCT GCGGATGACG GCGGGGACGC GTGTGATGTT GCCGCTGGTG
CAGCGGTAG
 
Protein sequence
MMTRFALVLA LVIGSVCGAA ASAAAGAIAV VRGGDETLVV QATTSGTRII WRPPASDPSL 
HVEPVLVALR LTGDATIAPR LLALDDTPWT GDFDDPPGAP VFVLREARQR GERLAVLALS
PVYLREGQAR AVRTLEALVE GAAPLSDSPV PVAASSLAAG EPASGRPPAF PAPPALRVRV
DRAGVQVIPV SSLSSAIAGA PERLKLTRAG VEIPLELRDA SGNGVWGDPD DELRFYAPPP
GNRWNRSDTY WITLEAGSRL RIASRAVSAP SGEAPSTALE RGVVRGTAYY DSRRPGSDGD
HWFAKLLRAE AGQPADDQAM LSVPLTTTLP TATGTVTLTV AVHAQSDGAR RLTAAIESSS
GSPVEWSGSG DALLTLSVAG SPAATTQVRL TLTAVVGYAQ VAVDTVEWMR PVQLQFGGKG
AVFQGAPDQR AYWLTGAPSG FDLYDITDPV MPTRLQMPAG SAFEDSAPGK LYLLTGVGTR
HTPTVEPFTP PTLPTDASVL YIAPAPFHAA LTPLVDLRRA QGYSVAVVDV QHLYDGWSDG
QVDPDAIRAF LQFARPQAVT LVGDGSSDPF DYTDRGAKNV NLIPPYLAMV DPWLGETACE
TCYAQLDGER PTDDRLPDVW LGRLPAKSVA EVQLLVAKII RYETSPSGGA WRSRALYLAD
DADTSGDFVA QAEASIALHP VGVQIGRVFF GNGAGAFPTA AAARTATRTQ FDNGAAAVVY
IGHAHQQQWA VTELSAPENW LLHRNDVAAL TNGERLPVVL ALTCLSSAFQ WPSYVGMTVD
EALLLHEKGG AVAVWGPTGL GVSYGHDKLQ RGFFRALWSP APDVGIERAV PLGALTSAGF
RDLFTGSACC QETIFTYALL GDPLTPLRMT AGTRVMLPLV QR