Gene Rcas_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1994 
Symbol 
ID5539472 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2559648 
End bp2561591 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content60% 
IMG OID640894129 
Producthypothetical protein 
Protein accessionYP_001432100 
Protein GI156741971 
COG category[R] General function prediction only 
COG ID[COG3211] Predicted phosphatase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0125565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGGAC GCGACGAGCA AGACAAACTG ATCGTCCGAT CGGTCGAGGG GAAAGGCAAG 
GAATTCGAGC GCATTCTTGA AGAGCGTCTG AGTCGCCGTG ATTTCCTCAA AGCCGCAGCA
GTCACATCGG GACTCGTCGT TGCTGCAACC GCGATGAACG CCGATGTTGC CGCAGCGCAG
ACGCGCCCGG CGCCGTTGCC GCCAAAGTTT GGCAAGGTTG CGCCGACTAC GCCGGAAGTG
GACGAGATCG CTGTGCCGGA TGGCTACTAC GCCGCAACGC TCATTCGCTG GGGGGAGCCG
ATCTTTGCCG ACGCGCCAGA GTTCGATGTC TGGACGCAGA CAAAGGAGAA GCAGGAGAAG
CAGTTCGGCT ACAATTGCGA CTACGTTGGC TACTTCCCGC TGCCGTCGTA CACCTCGAAT
AACTCGACGC GCGGGTTGCT GGTGGTGAAC CACGAGTATA CCAACCCAGA GTTGATGTTC
CCCGGCTACG ATGTCGAGAA TCCCAACCCC ACCAGAACGC AGGTCGATGT CGAACTTGCT
GCGCACGGTG TGTCGGTGAT CGAGGTTGCC CGCGGGCGCG ATGGGCGCTG GAATGTGGTG
CGCAATTCGC CCTATAACCG CCGTCTGACC GGTTACACGC CGATGACGGT CAGCGGTCCG
GCGGCCGATC ACGAGTGGAT GAAGACGAAT GCCGATCCGA CCGGGCGCAA TGTGCTTGGC
ACCCTTAACA ACTGTGCTGG TGGCAAGACG CCGTGGGGCA CGGTGCTGAC CGCCGAAGAG
AACTTCCACC AGTATTTTGC GAACCTGCGA GCGATGCCGA ACAGCGATTA TCGCAAGGCG
ATCCATAACC GCTACGGCAT GCCGAGCGGC GCCTCTGAGC GCAGGTGGGA GAATTTCCAT
GATCGCTTCG ATATTGCAAA GGAGCCAAAC GAAGGCTTCC GTTTCGGCTG GATCGTCGAG
TTTGATCCGT ATAATCCCAA TTCTGTGCCG GTGAAGCGCA CCGCACTCGG TCGCTTCCGC
CACGAAGCGG CAACGATCGT GATTGCGCCA TCCGGTCAGG TCGTGGCGTA CTCCGGCGAC
GATGCGCGCG GTGAGTATGT CTATAAGTTC GTCTCGAACG GGCGCTACAA CCCGCGCAAC
CGTGCGGCGA ACTTCAACCT GCTCGACGAT GGTACGCTCT ATGTTGCGCG CTTCAATGCG
GACGGCACCG GTGAGTGGTT GCCGCTGAAG CACGGCTTTG GTCCGCTGAA TGAGGGCAAT
GGTTTCATGT CGCAGGGCGA TGTGCTGATT AAGACGCGCA ACGCCGCCGA TGCGCTTGGC
GCAACCAAGA TGGATCGCCC GGAGGACATC GAGACCAATC CGGTCAACAA GAAGGTCTAC
ATTATCCTGA CGAATAACAG TCAGCGTGGC GTCGGCAACG GTCCGCCGGT TGATGCCGCC
AATCCGCGCG CCAACAACCG CGCCGGTCAC ATTATCGAAC TGACCGAAGA GGGCAATAAC
CACGCGGCGA CCCGCTTCAC CTGGAATATC TTCATTCTGG CGGGTCTGCC GACCGACGAG
TCAACCTACT TTGCCGGCTA CGACAAGAGC AAGGTCAGTC CGATTGGCGC GCCGGACAAC
ATTGTGTTCG ACCTGGCGGG CAACGCCTGG GTCGCTACCG ATGGCGCCGC CAGCGCCATT
AAACTGAACG ACGGCCTGTT TGCCATCCCG GTTGCCGGTT CGGAGCGCGG GCATGTGCAG
CAGTTCTTCT CGTCGGTGGC GGGCAGCGAG GTGTGCGGTC CGGAGTTTAC CCCCGATAAC
CGGACGCTCT TCCTGGCGAT CCAGCATCCG GGTGAGGGCG GCACATTCGA TAAGCCGATC
AGCACCTGGC CCGACCGCCA GGGTCTGGCG CGTCCGAGTG TCATTACCAT TCAGGCGTTC
GATAACCGGC GTATCGGTCG CTAG
 
Protein sequence
MGGRDEQDKL IVRSVEGKGK EFERILEERL SRRDFLKAAA VTSGLVVAAT AMNADVAAAQ 
TRPAPLPPKF GKVAPTTPEV DEIAVPDGYY AATLIRWGEP IFADAPEFDV WTQTKEKQEK
QFGYNCDYVG YFPLPSYTSN NSTRGLLVVN HEYTNPELMF PGYDVENPNP TRTQVDVELA
AHGVSVIEVA RGRDGRWNVV RNSPYNRRLT GYTPMTVSGP AADHEWMKTN ADPTGRNVLG
TLNNCAGGKT PWGTVLTAEE NFHQYFANLR AMPNSDYRKA IHNRYGMPSG ASERRWENFH
DRFDIAKEPN EGFRFGWIVE FDPYNPNSVP VKRTALGRFR HEAATIVIAP SGQVVAYSGD
DARGEYVYKF VSNGRYNPRN RAANFNLLDD GTLYVARFNA DGTGEWLPLK HGFGPLNEGN
GFMSQGDVLI KTRNAADALG ATKMDRPEDI ETNPVNKKVY IILTNNSQRG VGNGPPVDAA
NPRANNRAGH IIELTEEGNN HAATRFTWNI FILAGLPTDE STYFAGYDKS KVSPIGAPDN
IVFDLAGNAW VATDGAASAI KLNDGLFAIP VAGSERGHVQ QFFSSVAGSE VCGPEFTPDN
RTLFLAIQHP GEGGTFDKPI STWPDRQGLA RPSVITIQAF DNRRIGR