Gene Rcas_3039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3039 
Symbol 
ID5540535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp3937649 
End bp3940060 
Gene Length2412 bp 
Protein Length803 aa 
Translation table11 
GC content63% 
IMG OID640895158 
Producthypothetical protein 
Protein accessionYP_001433111 
Protein GI156742982 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0719131 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCATC ATTTCCTTCC TCCGATTTTG GCGCTGGCGC TTCTCGTTGC CTGGCTATCT 
CCTGCGCCGG TGGACGTGCA GGCAGTCCCA GGGCGATCCG ATTTTGGCGT CAATACCCAT
ATCGCGTCGC GCTATCCGGT GCAGGCGGGT TTTGATGGAG CAGTCGATCT GGTCGCCGGA
ACGTCTGCTG GCTGGGTGCG CGAGGATTTC CACTGGTACT GGCTTGAACC GGAGCAGGGA
CGCTTCGACT GGGCGATCTA CGATCGCGCG ATCGCGCGCC TGGCGGGGAG CGGCGTCAAT
ATTATCGGCG TGCTCAACAC AGCGCCTGCC TGGGCAACAC CGTTTCCTGA CGATGCACCG
GGTCGTCCGT CCTACTATGC GCCTGATCCT GCGGCATTCG CCCGTTTTGC GTCGGCGGTC
GTGACGCGCT ATCGCGACCG GGTGCGCTTC TGGCAGGTCT GGAATGAACC GGAGAATGTG
CGCTACTGGC AGCCTCAGCC CGACCCTGCC GCCTACGCTG CGCTGCTGCG CGCTGCGTAT
CCTGCCATCA AGTCTGCCGA TCCGAATGCC GTGGTGTTGA GCGCAGGGGT CGTTCCGACG
AACATCGGGT TTATTCGTGC CATCGCCGAC AATGGCGCCT GGGGTTCGTT CGATATTCTT
GCCGTCCATC CCTATGTCGA TCCGTTCAGC CCGGAGAATG GGCAGATCGG CGCCGGGGAT
GTGAGCGCCG TGAAAACGCT GGTCGATAAT CTGGGGCGTA AGCCGATCTG GGCAACGGAA
TTCGGGTGGT CCACCGGTCC TGCCGATCGC GATCCGCGTG GAGTGGACGA GGAGACGCAG
GCGAATTTTC TGGTGCGCGC ATCGACGCTG TTGCGCGCCG CAGGCGTCGA GAAGGTGATC
TGGTATAACT TGAAGGATAC CGAACCGCGC AATCTCTATG GCTTGCTGCG CAGGGCTGGC
GGTCCTGCCG ATCTGAGTCA GCCGAAGCCG TCACTGGCGG CATTCCGCAC GTTGAACCAG
CAACTCGCGG GCGCCGCACC TGTCGGATTG ATCGACCTGG GAGCGCGGCA GGTGGTGGTC
GATTTCGAGC AGTTCGGGAC ATGGCGGCGC GGCGATCAGC CGAATGGCTC GTTCTCTGCC
GACGGTTCAC AACGCTACAC TGGCAACATC GCCGGTCGTC TCGACTACAT CTTCCCCGGC
GGCGGCAACG ATTTCGTCGT CTTCACGCCG CGTCCGGCAA TCCCGCTCCC CGGCTCGCCT
GGTCAGTTGG GTATCTGGGT CTACGGCGAC GGCAGCGGGC ATGCGCTCAA AGTCTGGCTG
CGCGATGCTG AAGGAGAAAC ACTCCAATTT CGCCTCGGTT TCGTCGGCAG CGCAGGGTGG
TCGTTTCTCT CCACGACGAT CAACGGTCAG GTCGAGCCGT ACAACCGTAT CAGCGGCGGC
GGCAACCTGC GCCTCGATTT TCCGGCGACG CTGGTTGCCA TTGTGCTCGA TGATGAACCG
GACAGTCGCA GCGGGTCTGG CTCGATCTGG CTCGATGACC TGACAGCGAT CAGCGGTCCC
GAAGCGTATG GTGTGCGCTT TGTGCGTGGC GGCGACACGA TCGATGCGCT CTGGTCGCCA
GGTGGTGCAT TTGTGAATCT ACCGACGGCA GCACCTGTGG GTACGGTGAT CGACCGCTGG
GGCAATGCCT CGCAGATCGA TGCTGGGAAT GGGGCATTTG GCTTGAGCCT GGGTCCATCG
CCAATCTTCC TGATCCATCG CGCTGTACAG GCGTTCGCAC CGCAACCAGC ACCCGCGCCT
GCGCCGGCGC AACCGGCGCC GCAACCGTCC GCCGGACCGT GCCGCTCGTT CCCAGAGACT
GGGTTCGCGG TGTGTGGTCG GCTGCTCGAA TACTGGGAGC GGAATGGCGG GCTGCCGGTG
TTCGGCTTTC CCATTGGTCC CGAAGAAGAC ATGCTAATTG AGGGGAAGAC CGTGCGAGCG
CAATGGTTCG AGCGCAACCG ACTCGAACTG CACCCTCAGA ATGCGCCTCC CTACGATGTG
CTGCTGGGGC GATTGGGCGC CGAACGGTTG GAGGCAAGCG GGCGCAACTG GTTCGATTTT
CCGAAAGGCG ATCCGCGCGC GTCGCTCTAC TTCCCACAGA CCGGACAGGC GATTGCGCCG
GAGTTCCGCC AGTACTGGTC GTCCCATGGG TTGAACCTGG ACGGGCGACC GGGGTATACG
CTCGAAGAAA GCCTGGCGCT CTTCGGGTTG CCGCTTTCGC CGCCGCTGAT GGAAGTCAAC
CCGACCGACG GCAAGACCTA CCTGACGCAA CACTTTGAAC GTGCGCGTTT CGAGTACCAC
CCGGAAAACC GTGGAACGCC GTATGTCGTG TTGCTCGGAT TGCTGGGACG CGAGACGACG
GGGAAGAGGT AG
 
Protein sequence
MRHHFLPPIL ALALLVAWLS PAPVDVQAVP GRSDFGVNTH IASRYPVQAG FDGAVDLVAG 
TSAGWVREDF HWYWLEPEQG RFDWAIYDRA IARLAGSGVN IIGVLNTAPA WATPFPDDAP
GRPSYYAPDP AAFARFASAV VTRYRDRVRF WQVWNEPENV RYWQPQPDPA AYAALLRAAY
PAIKSADPNA VVLSAGVVPT NIGFIRAIAD NGAWGSFDIL AVHPYVDPFS PENGQIGAGD
VSAVKTLVDN LGRKPIWATE FGWSTGPADR DPRGVDEETQ ANFLVRASTL LRAAGVEKVI
WYNLKDTEPR NLYGLLRRAG GPADLSQPKP SLAAFRTLNQ QLAGAAPVGL IDLGARQVVV
DFEQFGTWRR GDQPNGSFSA DGSQRYTGNI AGRLDYIFPG GGNDFVVFTP RPAIPLPGSP
GQLGIWVYGD GSGHALKVWL RDAEGETLQF RLGFVGSAGW SFLSTTINGQ VEPYNRISGG
GNLRLDFPAT LVAIVLDDEP DSRSGSGSIW LDDLTAISGP EAYGVRFVRG GDTIDALWSP
GGAFVNLPTA APVGTVIDRW GNASQIDAGN GAFGLSLGPS PIFLIHRAVQ AFAPQPAPAP
APAQPAPQPS AGPCRSFPET GFAVCGRLLE YWERNGGLPV FGFPIGPEED MLIEGKTVRA
QWFERNRLEL HPQNAPPYDV LLGRLGAERL EASGRNWFDF PKGDPRASLY FPQTGQAIAP
EFRQYWSSHG LNLDGRPGYT LEESLALFGL PLSPPLMEVN PTDGKTYLTQ HFERARFEYH
PENRGTPYVV LLGLLGRETT GKR