Gene Rcas_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1074 
Symbol 
ID5538540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1391791 
End bp1393443 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content48% 
IMG OID640893210 
Productmodification methylase NspV 
Protein accessionYP_001431193 
Protein GI156741064 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.77087 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGATC AAAAACAACG CAAAGCCGAA TTTGGAGATT TTCAAACGCC CATCAGATTA 
GCCAGGGAAG TATGTTCTCT CATTGCTCGG ACCGGTTTTC GTCCCGCTTC GATTCTCGAA
CCAACATGTG GGACGGGTTC ATTTCTCAAA GCATCTCTCG AAACATTCCC AGATGTATCG
CGTGTTCTTG GCTTTGAGAT CAATCCGCAC CACGTGTTGC AAGCGCAGTA TGCTGTCGCA
CCCGCATTTC CTCATGCGTC TATTGAAGTT CATCAGTCTG ATTTCTTTCT CACGAGTTGG
TCTGAGATTG TTAAAGCGTT GCCTGAGCCC ATTCTTGTTA TCGGCAATCC ACCCTGGGTG
ACGAATGCAG CGTTGAGCAC TTGGGGCAGT AGCAATGTTC CGATGAAATC AAACCTCGAC
AATCTCCCTG GTATTGATGC GCTCACCGGC AAAAGTAATT TCGACATTTC GGAATGGATG
CTTAGAAAGA ACATCGAATG GCTTAATGGC AAGAACGGCT TACTTGCAAT GCTTTGTAAA
ACGACAGTAG CACGTAAAGT TCTCTTGTAC GCTTGGCAAA ACGGGCTGCG GATCGAGTCG
GCATCACTTT ATATCCTGGA TGCGCGGGAA TACTTTAGAG CTTCAGTTGA CGCTTGCCTT
CTGGTAGTTC GCAGCAATTC GACCGGCAAC AGCAAAGAAT GCCAGGTTTT TCCTTCTCTT
CATGCACAAC AGCCCCATAG CTTATTCGGT TTGCAGGATG GAATGCTTGT GGCTGATGTC
AAATCATACC TGAAACGGAA AGACCTCACA GGGACAGGCT TTAGGGGCTG GCGGTCAGGA
ATAAAGCATG ATTGCAGCAA CGTCTTTGAG CTGCGCATTG AGTGTGGGAA TCTTGTTAAT
GGCCTGGGAG AATTCGTTGA TATTGAACCC GAAGTGCTCT TTCCTCTGCT CAAAAGTTCT
GATCTCGCAG CGCATAGGAA GCCGCATCGG TGGATGCTTG TTCCTCAACG GGCAATGAGT
GACGACCCGA GCCGTCTTAG GTTGGACGCT CCCAAGGCCT GGAATTACCT TACTGCCCAT
GCACATCTTT TGGACGAACG AAAGAGTTCA ATATACAGGA ACCGTCCGCG CTTCTCAGTC
TTTGGAGTTG GACCATATTC ATTTGCTCCC TGGAAGATTG CTCTTTCGGG TTTATACAAG
AAACTTGAGT TTGTTCAAGT TCCACCGTTT CTGGAACGCC CGGTGGTTTT CGATGACACA
TGTTATTTTT TCCCATGTCA GTCTGAAGAA GAATGCAACC TATTGTACGA ATTGGTCACA
TCCGAACCTG CCAGAGAGTT CTGGTCTGCA TTCATTTTCT GGGATGCAAA GCGGCCAATT
ACGGCACAAC TTCTTAATTC ACTTGATCTG ATGGCTCTTG CACGCCTTTT GGGTAAGGAA
TGTGATAGAG TACGGACTCT TGCAGAAAGA CAGATTGTAG AATATACGGA AGGGGTCTTC
CAGAGACTCC TTTTCAGAGA AGAAACTGCT GACTATGAGA GTGATCTCGT TGCAAACGAA
TTAGATTTGC CAGCCGCCCA ACACGCGCTT CCAGCCGACG CCGCTTCACT GCTCCTTCGC
TTCGCTCAGG GCAAGGCCTC GCGGCGCGGT TGA
 
Protein sequence
MRDQKQRKAE FGDFQTPIRL AREVCSLIAR TGFRPASILE PTCGTGSFLK ASLETFPDVS 
RVLGFEINPH HVLQAQYAVA PAFPHASIEV HQSDFFLTSW SEIVKALPEP ILVIGNPPWV
TNAALSTWGS SNVPMKSNLD NLPGIDALTG KSNFDISEWM LRKNIEWLNG KNGLLAMLCK
TTVARKVLLY AWQNGLRIES ASLYILDARE YFRASVDACL LVVRSNSTGN SKECQVFPSL
HAQQPHSLFG LQDGMLVADV KSYLKRKDLT GTGFRGWRSG IKHDCSNVFE LRIECGNLVN
GLGEFVDIEP EVLFPLLKSS DLAAHRKPHR WMLVPQRAMS DDPSRLRLDA PKAWNYLTAH
AHLLDERKSS IYRNRPRFSV FGVGPYSFAP WKIALSGLYK KLEFVQVPPF LERPVVFDDT
CYFFPCQSEE ECNLLYELVT SEPAREFWSA FIFWDAKRPI TAQLLNSLDL MALARLLGKE
CDRVRTLAER QIVEYTEGVF QRLLFREETA DYESDLVANE LDLPAAQHAL PADAASLLLR
FAQGKASRRG