Gene Rcas_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1841 
Symbol 
ID5539319 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2351159 
End bp2352307 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content60% 
IMG OID640893979 
Productphosphotransferase domain-containing protein 
Protein accessionYP_001431950 
Protein GI156741821 
COG category[R] General function prediction only 
COG ID[COG0613] Predicted metal-dependent phosphoesterases (PHP family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCACT ACCCTGGCGC CATTCATATG CACACGCGCT TCTCCGATGG CAGCGGCAGT 
GTCGAAGACC TGGCGTGCGC AGCGCGTGAT GCGGGACTGC GCTGGATCAT CATTACCGAC
CACGATGATC TCCAGGCAAA GCGATACGAG GGGTGGCTGC ACGACGTGCT GGTGATCGCC
GGTCACGAGA TCACGCCGCC GCGCAACCAT TTTCTGGCGC TTGGTATCGA CCGCGTCATC
GACAAGCGTC TTCCGCCGCA GGAGTTTATC AATCAGGTCT ACGACGCTGG CGGCTTTGGC
ATCATTGCCC ATCCCGACGA GCGGGTGAAG AATAGTTTTA AAGATGTCTA CCGCTGGGAC
GATTGGGGAA TCGACGGTCC GCGTGATCGC AATGGACGCA CCGTTGGGAT CGAACTATGG
AACCTGATGA GCGACTGGGG GGAACATCTG ACCCGGCGCA ACAAAGAGGT GATCTATTTC
TTTCCGCGCC TGGGCATCAG CGGTCCGACG GCAGAGACGC TCGCCTGGTG GGACCGGCTC
AACATGGCAG GGAAGCGCAC TTTTGGCATT GGCGGGGTCG ATGCCCATGC ATTTGTGCGC
AAGACGCTCT GGGGACGGGT CGAGGTCTTT CCCTACCGCT GGATGTTTGG TACGTTGACG
AATTATGTGG TTCTGCCAGA TCGACTGCCG CTCGATGTTG CCGAGGCAAC CCGAACCATC
CTCAACGCGC TCGCTGCCGG TTGTTCGTAT TTTGTCAACC GACTCGACGG TGATTGCCCG
GCGTTGACGT TTTACGCAGC ACGCGGAGCA GCATACTGGC ATCCGGGCGA TACTGCCGAT
CTGCGCGATG GTCCGCTCAC GTTCATGGTT GATGTCGGGT GTGATGCGCA GGTGCATCTG
ATCCACGATG GACGCATTCT TGCGCGTGGC GCGCGTCTAC TGCGCCATTC GGTCATGCTG
CCGGGAGTCT ACCGCATGGA AGCGTATCGC CGTGGAATGC CGTGGTTGTA TACCAACCCG
GTGTATGTTG TAGGCGTGGG GCGAGAGGTG AGAGGCGAGA GGGGGGGAAG GCGAGAGGTG
AGAGGGGGGA AGGGGGGAAG GCGAGAGGGG GGGAAGGCGA GAGGCGAGAG GGGTCCGACA
ATGGCGTAG
 
Protein sequence
MYHYPGAIHM HTRFSDGSGS VEDLACAARD AGLRWIIITD HDDLQAKRYE GWLHDVLVIA 
GHEITPPRNH FLALGIDRVI DKRLPPQEFI NQVYDAGGFG IIAHPDERVK NSFKDVYRWD
DWGIDGPRDR NGRTVGIELW NLMSDWGEHL TRRNKEVIYF FPRLGISGPT AETLAWWDRL
NMAGKRTFGI GGVDAHAFVR KTLWGRVEVF PYRWMFGTLT NYVVLPDRLP LDVAEATRTI
LNALAAGCSY FVNRLDGDCP ALTFYAARGA AYWHPGDTAD LRDGPLTFMV DVGCDAQVHL
IHDGRILARG ARLLRHSVML PGVYRMEAYR RGMPWLYTNP VYVVGVGREV RGERGGRREV
RGGKGGRREG GKARGERGPT MA