Gene Rcas_0923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0923 
Symbol 
ID5538389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1214353 
End bp1215777 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content61% 
IMG OID640893072 
Producthypothetical protein 
Protein accessionYP_001431055 
Protein GI156740926 
COG category[R] General function prediction only 
COG ID[COG5621] Predicted secreted hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.946452 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGTCTG AATTTGACGA GCAGGCTGAT GCTTCGATGA TGATCAATGC CGGTGTGCGC 
CGGCAAGAGT GGCGCTCCTA TCCGTACCTT CTCGTTCCCG ACGATCCACA ACTGCGCTTC
CCACAGGCGG AAGGATACCA GGACATGGCG AGCGACACCT ACTACGCTTC GGGGATTGTG
CAGGGTGAAC AGACCGGAAA ACGGTACGCT TTCTTCGTCA TTTTTGCGCG TCTCTCCGGC
TTTTCGAGCG CTTCCGGCAT CGATATGCAC CTCGGCGCGC TGTTCGATCT TGCGAACGGC
GGATATACCA CATTCGCGTC GTATGACCTG CCTCCAAAAC GCTGGTTTCG TCAACGTCTG
ACGATTACAC GCGGTCATCT TGGCGTCGCC TGGAACTCAC CGTGCTGGAA GAGCCGCTTC
TGGGCGCGGT ACGATGCTTC CGGCGATCTC GTGCCATTCG GGTATACGCT CGATGTATGC
GGGCGCGACA GTCGTGGCGA TCCACTGGCG CTCGATCTCG TTGTCGATGC GGTCAAACCG
CCACAACCGG TCGGTGGTCC GGTTCACAAC GGCGCCATCA CTGTGATGGG GCAACCGAAT
ACCCGCTCCT ACTTCCAGAG CCTGAGCTAT CGCGGGTCGC TCCGGTGGCG TGGTGTGGAA
GAAGCGGTGT GGGGCGACAT CGGTTGGCTC GACCGGCAGT GGTTCCCGGA GTATGTTGGC
GCGTACTCCG GCATACTGGC GGATCGCTAC AGCCACCAGT GGGCGCAGAT GTCGTTCGAC
AACGGGTGGG AACTGAGCCT GTGGCGGAAC TTTGCACGTC ACGAGCGCAA CCGTGAAATT
CCGTTCAGCG GACTGACGAT CACCGATCCT GAAGGGCGCA CGTCGTTCAC CGATGCGTAC
CGGATCGAAG CACTCAGTTA CTGCCGTGAC GAAGGGTATG TCACGCCGCT CTACGCGCCT
GTTCAGCGTC TCTTTGGCGT GCGAGGCGAC AGGCGCTACT TTCTCGACGC CTACCGGTTC
CATGTGCCGT CGCTCGATCT GATTGTGACC AGCACGCCGC TGGCGCCTGC CCCGGCGCAC
CGCATGCCGG TCGATTATCT CACCGGACCG ACCCGCCTGG AAGGAACAAT GGCAGGGAGA
CCGGTGACCG GCTACGGGTT CAACGAGCGC ACGCTGGGGT TGTGGCGACC GTGGGAGTTG
TGCCAGGCGC TCGCCGACTC GCTGCGCCCT CTGGTTGACG AAGGAAAGGC GCCGTCCACA
CTGGTGCAGG CAATAGACGA CGCGCGCCTC GCCATTGACA CAAAGCGCAC CAACGAAGCA
AGACGCATTC TCGACCGGCA GGTGCGTCCC GCGCTCGACA CCCTTCCCGA ATCGCAGTGT
CAACGCCTGA TTCGCCTGGG CAACGATCTG GCGGCAATGC TGTAG
 
Protein sequence
MTSEFDEQAD ASMMINAGVR RQEWRSYPYL LVPDDPQLRF PQAEGYQDMA SDTYYASGIV 
QGEQTGKRYA FFVIFARLSG FSSASGIDMH LGALFDLANG GYTTFASYDL PPKRWFRQRL
TITRGHLGVA WNSPCWKSRF WARYDASGDL VPFGYTLDVC GRDSRGDPLA LDLVVDAVKP
PQPVGGPVHN GAITVMGQPN TRSYFQSLSY RGSLRWRGVE EAVWGDIGWL DRQWFPEYVG
AYSGILADRY SHQWAQMSFD NGWELSLWRN FARHERNREI PFSGLTITDP EGRTSFTDAY
RIEALSYCRD EGYVTPLYAP VQRLFGVRGD RRYFLDAYRF HVPSLDLIVT STPLAPAPAH
RMPVDYLTGP TRLEGTMAGR PVTGYGFNER TLGLWRPWEL CQALADSLRP LVDEGKAPST
LVQAIDDARL AIDTKRTNEA RRILDRQVRP ALDTLPESQC QRLIRLGNDL AAML