Gene Rcas_1245 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1245 
Symbol 
ID5538714 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1607754 
End bp1608815 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content60% 
IMG OID640893380 
ProductHEAT repeat-containing PBS lyase 
Protein accessionYP_001431360 
Protein GI156741231 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGACC AGCATCTCTG GCGGCAACAG CTCATCGACT ACCTTGAGGT TTTTGCGCGT 
CATCCGCATC AGGAAGTTCA ACTGGCAGGC AGTCCTGGCG TTGTTCCGCA CCTGGCATTG
CGCACGCTGA CGCCGTTCAT GAATGCGCTC TATGATAATC CGGTCGATGC TATCGTTACA
CTGGCGACGA TCACACACGA TGCTGGCGCC AATCTTCTCG TCAGCAGGAT GATACGCACC
CGCTATCCAA CGGCGCTCGA TATCGAGCGC AACCTGCGCG CCAGCACCGA TCTCTGCATC
ACGGTCGAGC GCCTGATGAT CGAGTTGCAG ACGATCCCGA TTGCGCTGCG AAACCTCAAT
GCACGGCGCG GTTTCTGGCT GCGCAGCACG ATTGAGCGCG AATTGGACGC ATACCCCTGG
TCGTTTGTGT GGATCCGTGA TCTGTTGCGC GAACTAAACG GTCAGGATCG TATCGAAGCA
CTCCGCCGTC TGCGGTCGCG CAATGGTCGA TATACGAGCG GCGATCTGAC GCTAATCGAG
AAGGCGCTGT CGGACGCTGC GGCACAGACA CGCGCGTATG CCGCGCGTCT GCTGGGGGTC
ATGGTCGATA CCCCGCCACC CACACTGACC CATCGTTTGC TAGACGTTGC GCTGCGCGAC
TCCGATGCGG AGACACGCTT TGCAGCGGCG CGCACAATCG GGCTGCTGCG TGAGCGTATC
ATCACCGGTG ATGCGCTGGC ATACATTGAG AGTCATCTCG TCCACGACGA TAGTTTCTTT
CGCTCATCGG CGGCACTTGT GCTGGGGCAA TTGGGGGAAC ACGCAGGCAA GCCGACCGTC
GTACAGCGTC TGTGCCAGTT GCTGTGCGAC CGCGACGCTT ATGCGCGCGA GGCGGCAGCG
CGCGCGCTGG GACGCATCGG ACCTCCCGCT GCACTGCCAC AGGTTATCGA AGCGCTTGAA
CAAACCACGC AAGATGATGA TGTGCAGGTT CACGAGGCAG CGACCGACTC ACTCGCGCTG
CTGCGTCAGT ATGCCGAACT GAGCGTGCAG GCGGCAGCGT AG
 
Protein sequence
MFDQHLWRQQ LIDYLEVFAR HPHQEVQLAG SPGVVPHLAL RTLTPFMNAL YDNPVDAIVT 
LATITHDAGA NLLVSRMIRT RYPTALDIER NLRASTDLCI TVERLMIELQ TIPIALRNLN
ARRGFWLRST IERELDAYPW SFVWIRDLLR ELNGQDRIEA LRRLRSRNGR YTSGDLTLIE
KALSDAAAQT RAYAARLLGV MVDTPPPTLT HRLLDVALRD SDAETRFAAA RTIGLLRERI
ITGDALAYIE SHLVHDDSFF RSSAALVLGQ LGEHAGKPTV VQRLCQLLCD RDAYAREAAA
RALGRIGPPA ALPQVIEALE QTTQDDDVQV HEAATDSLAL LRQYAELSVQ AAA