Gene RoseRS_3394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3394 
Symbol 
ID5210371 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4263285 
End bp4264859 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content58% 
IMG OID640596990 
ProductHEAT repeat-containing PBS lyase 
Protein accessionYP_001277703 
Protein GI148657498 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.232744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.348803 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGTC AACGAGATGT GCAAGCGCTA ATCGAAGTAC TGGGACATGC AAAAGATGAG 
CAAGTGCGCC AGGCTGCGCT TGAAGCGCTG ATCGAGATTG CGCTCTCCCT GGCTGCCGAA
AAGGAAGCCA GAGAAATCGA AGATTTTTTT CAGCGATTGC TTCAGGCAGG CACGCCAGGC
GAGGCATCCC CCCAAACGCC GCCTGAGGAC GAAAACGCGG CGCGATACCA GGCCGCCATC
CAGGCGCTGG GAGAAATCGG GGCAACTGCC ATTGCGCAAC TCCTCAGCGC ACTGAAGAAT
CCACAAAAGC GTCGTTACGC CGCCCAAATC CTGGAGCAAA TCGGCTGGCA GCCTGGGCTG
GATGAAAACG GCGCTCTGTA TTGGATCGCA AAAGGCGAAT GGGATAGATG CGCCGCGCTC
GGTGGGATTG CCATAGCGCC GCTGTTACTC GTCCTTCAGG AACAGGACAA CGAAACGCGC
CGGGCTGCTG CCCATGTCTT GGGACAAATC GGCGATGCCC GCGCCGTAGA GCCGCTCCTT
GATTTACTCG TGGATCAGGA CCCAAAGGTG CGCCGGGCAG CCATCGAAGC CATTGGCCGG
ATTGGCGATC CCCACACCGT TGAGGCGCTT GAACTGGCGC TCCAGGACAG GGAGGGGAGC
GTACGCCTGG CGGCGACGAG AGCCTTGGGG CAAATTGGCA GCCCGCGCGC GGTGGAAGCG
CTGATCGCTA CCTTGCAGGG CAGGTTCGCG CATAGACTTC TTCTGGAAGA ACTGGACAAG
ATGAAATCGG AAGATGCTCA CCAGATTGCG GCGCTTCTCA GCTCTTGGGA TGAAAACAGC
GCGCCTGTGC ATCTTCTTCT CCAACAAGCG CTGAAAGCAG CCAGCCATCC GTTTGTCGTC
TATTTGCTTA CCACAATCCT GACCGAATGG AAGAACGAGC GTGCTTCTGC TATAGAGGCG
CTGGGGCGAA TTGGCGATTC CCGCGCCGTG GAACCGCTCA TCGCTGCGCT CAAGGACGAG
GACGTGAATG TGCGCTGGCC CGCCGCCCGT GCGTTGGGAG AAATCAAAGA CACCCGCGCC
ATCAAGCCGC TCATCGCCGC GCTCAAGGAT TGGCATAGCA ATGTGCGCAA AGCCGCCGCC
AAAGCGCTGG TCAAAATCGG CACACCAGCC GTGGAGCTGC TCATCGCCGC GCTCAGGGAC
GAGGACGAGA GGGTGCGCCA GGCCGCCGCC GAAGCGCTGG ATCATCTGGG CTGGAAACCT
GCCCGAGACG AAAATGCAGC GTGGTATTGG GCAACCAAGG GTAAGTGGAA CGCGTGCGTG
GATATCGGCG CACCTGCTGT GCAGCCACTT ATTACAAGCC TGCGCACAAA TGACCCCTAC
ATGCGTAGAG ACGTGGCGCA GGCATTGGTA AGACTGTATC AAAAGGCCGG TATTTCTCAA
GAGGACCGAG AGCAAATTTT GGCAGTTCGT GACCGCTTGA TACGTATTCA TGATGACCAA
CCCCATATAG ATCGATCTAG GGATTGCTCT AGACCACATA CTGATACAGA GCTGGGCGTG
GAATTCCCCT TATGA
 
Protein sequence
MKSQRDVQAL IEVLGHAKDE QVRQAALEAL IEIALSLAAE KEAREIEDFF QRLLQAGTPG 
EASPQTPPED ENAARYQAAI QALGEIGATA IAQLLSALKN PQKRRYAAQI LEQIGWQPGL
DENGALYWIA KGEWDRCAAL GGIAIAPLLL VLQEQDNETR RAAAHVLGQI GDARAVEPLL
DLLVDQDPKV RRAAIEAIGR IGDPHTVEAL ELALQDREGS VRLAATRALG QIGSPRAVEA
LIATLQGRFA HRLLLEELDK MKSEDAHQIA ALLSSWDENS APVHLLLQQA LKAASHPFVV
YLLTTILTEW KNERASAIEA LGRIGDSRAV EPLIAALKDE DVNVRWPAAR ALGEIKDTRA
IKPLIAALKD WHSNVRKAAA KALVKIGTPA VELLIAALRD EDERVRQAAA EALDHLGWKP
ARDENAAWYW ATKGKWNACV DIGAPAVQPL ITSLRTNDPY MRRDVAQALV RLYQKAGISQ
EDREQILAVR DRLIRIHDDQ PHIDRSRDCS RPHTDTELGV EFPL