Gene RoseRS_4164 details


Gene Information

Locus tag: RoseRS_4164
Symbol:
ID: 5211148
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 5214021
End bp: 5215307
Gene Length: 1287 bp
Protein Length: 428 aa
Translation table: 11
GC content: 62%
IMG OID: 640597753
Product: isocitrate lyase
Protein accession: YP_001278458
Protein GI: 148658253
COG category: [C] Energy production and conversion
COG ID: [COG2224] Isocitrate lyase
TIGRFAM ID: [TIGR01346] isocitrate lyase
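
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming that the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the final codon is a stop codon. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
start_bp = 5214021
end_bp = 5215307

# Assumption: 1-based, inclusive coordinates, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1287
print(protein_length)  # 428
```

Both values agree with the Gene Length and Protein Length fields of the record.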


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.652432
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCTG TCAATCGTGA CCAGGAGATC GCTGAGCTGG AACGGCGCTG GGCGACCGAC 
CCGCGCTGGC AGGGCATCCG ACGCGACTAC AGCGCCGCCG ATGTGGTGCG GTTGCGCGGC
ACCCTCAAGA TCGAGTACAC GCTGGCGAAT GTGGGCGCCC GGCGGTTGTG GGATCTGTTG
CAGACCGAAC CGTATGTGGC GACGTTTGGC GCACTGACCG GCGCACAGGC AACCCAGATG
GTGCGCGCCG GGATCAAGGC GATCTATATG AGCGGCTGGC AGGTGGCTGC CGACGCGAAC
CTGGCGGGAC AGACCTACCC CGACCAGAGC CTCTATCCGT CGAACAGCGT CCCGGCGCTG
GTGCGTCGGA TCAATAACGC GCTCATGCGC GCCGACCAGA TCCACGCATC GGAAGGGCAT
AACGATATTT ACTGGTATGC GCCGATTGTC GCTGATGCCG AGGCAGGGTT TGGCGGTCCG
CTGCATGCGT TCGAGTTGAC GAAGGCGATG ATCGAGGCTG GCGCGTCGGG AGTTCACTTC
GAGGATCAAC TGGCGTCGGA GAAGAAGTGT GGGCACCTGG GTGGGAAGGT ACTGGTGCCG
ACATCGCAGT TTATCCGCAC GCTCACGGCG GCGCGCCTGG CGGCTGATGT GCTGGACGTG
CCGACGGTGC TGATCGCCCG CACCGATGCG CAGGCAGCGA CGCTGCTCCT GTCTGATGCC
GATGAGTACG ACCGCCCCTT TATCACCGGT GAACGCACAC CGGAGGGCTT TTACCGGGTG
AAGAGCGGGC TGGATGCGGC AATCGCCCGT GGTCTGGCGT ATGCGCCCTA TGCCGACCTG
ATCTGGTGTG AGACGGCGCA TCCCGATCTG GACGAGGCAC GGCGGTTCGC CGAGGGAATC
CACGCGAAAT TCCCCGGCAA GATGCTGGCG TATAACTGCT CGCCGTCGTT CAACTGGAAA
CGCTACCTGG ATGAAGCGAC CATCGCAACG TTCCAGCGCG AACTTGCAGC AATGGGGTAC
AAGTTCCAGT TCATCACGCT GGCTGGCTGG CATATGATCA ACTACTACGC CTTCGAACTT
GCAAAAGCGT ATGCTGCCGA GGGAATGACC GCGTATGTGC GATTGCAGCA GGCGGAGTTT
GCGGCTGAGC AGCATGGGTA TACCGCAACG CGCCATCAGC GCGAGGTGGG AACCGGCTAT
TTCGATGAGG TGTCCACCAT CATTTCTGGC GGTCTCTCCT CAACAACGGC GCTCGCCGGT
TCGACCGAAG AAGAGCAGTT TCATTGA
 
Protein sequence
MSAVNRDQEI AELERRWATD PRWQGIRRDY SAADVVRLRG TLKIEYTLAN VGARRLWDLL 
QTEPYVATFG ALTGAQATQM VRAGIKAIYM SGWQVAADAN LAGQTYPDQS LYPSNSVPAL
VRRINNALMR ADQIHASEGH NDIYWYAPIV ADAEAGFGGP LHAFELTKAM IEAGASGVHF
EDQLASEKKC GHLGGKVLVP TSQFIRTLTA ARLAADVLDV PTVLIARTDA QAATLLLSDA
DEYDRPFITG ERTPEGFYRV KSGLDAAIAR GLAYAPYADL IWCETAHPDL DEARRFAEGI
HAKFPGKMLA YNCSPSFNWK RYLDEATIAT FQRELAAMGY KFQFITLAGW HMINYYAFEL
AKAYAAEGMT AYVRLQQAEF AAEQHGYTAT RHQREVGTGY FDEVSTIISG GLSSTTALAG
STEEEQFH
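
To compare the gene sequence with the protein sequence, the 10-character grouping has to be removed before translating. The following is a rough sketch in plain Python with no external libraries. `clean` and `translate` are hypothetical helper names, not part of any database tool. The codon table is the standard one, on the understanding that translation table 11 shares its codon assignments and differs only in the permitted start codons. Only the first and last lines of the gene sequence are used here, and the last line starts on a codon boundary because the 1260 bases before it are a multiple of three.

```python
from itertools import product

# Standard codon table in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCCGCTG TCAATCGTGA CCAGGAGATC GCTGAGCTGG AACGGCGCTG GGCGACCGAC"
last_line = "TCGACCGAAG AAGAGCAGTT TCATTGA"

print(translate(clean(first_line)))  # MSAVNRDQEIAELERRWATD
print(translate(clean(last_line)))   # STEEEQFH
```

The two outputs match the start and the end of the protein sequence listed in the record.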