Gene Rcas_4375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4375 
Symbol 
ID5541888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5628515 
End bp5629801 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content61% 
IMG OID640896481 
Productisocitrate lyase 
Protein accessionYP_001434417 
Protein GI156744288 
COG category[C] Energy production and conversion 
COG ID[COG2224] Isocitrate lyase 
TIGRFAM ID[TIGR01346] isocitrate lyase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGTT TACATCGCGA CCGGGACATC GCCGAACTGG AGCAGCGCTG GGCGACCGAT 
CCGCGCTGGC AGGGCATCCG CCGTGACTAC AGCGCCGCCG ATGTGGTGCG TTTGCGCGGG
ACGCTGAAAG TCGAGTACAC CCTGGCAAGC GTGGGGGCGC GACGCCTGTG GGACCTGTTG
CAGAACGAGC CGTATGTTGC CACATTTGGG GCGCTGACCG GTGCACAGGC GACGCAAATG
GTTCGCGCCG GTCTTAAGGC GATCTATATG AGTGGCTGGC AGGTCGCTGC CGATGCGAAT
CTGGCGGGGC AGACCTATCC CGATCAGAGT CTCTATCCGT CGAACAGCGT CCCGGCGCTG
GTGCGGCGGA TCAATAATGC GCTCATGCGT GCCGATCAGA TTCACGCATC GGAAGGTAAG
AACGATATCT ACTGGTATGC CCCAATCGTT GCCGATGCCG AGGCGGGCTT TGGCGGTCCG
CTCCATGCCT TTGAATTGAC GAAGGCGATG ATCGAGGCCG GCGCAGCCGG CGTCCATTTC
GAGGATCAGT TGGCATCGGA AAAGAAGTGT GGTCATCTGG GCGGTAAGGT GCTCGTGCCA
ACGTCGCAGT TCGTCCGCAC GCTGACGGCG GCGCGCCTGG CAGCCGATGT GCTCGATGTG
CCAACGGTGT TGATTGCGCG CACCGATGCG CAGGCGGCGA CGCTCCTCCT CTCCGATGCC
GATGAATACG ATCGCCCCTT CATTACCGGT GAGCGCACGC CGGAAGGTTT CTATCGCGTG
AAGAGCGGCC TGGATGCGGC GATTGCCCGT GGTCTGGCGT ATGCGCCGTA TGCCGATCTG
ATCTGGTGCG AGACGGCGCA CCCCGATCTG GACGAGGCGC GCCGGTTCGC CGAAGGCATC
CACGCGAAGT TCCCCGGTAA GATGCTGGCG TACAACTGCT CGCCATCCTT CAACTGGAAG
CGCAACCTGG ACGACGCAAC GATCGCAGCG TTCCAGCGCG AACTGGCGGC AATGGGCTAT
AAGTTCCAGT TCATTACGCT GGCGGGCTGG CATATGATCA ACTACCACGC CTTCGAACTC
GCCAAAGCCT ATGCTGCTGA AGGGATGACC GCGTATGTGA AGTTGCAGCA GGCGGAATTC
GCCGCCGAGC AGGAAGGATA TACCGCAACC CGACATCAGC GCGAGGTGGG GACCGGCTAT
TTCGATGAGG TGTCCACGAT TATCTCTGGC GGTCTGTCTT CGACGACTGC ACTCGCCGGT
TCGACCGAAG AGGAGCAGTT CCATTGA
 
Protein sequence
MAGLHRDRDI AELEQRWATD PRWQGIRRDY SAADVVRLRG TLKVEYTLAS VGARRLWDLL 
QNEPYVATFG ALTGAQATQM VRAGLKAIYM SGWQVAADAN LAGQTYPDQS LYPSNSVPAL
VRRINNALMR ADQIHASEGK NDIYWYAPIV ADAEAGFGGP LHAFELTKAM IEAGAAGVHF
EDQLASEKKC GHLGGKVLVP TSQFVRTLTA ARLAADVLDV PTVLIARTDA QAATLLLSDA
DEYDRPFITG ERTPEGFYRV KSGLDAAIAR GLAYAPYADL IWCETAHPDL DEARRFAEGI
HAKFPGKMLA YNCSPSFNWK RNLDDATIAA FQRELAAMGY KFQFITLAGW HMINYHAFEL
AKAYAAEGMT AYVKLQQAEF AAEQEGYTAT RHQREVGTGY FDEVSTIISG GLSSTTALAG
STEEEQFH