Gene Csal_2449 details


Gene Information

Locus tag: Csal_2449
Symbol:
ID: 4026969
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chromohalobacter salexigens DSM 3043
Kingdom: Bacteria
Replicon accession: NC_007963
Strand:
Start bp: 2754022
End bp: 2755332
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 63%
IMG OID: 637967656
Product: isocitrate lyase
Protein accession: YP_574495
Protein GI: 92114567
COG category: [C] Energy production and conversion
COG ID: [COG2224] Isocitrate lyase
TIGRFAM ID: [TIGR01346] isocitrate lyase
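
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2754022
end_bp = 2755332
gene_length_bp = 1311
protein_length_aa = 436

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(span_bp == gene_length_bp)                     # True
print(remainder)                                     # 0
print(expected_protein_length == protein_length_aa)  # True
```

Under these assumptions the three fields agree: 1311 bp is 437 codons, of which 436 encode amino acids.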


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.155678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCAGA CACGCGAACA GCAAATCGCT GCGTTGGAAA AGGATTGGAA CGAGAATCCG 
CGCTGGAAGG ACGTCAAGCG TCCGTACAGT GCCGAAGATG TGGTTCGACT TCGCGGCAGC
GTCAACGAAG CGCACACGCT GGCCAGCCGC GGCGCCGAGA AGCTCTGGCG ACTGGTCAAT
GGCGAGGCCC GCAAGGGCTA CGTCAACTGC CTCGGCGCGC TGACCGGCGG CCAGGCCATG
CAGCAGGTCA AGGCAGGTAT CGAGGCGATC TATCTCTCGG GCTGGCAGGT CGCCGCCGAC
AACAACAGCT ACCTGTCGAT GTATCCCGAC CAGTCGCTCT ATCCGGTGGA CTCGGTGCCC
AAGGTCGTCG AACGCATCAA CAACAGTTTC CGCCGCGCGG ATCAGATCCA GTGGCAGAAG
GGCGCCAACC CCGGCGATGC CGACTTCGTC GATTACTTCG CCCCCATCGT CGCCGACGCC
GAGGCGGGAT TCGGCGGCGT GCTCAACGCC TATGAACTGA TGACGGCAAT GATCCGTGCC
GGTGCCAGTG GCGTGCATTT CGAGGATCAG CTCGCCGCGG TCAAGAAGTG CGGCCACATG
GGCGGCAAGG TGCTGGTGCC CACGCAGGAG GCCGTGCAGA AGCTGGTCGC CGCCCGTCTG
GCCGCCGACG TCGCGGGCAC GCCGACGCTG GTCATCGCGC GCACCGACGC CAATGCCGCC
AACCTGATCA CCGCCGACGT GGATGATTAC GACAAGCCCT TCATCACCGG GGAACGCACC
GCCGAAGGCT TCTATCGGGT CAATGCCGGC CTCGATCAGG CCATCTCGCG AGGCCTGGCC
TACGCGCCCT TCGCCGACAT CATCTGGTGC GAAACCGCCA AGCCGGATCT CGACGAAGCC
AGGCGCTTCG CCGAGGCGAT CCATCGCGAA TATCCGGGGC AACTGCTCGC CTACAACTGC
TCGCCGTCCT TCAACTGGAA GAAGAACCTC GACGATGCCG AGATCGCCGG GTTCCAGCAG
GCCCTGGCCG ACATGGGCTA CACCTACCAG TTCATCACCT TAGCGGGCAT TCACAACATG
TGGTACAACA TGTTCGATCT CGCCCACAGC TACGCTCAGG GCGAAGGCAT GAAGCACTAC
GTCGAGAAGG TCCAGCAGCC GGAATTCGAG GCCGCCGAAC GCGGCTACAC CTTCGTCGCT
CACCAGCAGG AAGTGGGCAC CGGCTACTTC GACGACATGA CCAACGTCAT CCAGGGCGGA
GTGTCGTCGG TGACCGCCCT CAAGGGCTCC ACCGAGGAAG CGCAGTTCTG A
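
The gene sequence is printed in blocks of 10 bases, 60 per line, so it has to be joined before it can be measured or compared with the Gene Length and GC content fields. The following is a minimal sketch of that clean-up; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first and last lines of the listing are used as demo input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence listing (demo input only).
first_line = "ATGTCCCAGA CACGCGAACA GCAAATCGCT GCGTTGGAAA AGGATTGGAA CGAGAATCCG"
last_line = "GTGTCGTCGG TGACCGCCCT CAAGGGCTCC ACCGAGGAAG CGCAGTTCTG A"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 51 TGA
```

Passing the full listing through `clean_sequence` and then `gc_percent` gives a value that can be compared with the 63% reported in the record; the listing has 21 full lines of 60 bases plus a final line of 51, which matches the stated 1311 bp.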
 
Protein sequence
MSQTREQQIA ALEKDWNENP RWKDVKRPYS AEDVVRLRGS VNEAHTLASR GAEKLWRLVN 
GEARKGYVNC LGALTGGQAM QQVKAGIEAI YLSGWQVAAD NNSYLSMYPD QSLYPVDSVP
KVVERINNSF RRADQIQWQK GANPGDADFV DYFAPIVADA EAGFGGVLNA YELMTAMIRA
GASGVHFEDQ LAAVKKCGHM GGKVLVPTQE AVQKLVAARL AADVAGTPTL VIARTDANAA
NLITADVDDY DKPFITGERT AEGFYRVNAG LDQAISRGLA YAPFADIIWC ETAKPDLDEA
RRFAEAIHRE YPGQLLAYNC SPSFNWKKNL DDAEIAGFQQ ALADMGYTYQ FITLAGIHNM
WYNMFDLAHS YAQGEGMKHY VEKVQQPEFE AAERGYTFVA HQQEVGTGYF DDMTNVIQGG
VSSVTALKGS TEEAQF
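
The protein sequence follows from the gene sequence by codon translation. Below is a sketch of that step, not the pipeline that produced this record. It builds the codon table from the standard NCBI base ordering (T, C, A, G); translation table 11 assigns the same amino acids as the standard code and differs only in the set of permitted start codons, which does not matter here because this gene starts with ATG. The function name `translate` is illustrative.

```python
from itertools import product

# NCBI ordering: first base varies slowest, each base cycles through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the gene sequence.
print(translate("ATGTCCCAGA CACGCGAACA G"))  # MSQTREQ
print(translate("GCGCAGTTCTGA"))             # AQF
```

Both fragments match the start (MSQTREQ) and end (AQF) of the protein sequence listed above.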