Gene Csal_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_2044 
Symbol 
ID4025940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp2305233 
End bp2307641 
Gene Length2409 bp 
Protein Length802 aa 
Translation table11 
GC content62% 
IMG OID637967239 
ProductLon-A peptidase 
Protein accessionYP_574094 
Protein GI92114166 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0466] ATP-dependent Lon protease, bacterial type 
TIGRFAM ID[TIGR00763] ATP-dependent protease La 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCAGA ACGCTGAACA GACCTTAAGT CTTCCCCTTT TGCCGCTGCG GGACGTGGTC 
GTCTATCCGC AGATGGTGAT TCCGCTGTTC GTGGGGCGCG AGAAATCGAT TCGCGCGCTC
GAGACGGCCA TGGAAAACGA CAAGCGTATC TTGCTCGTGG CCCAGCGTGA GGCGAGTCAG
GACGATCCGG AATTCGGCGA CCTTTTCGAT GTCGGGACCG TGGCCGAGAT CATGCAACTT
CTCAAGTTGC CGGACGGCAC CGTCAAGGTA TTGATCGAAG GTGACTATCG CGCCGACATT
CGTGATGTCC ATGAGGACGC GTCCGGTTAT GTATCGGCCG AGGCCACGCG TCGCGAGAGC
GAGGCATTGA CCGAGCGCGA GCAGGAATCG CTCGTGCGGG TCCTGCTGAA TCAGTTCGAG
CAATATGTAA AGCTGTCCAA GAAGGTGCCC AACGAAGTCC TCAATTCGTT GTCCGGCATC
GAGGACCCGA GCCGCCTGGT CGATACGATC TGCGCGCACT TGTCGCTCAA GATCGGCGAC
AAGCAGGAGC TGCTCGAGAT GGATCGCGTA CGCGATCGCA TCGAGCACCT GATGGCCTTG
ATCGAGTCCG AGATCGATCT GTTGCAGGTG GAAAAGCGCA TCCGTTCGCG GGTCAAGGAC
CAGATGGAGA AGTCCCAGCG CGAGTACTAT CTCAACGAGC AGATGAAGGC CATCCAGAAG
GAGATGGGCG AGCTCGAGAA CGCGCCCAAC GAGGCCGACA AGTACGAGCA GCTGATCGAG
TCGTCCGGCA TGCCCAAGGA AGCGGCCGAC AAGGCGCGCC AGGAGCTGGG CAAGCTCAAG
ATGATGGCGC CGACATCCGC CGAGGCGACC GTGGTGCGTT CGTACCTCGA TTGGCTGGTG
GCGGTACCCT GGAAGAAGCG CTCGCGCGTG CGGCACGACC TGGTGCATGC CCAGAAGGTG
CTCGACGAGG ACCATTACGG GCTGGAAGAG GTCAAGGAGC GCATCCTCGA GTACCTGGCC
GTGCAGAAGC GCGTCAAGAA GCTCAAGGGG CCGGTGCTGT GCCTGGTGGG CCCGCCCGGG
GTCGGCAAGA CCTCGCTCGG TCAGTCGATC GCACGGGCGA CCAACCGGCG TTACGTGCGC
CTGGCGCTGG GGGGTATCCG CGACGAGTCC GAGATTCGCG GGCATCGCCG TACTTACATC
GGCTCGCTGC CCGGCAAGCT GATTCAGCGC ATGAGCAAGG CCGAGGTACG CAACCCGCTC
TTCTTGCTCG ACGAGGTCGA CAAGATCGGC ATGGATCATC GCGGCGACCC CTCGTCGGCG
TTGCTCGAAG TACTCGACCC CGAGCAGAAC AATACCTTCA GCGACCATTA CCTGGAGCTG
GATTACGACC TCTCCGACGT CATGTTCATC TGCACCGCCA ACTCGATGAA CATCCCGGAG
CCGCTGCTCG ACCGCATGGA GATCATTCGC CTGCCCGGTT ACACGGAAGA CGAAAAGCTC
GCCATCGCCA AGCGTTACCT GGTGCCCAAG CAGCTCAAGG CCAACGGGCT CAAGGAGGAC
GAGCTGAGCT TCTCTGACGA ATCTCTGCTC GAGCTGGTGC GTTATTACAC CCGCGAGGCG
GGCGTTCGTG AGCTGGAGCG CCAGATCGCC AAGGTGAGCC GCAAGGTGCT GCGCGAACGT
GTCGAGGCCG AGAAGCAGCA AGGTGCGAAA GGGCCGCAAC TGCTGGCGGC TGCGGACATC
GAGACCTATG CCGGCGTGCG TCGCTACAGC TATGGCCTGG CCGATAAGGA AGACCAGGTC
GGGCGCGTCA CGGGGCTGGC CTGGACATCG GTGGGGGGCG AGCTGCTCAA CATCGAGTCG
GTGGTCTCCC CGGGCAAGGG GCGCCTGAAC AAGACCGGTT CGCTCGGCGA TGTGATGAAA
GAGTCGGTGA GTGCGGCGCT TACCGTGGTG CGGGCACGCG CCGAAGCGCT GGGTATCGAT
CCCGAGCGCT TCGAGAAAGA GGACCTCCAC ATTCACGTCC CCGAGGGCGC CACGCCCAAG
GATGGGCCGA GTGCGGGCAT CGCCATGGTG ACGGCGATGG TCTCGGCCTA CACCGGGCGT
CCGGTGCACT GTGACGTGGC CATGACCGGT GAAGTCAACC TGCGTGGCGA GGTCATGCCG
ATCGGCGGGC TCAAGGAGAA ATTGCTGGCG GCGCGACGCG GTGGTATAAA GACGGTGCTC
ATACCGGAGG AAAATCGCCG GGATCTCAAG GAAGTGCCGG ACAATATCAA GGATGCCCTG
GATATCCGGC CCGTCAAATG GATTGATGAA GTTCTCGACG CGGCGCTGGT GGAAAAAGCA
GAGGTGGAAA GCGGTGAATC CCTAGCGGAA ACCAGCCAAC CGACGCGTTC CAATATCAGC
ACGCATTGA
 
Protein sequence
MEQNAEQTLS LPLLPLRDVV VYPQMVIPLF VGREKSIRAL ETAMENDKRI LLVAQREASQ 
DDPEFGDLFD VGTVAEIMQL LKLPDGTVKV LIEGDYRADI RDVHEDASGY VSAEATRRES
EALTEREQES LVRVLLNQFE QYVKLSKKVP NEVLNSLSGI EDPSRLVDTI CAHLSLKIGD
KQELLEMDRV RDRIEHLMAL IESEIDLLQV EKRIRSRVKD QMEKSQREYY LNEQMKAIQK
EMGELENAPN EADKYEQLIE SSGMPKEAAD KARQELGKLK MMAPTSAEAT VVRSYLDWLV
AVPWKKRSRV RHDLVHAQKV LDEDHYGLEE VKERILEYLA VQKRVKKLKG PVLCLVGPPG
VGKTSLGQSI ARATNRRYVR LALGGIRDES EIRGHRRTYI GSLPGKLIQR MSKAEVRNPL
FLLDEVDKIG MDHRGDPSSA LLEVLDPEQN NTFSDHYLEL DYDLSDVMFI CTANSMNIPE
PLLDRMEIIR LPGYTEDEKL AIAKRYLVPK QLKANGLKED ELSFSDESLL ELVRYYTREA
GVRELERQIA KVSRKVLRER VEAEKQQGAK GPQLLAAADI ETYAGVRRYS YGLADKEDQV
GRVTGLAWTS VGGELLNIES VVSPGKGRLN KTGSLGDVMK ESVSAALTVV RARAEALGID
PERFEKEDLH IHVPEGATPK DGPSAGIAMV TAMVSAYTGR PVHCDVAMTG EVNLRGEVMP
IGGLKEKLLA ARRGGIKTVL IPEENRRDLK EVPDNIKDAL DIRPVKWIDE VLDAALVEKA
EVESGESLAE TSQPTRSNIS TH