Gene Csal_3081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3081 
Symbol 
ID4028887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3431892 
End bp3433859 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content63% 
IMG OID637968295 
ProductATP-dependent metalloprotease FtsH 
Protein accessionYP_575124 
Protein GI92115196 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID[TIGR01241] ATP-dependent metalloprotease FtsH 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.312439 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACGACA TGGCGAAGAA CCTGATTCTC TGGTTGGTCA TCGCGGCGGT ATTGCTGACG 
GTGTTCAACA ACTTCAGCGT CGACAGCTCA CCTCAGGCGA TGAGCTACTC GCAGTTCGTC
CAGCAGGTGC AGAACGACCA GATAGAAAGC GTGACCATCG AAGGCTACAC CATCAACGGT
GAGCGTGAAG ACGGTACGCA GTTCCAGACG ATCCGTCCGG CGGCCGAAGA CCCCAAGCTG
ATGGACGACC TGCTGGCGCA TGACGTCAGC GTGATCGGCA AGAAGCCCGA GGAGCAAAGT
CTGTGGACGC GCCTGCTCGT GGCCAGCTTC CCGATCCTGA TCATCCTCGC GATCTTCATC
TTCTTCATGC GTCAGATGCA AGGTGGCGGC GGTGGCAAGG GCGGCCCGAT GAGCTTCGGC
AAGTCCAAGG CCAAGCTGCT GACGCAGGAT CAGATCAAGA CGACCTTCGC CGATGTCGCC
GGCTGCGACG AGGCCAAGGA AGAAGTCGAG GAACTCGTCG ACTTCCTCAA GGACCCCAGC
AAGTTTCAGC GGCTGGGCGG GCAGATACCG CGCGGCGTGT TGATGGTGGG GCCTCCGGGG
ACGGGCAAGA CCCTGCTGGC CAAGGCCATC TCCGGTGAGG CCAAGGTCCC GTTCTTTACC
ATTTCCGGCT CGGACTTCGT GGAAATGTTC GTCGGCGTGG GGGCCTCGCG TGTTCGCGAC
ATGTTCGAAC AGGCCAAGAA GCAGGCCCCG TGCATCATCT TCATCGATGA GATCGATGCC
GTGGGTCGTC ATCGTGGCTC CGGCATGGGG GGCGGTCACG ACGAGCGCGA GCAGACGCTC
AACCAGTTGC TGGTGGAGAT GGACGGCTTC GAAGCCAACG ACGGCATCAT CGTGATCGCG
GCCACCAACC GCCCCGACGT GCTCGACCCG GCACTGCTGC GTCCCGGCCG CTTCGACCGT
CAGGTGACCG TGGGGCTGCC CGACATTCGC GGACGTGAGC ACATTCTTGG CGTGCACCTG
CGCAAGGTAC CGCTGGCCGA CGATGTGCAG CCGAGCTTCA TCGCTCGCGG CACGCCTGGC
TTCTCGGGCG CCGATCTGGC CAACCTGGTC AACGAGGCCG CCTTGTTCGC CGCGCGTCGC
AACAAGCGCC TGGTGGGCAT GGACGAGCTC GAGATGGCCA AGGACAAGAT CCTGATGGGC
TCCGAGAAGC GCTCGATGGT CATGTCCGAG AAAGAGAAGA GCAACACCGC GTACCACGAG
TCGGGCCATG CCATCATCGG GCTGCTGATG CCCGAGCACG ACCCCGTCTA CAAGGTGACG
ATCATCCCGC GCGGGCGTGC CCTGGGTGTC ACCATGTTCC TGCCCGAGGA GGATCGCTAC
AGCCTCTCTC GGCAGCAGAT CATCAGTCAG ATCTGCTCGT TGTTCGGCGG CCGCCTCGCG
GAGGAAATGA CCCTGGGGCC GAATGGCGTC ACCACCGGGG CGTCCAACGA CATCAAGCGC
GCCACCGAAC TGGCCCACAA CATGGTCGCC AAGTGGGGGC TCTCGGAAGA GATGGGCCCG
CTGATGTACG ACGAGGACGA GTCGCATCAA TTCCTGGGCG GCGGCGGCCA GGGCGGCGGC
AAGCTGAAGT CGGGCGAGAC CACGACGCGT CTCGACAAGG AAGTGCGCAG GATCATCGAC
GAGTGCTATA ACAAGGCGCG CCAGATCCTG GAAGACAATC GTGACAAGCT GGACCTGATG
GCTGAATCGT TGATGCAGTA CGAAACCATC GATGCCAACC AGATCCGCGA CATCATGGAA
GGTCGCAAGC CGCGTCCGCC GGAGGACTGG GACGACAAGG GGCCGACGAC CGGCTCGGGG
TCGACCGCAA ATCCCTCTGC CGACGATGAA GCCGAAGGGC AGGGCGACGA AGAAGGCGAC
ACCAGTCGTC GTCCCTCGGA TCCCCTGGGT GGGCCGGCGG GGCACTGA
 
Protein sequence
MNDMAKNLIL WLVIAAVLLT VFNNFSVDSS PQAMSYSQFV QQVQNDQIES VTIEGYTING 
EREDGTQFQT IRPAAEDPKL MDDLLAHDVS VIGKKPEEQS LWTRLLVASF PILIILAIFI
FFMRQMQGGG GGKGGPMSFG KSKAKLLTQD QIKTTFADVA GCDEAKEEVE ELVDFLKDPS
KFQRLGGQIP RGVLMVGPPG TGKTLLAKAI SGEAKVPFFT ISGSDFVEMF VGVGASRVRD
MFEQAKKQAP CIIFIDEIDA VGRHRGSGMG GGHDEREQTL NQLLVEMDGF EANDGIIVIA
ATNRPDVLDP ALLRPGRFDR QVTVGLPDIR GREHILGVHL RKVPLADDVQ PSFIARGTPG
FSGADLANLV NEAALFAARR NKRLVGMDEL EMAKDKILMG SEKRSMVMSE KEKSNTAYHE
SGHAIIGLLM PEHDPVYKVT IIPRGRALGV TMFLPEEDRY SLSRQQIISQ ICSLFGGRLA
EEMTLGPNGV TTGASNDIKR ATELAHNMVA KWGLSEEMGP LMYDEDESHQ FLGGGGQGGG
KLKSGETTTR LDKEVRRIID ECYNKARQIL EDNRDKLDLM AESLMQYETI DANQIRDIME
GRKPRPPEDW DDKGPTTGSG STANPSADDE AEGQGDEEGD TSRRPSDPLG GPAGH