Gene Csal_3023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_3023 
Symbol 
ID4028989 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp3364702 
End bp3367527 
Gene Length2826 bp 
Protein Length941 aa 
Translation table11 
GC content67% 
IMG OID637968229 
Productpeptidase M16-like protein 
Protein accessionYP_575066 
Protein GI92115138 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like 
TIGRFAM ID[TIGR02110] coenzyme PQQ biosynthesis probable peptidase PqqF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00252323 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCTTTCAT CCCGTTCCCG TGCCGGCCGC TGGCTGCTCG GCATGTGCCT TTTTCTCCTC 
GCCGTCTCTC AGACGGTGTA CGCCAGCGAC CCGGTGGCCA AGGTCGTCGA TCCCATCGCC
AGCCCCAACG ATTCGCGCGA CTACCGGGCG CTGACGCTCG ATAACGGCCT CGAGATCCTG
CTGGTCAGCG ACCCGGAAGC CGACGAGGCC GCCGCCGCCA TGAACGTGGA CGTCGGCAGC
AGCGACGACC CCGACGCCAC GCCCGGCCTG GCACACTTTC TCGAGCACAT GCTGTTTCTC
GGCACCGACC GTTATCCCGA GGCCGATGCC TATCAGAATT TCATTTCGGC GCACGGCGGC
GATCACAACG CCTTCACCGC CTCACGCGAT ACCAATTACT ACTTCGACAT CGAACCGACG
GCCCTGCCCG AGGCCCTGGA CCGTTTCAGT CGCTTCTTCG TCGCCCCGCG CTTCAACCCC
GAGTATGTCG AGCGCGAGCG CAACGCCGTG CACTCCGAAT ACCAGGCCCG CCTGCGCGAT
GACGGGCGCC GCATCAACGA AGCCACCGAT CGCGCCCTCA ACCCCGAACA CCCGGCGACG
AGGTTTGCGG TGGGCAGCCT GGAGACGCTC CAGGGCGGGG AGCGCTCACT GCGCGAGAAA
CTGATCGACT TCTACGAGTC CCACTACGGG GCCAATGTCA TGCACTTGAC GGTCATCGGC
CCCCAGTCCC TCGACACCCT CGAGTCCATG GTCCGCGATC GCTTTGCCGA GATTCCCGAT
CGAGGCCTGA CACGCACCCC GATCGAGACA CCCCTGGTCA CCGACGCCGA GCTTCCCGCC
CGCCTGGCGG TCAAGAGCCT GTCCCGGGAC CGGGAGGTTC GCTTTTTGTT CCCGATTCCC
GACCCGCAGC AGGACTACCG CACCAAGCCT GCCGAGTACC TGGCCAATCT ACTCGGCCAC
GAAGGCGAGG GCAGCCTGCT GGCGGCCCTG CGCCGCGAAG GGTGGGCCGA CGGCCTCTCG
GCGGGTACCA CGAACGGCGA CGGTCGGCAT GCCTTGTTTG CCGTCTCGAT CAGCCTCACG
CCGGAAGGCG CCAAGCATCT CTCGCGCATT CAGGCCAGCC TGTTCGACCA GATCGAGCGC
ATCCGCGAGC AAGGGCTCCA GGCCTGGCGT TACGATGAGC AGGCCCGCCT CAATGAGCAG
GCCTTTCGCT TCCAGCAACG TGGCGAGCCG ATCGAGCAGG CGAGCCAGCT GGCCATGCGC
CTGGCCCACG TGCCGCTCGA GGATGTGCAG TACGCTCCCT ACCGCATGGA CGGATTCGAC
GCCGCCCGCA TTCGCGACTA TCTGGCCGAC ATGACGCCCG CGCACCTGTT GCGCGTGTAC
AGCGGGCCGG ACGTGGAGGG CGAGACCACC TCGCCCTACT TCGACGCCCC CTATACGCTG
AGCCGGGTCG AAACCTGGCC GGAGGCCAGC GCCCTGGACG GCCTCGAGCT CCCCTCACGC
AATCCTTTCA TCGCCGAGGA CCTCGAGGTG CACGCCTTGA GCGGCGATCG ACCCCAGGCC
ATCGTCGACG CACCCAGCGT CGAACTCTGG CACCTGGCCA ACGACCGCTT CGGCACACCA
CGCGTGGAGT GGCGCTTCAG CCTGCAGTCC CCCGACACCT CCGCCAGCGC CCGCAATGCC
GCCCTGACGC GACTGCTCGC GGGCTGGATC ACCGACAGCC TCAATGCACG CTTCTACCCC
GCTCGCCTGG CCGGTCAATC CTTCGATGCC TATGCCCACG CACGGGGCAT CACGCTGACC
TTCTCCGGCT GGCGCGACCG TCAGTCAAGG TTGATGAACG ACGTCGTCGA ACGCCTGAAA
CGAGGCGACA TCAGCGAGGC GAGTTTCTCG CGCGTCAAGT ACCGGCTGTC ACAGCAGTGG
CGCAACGCCG CCCAGGCGCC TCTACACCAG CAGATGTACC GGTCGCTCGG CGAGGCGCTG
CTGCGTCCCC AATGGTCGAC ATCGGCGATG CTCGACGCCT TGTCATCCCT CGACGTCGAG
GATCTGCGCG ACTACCGTGC CACGTTCCTC GGCGACCTCT ACGTGCAGGC CATGGCGGTG
GGCAACCTGA GCGACGAACT GGCGCGTCGC GAGGGACTGC AGATCGCCAA CGCCCTCGCC
CCTCGTCTGC ATGCCGAGGA CATACCGCCC CTGGCGCCGC TGGCGATCCC CGAGACGCCG
CCGACCCTGC ACCCCCGCAG CACGCGCAAC GACGCCGCCG TACTGCGCTA TCTGCAAGGC
CCCGACCGCA GCCTGGAAAG CCAGGCTCGG CTGGCGGTCA TCGGCAAGCT GATCGAGGCA
CCGTTCTACA CCCGATTGCG CACCGAGGAG CAACTGGGCT ATATCGTCAC GGCCGGCTAC
TCGCCGATAC TGGATGCGCC AGGCCTGGCC ATGCTGGTGC AGTCCCCCGA CACCGGAAAA
CAGCGCATCG CCCAGCGCAT GGAGGCCTTC CTGGAGGACT TCGACGCGCG CATGGCCTCC
CTCGACGACA GTGCCCTCGC GCCCTATCGC GCGGCCGTCA GCAGCCGCTT GCGGGAGCGT
GACAATAGCC TGGGCGAACT GACCGACCGG CTCTGGCAGA CACTGGCCTT CGCCGAGCCG
GACTTCGCGC GCCGCGACAA ACTGGCCGAC ACGGTCGACG CCCTGGACGC CGAAGCGGTG
CGTCAGGCCT GGCAGCGTCT GCGTCGCTCG CCGCCGCTCA CGGTCAACTA CGACGCTCGG
ACCACTGCCA GCGACATCCC GTCCCTCGTC GACACCTTCC GTCCCCTGCC GGCAACGCGG
GACTGA
 
Protein sequence
MLSSRSRAGR WLLGMCLFLL AVSQTVYASD PVAKVVDPIA SPNDSRDYRA LTLDNGLEIL 
LVSDPEADEA AAAMNVDVGS SDDPDATPGL AHFLEHMLFL GTDRYPEADA YQNFISAHGG
DHNAFTASRD TNYYFDIEPT ALPEALDRFS RFFVAPRFNP EYVERERNAV HSEYQARLRD
DGRRINEATD RALNPEHPAT RFAVGSLETL QGGERSLREK LIDFYESHYG ANVMHLTVIG
PQSLDTLESM VRDRFAEIPD RGLTRTPIET PLVTDAELPA RLAVKSLSRD REVRFLFPIP
DPQQDYRTKP AEYLANLLGH EGEGSLLAAL RREGWADGLS AGTTNGDGRH ALFAVSISLT
PEGAKHLSRI QASLFDQIER IREQGLQAWR YDEQARLNEQ AFRFQQRGEP IEQASQLAMR
LAHVPLEDVQ YAPYRMDGFD AARIRDYLAD MTPAHLLRVY SGPDVEGETT SPYFDAPYTL
SRVETWPEAS ALDGLELPSR NPFIAEDLEV HALSGDRPQA IVDAPSVELW HLANDRFGTP
RVEWRFSLQS PDTSASARNA ALTRLLAGWI TDSLNARFYP ARLAGQSFDA YAHARGITLT
FSGWRDRQSR LMNDVVERLK RGDISEASFS RVKYRLSQQW RNAAQAPLHQ QMYRSLGEAL
LRPQWSTSAM LDALSSLDVE DLRDYRATFL GDLYVQAMAV GNLSDELARR EGLQIANALA
PRLHAEDIPP LAPLAIPETP PTLHPRSTRN DAAVLRYLQG PDRSLESQAR LAVIGKLIEA
PFYTRLRTEE QLGYIVTAGY SPILDAPGLA MLVQSPDTGK QRIAQRMEAF LEDFDARMAS
LDDSALAPYR AAVSSRLRER DNSLGELTDR LWQTLAFAEP DFARRDKLAD TVDALDAEAV
RQAWQRLRRS PPLTVNYDAR TTASDIPSLV DTFRPLPATR D