Gene Information |
Locus tag | Csal_3023 |
Symbol | |
ID | 4028989 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3364702 |
End bp | 3367527 |
Gene Length | 2826 bp |
Protein Length | 941 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637968229 |
Product | peptidase M16-like protein |
Protein accession | YP_575066 |
Protein GI | 92115138 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like |
TIGRFAM ID | [TIGR02110] coenzyme PQQ biosynthesis probable peptidase PqqF |
|
|
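The coordinate and length fields above agree with one another, which a short check can confirm. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3364702, 3367527)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed gene length of 2826 bp and protein length of 941 aa.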
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00252323 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCTTTCAT CCCGTTCCCG TGCCGGCCGC TGGCTGCTCG GCATGTGCCT TTTTCTCCTC GCCGTCTCTC AGACGGTGTA CGCCAGCGAC CCGGTGGCCA AGGTCGTCGA TCCCATCGCC AGCCCCAACG ATTCGCGCGA CTACCGGGCG CTGACGCTCG ATAACGGCCT CGAGATCCTG CTGGTCAGCG ACCCGGAAGC CGACGAGGCC GCCGCCGCCA TGAACGTGGA CGTCGGCAGC AGCGACGACC CCGACGCCAC GCCCGGCCTG GCACACTTTC TCGAGCACAT GCTGTTTCTC GGCACCGACC GTTATCCCGA GGCCGATGCC TATCAGAATT TCATTTCGGC GCACGGCGGC GATCACAACG CCTTCACCGC CTCACGCGAT ACCAATTACT ACTTCGACAT CGAACCGACG GCCCTGCCCG AGGCCCTGGA CCGTTTCAGT CGCTTCTTCG TCGCCCCGCG CTTCAACCCC GAGTATGTCG AGCGCGAGCG CAACGCCGTG CACTCCGAAT ACCAGGCCCG CCTGCGCGAT GACGGGCGCC GCATCAACGA AGCCACCGAT CGCGCCCTCA ACCCCGAACA CCCGGCGACG AGGTTTGCGG TGGGCAGCCT GGAGACGCTC CAGGGCGGGG AGCGCTCACT GCGCGAGAAA CTGATCGACT TCTACGAGTC CCACTACGGG GCCAATGTCA TGCACTTGAC GGTCATCGGC CCCCAGTCCC TCGACACCCT CGAGTCCATG GTCCGCGATC GCTTTGCCGA GATTCCCGAT CGAGGCCTGA CACGCACCCC GATCGAGACA CCCCTGGTCA CCGACGCCGA GCTTCCCGCC CGCCTGGCGG TCAAGAGCCT GTCCCGGGAC CGGGAGGTTC GCTTTTTGTT CCCGATTCCC GACCCGCAGC AGGACTACCG CACCAAGCCT GCCGAGTACC TGGCCAATCT ACTCGGCCAC GAAGGCGAGG GCAGCCTGCT GGCGGCCCTG CGCCGCGAAG GGTGGGCCGA CGGCCTCTCG GCGGGTACCA CGAACGGCGA CGGTCGGCAT GCCTTGTTTG CCGTCTCGAT CAGCCTCACG CCGGAAGGCG CCAAGCATCT CTCGCGCATT CAGGCCAGCC TGTTCGACCA GATCGAGCGC ATCCGCGAGC AAGGGCTCCA GGCCTGGCGT TACGATGAGC AGGCCCGCCT CAATGAGCAG GCCTTTCGCT TCCAGCAACG TGGCGAGCCG ATCGAGCAGG CGAGCCAGCT GGCCATGCGC CTGGCCCACG TGCCGCTCGA GGATGTGCAG TACGCTCCCT ACCGCATGGA CGGATTCGAC GCCGCCCGCA TTCGCGACTA TCTGGCCGAC ATGACGCCCG CGCACCTGTT GCGCGTGTAC AGCGGGCCGG ACGTGGAGGG CGAGACCACC TCGCCCTACT TCGACGCCCC CTATACGCTG AGCCGGGTCG AAACCTGGCC GGAGGCCAGC GCCCTGGACG GCCTCGAGCT CCCCTCACGC AATCCTTTCA TCGCCGAGGA CCTCGAGGTG CACGCCTTGA GCGGCGATCG ACCCCAGGCC ATCGTCGACG CACCCAGCGT CGAACTCTGG CACCTGGCCA ACGACCGCTT CGGCACACCA CGCGTGGAGT GGCGCTTCAG CCTGCAGTCC CCCGACACCT CCGCCAGCGC CCGCAATGCC GCCCTGACGC GACTGCTCGC GGGCTGGATC ACCGACAGCC TCAATGCACG CTTCTACCCC GCTCGCCTGG CCGGTCAATC CTTCGATGCC TATGCCCACG CACGGGGCAT CACGCTGACC 
TTCTCCGGCT GGCGCGACCG TCAGTCAAGG TTGATGAACG ACGTCGTCGA ACGCCTGAAA CGAGGCGACA TCAGCGAGGC GAGTTTCTCG CGCGTCAAGT ACCGGCTGTC ACAGCAGTGG CGCAACGCCG CCCAGGCGCC TCTACACCAG CAGATGTACC GGTCGCTCGG CGAGGCGCTG CTGCGTCCCC AATGGTCGAC ATCGGCGATG CTCGACGCCT TGTCATCCCT CGACGTCGAG GATCTGCGCG ACTACCGTGC CACGTTCCTC GGCGACCTCT ACGTGCAGGC CATGGCGGTG GGCAACCTGA GCGACGAACT GGCGCGTCGC GAGGGACTGC AGATCGCCAA CGCCCTCGCC CCTCGTCTGC ATGCCGAGGA CATACCGCCC CTGGCGCCGC TGGCGATCCC CGAGACGCCG CCGACCCTGC ACCCCCGCAG CACGCGCAAC GACGCCGCCG TACTGCGCTA TCTGCAAGGC CCCGACCGCA GCCTGGAAAG CCAGGCTCGG CTGGCGGTCA TCGGCAAGCT GATCGAGGCA CCGTTCTACA CCCGATTGCG CACCGAGGAG CAACTGGGCT ATATCGTCAC GGCCGGCTAC TCGCCGATAC TGGATGCGCC AGGCCTGGCC ATGCTGGTGC AGTCCCCCGA CACCGGAAAA CAGCGCATCG CCCAGCGCAT GGAGGCCTTC CTGGAGGACT TCGACGCGCG CATGGCCTCC CTCGACGACA GTGCCCTCGC GCCCTATCGC GCGGCCGTCA GCAGCCGCTT GCGGGAGCGT GACAATAGCC TGGGCGAACT GACCGACCGG CTCTGGCAGA CACTGGCCTT CGCCGAGCCG GACTTCGCGC GCCGCGACAA ACTGGCCGAC ACGGTCGACG CCCTGGACGC CGAAGCGGTG CGTCAGGCCT GGCAGCGTCT GCGTCGCTCG CCGCCGCTCA CGGTCAACTA CGACGCTCGG ACCACTGCCA GCGACATCCC GTCCCTCGTC GACACCTTCC GTCCCCTGCC GGCAACGCGG GACTGA
|
Protein sequence | MLSSRSRAGR WLLGMCLFLL AVSQTVYASD PVAKVVDPIA SPNDSRDYRA LTLDNGLEIL LVSDPEADEA AAAMNVDVGS SDDPDATPGL AHFLEHMLFL GTDRYPEADA YQNFISAHGG DHNAFTASRD TNYYFDIEPT ALPEALDRFS RFFVAPRFNP EYVERERNAV HSEYQARLRD DGRRINEATD RALNPEHPAT RFAVGSLETL QGGERSLREK LIDFYESHYG ANVMHLTVIG PQSLDTLESM VRDRFAEIPD RGLTRTPIET PLVTDAELPA RLAVKSLSRD REVRFLFPIP DPQQDYRTKP AEYLANLLGH EGEGSLLAAL RREGWADGLS AGTTNGDGRH ALFAVSISLT PEGAKHLSRI QASLFDQIER IREQGLQAWR YDEQARLNEQ AFRFQQRGEP IEQASQLAMR LAHVPLEDVQ YAPYRMDGFD AARIRDYLAD MTPAHLLRVY SGPDVEGETT SPYFDAPYTL SRVETWPEAS ALDGLELPSR NPFIAEDLEV HALSGDRPQA IVDAPSVELW HLANDRFGTP RVEWRFSLQS PDTSASARNA ALTRLLAGWI TDSLNARFYP ARLAGQSFDA YAHARGITLT FSGWRDRQSR LMNDVVERLK RGDISEASFS RVKYRLSQQW RNAAQAPLHQ QMYRSLGEAL LRPQWSTSAM LDALSSLDVE DLRDYRATFL GDLYVQAMAV GNLSDELARR EGLQIANALA PRLHAEDIPP LAPLAIPETP PTLHPRSTRN DAAVLRYLQG PDRSLESQAR LAVIGKLIEA PFYTRLRTEE QLGYIVTAGY SPILDAPGLA MLVQSPDTGK QRIAQRMEAF LEDFDARMAS LDDSALAPYR AAVSSRLRER DNSLGELTDR LWQTLAFAEP DFARRDKLAD TVDALDAEAV RQAWQRLRRS PPLTVNYDAR TTASDIPSLV DTFRPLPATR D
|
| |
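The gene sequence begins with GTG while the protein begins with M, because translation table 11 accepts GTG as an alternative start codon that is read as methionine. The sketch below illustrates this on the opening and closing bases of the gene sequence, and also shows one way to compute a GC fraction. It assumes that table 11 uses the standard amino acid assignments and differs only in its start codons; the helper names are hypothetical, and the code is an illustration rather than the method the database itself uses.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Drop the spaces between 10-base blocks and normalise case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS; the first codon is read as M if it is a valid start."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


first_blocks = "GTGCTTTCAT CCCGTTCCCG TGCCGGCCGC"  # first 30 bases of the gene
print(translate(first_blocks))      # first ten residues of the protein
print(translate("ACGCGGGACTGA"))    # last four codons, ending in the TGA stop
print(gc_fraction(first_blocks))
```

Applied to the full gene sequence, the same functions can be used to compare against the Protein sequence and GC content fields of the record.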