Gene Information |
Locus tag | Csal_3167 |
Symbol | |
ID | 4028634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3531484 |
End bp | 3532752 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637968381 |
Product | peptidase M24 |
Protein accession | YP_575210 |
Protein GI | 92115282 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
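The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS, excluding the terminal stop codon."""
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(3531484, 3532752)
print(length)                     # 1269, matches the Gene Length field
print(protein_length_aa(length))  # 422, matches the Protein Length field
```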
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGATA TCGTTTCACT CGCCGGGTCC GTCCCCACGG GCGGTCCCTA CACCGACGGC AGCTTTCGTC ACGCCACGCT GGATGCCGAG CTGCTCTCGC GCGTCGAGCG CTATCGGCTT CAGCGGCTGC GTCAGCAGAT GCGTCACCAC GAGCTGGACG CGGTGATCCT GTTCGACCCG ATCAACATTC GCTATGCCTG CGGCGTGCGC AACATGCAGG TGTATTCGCA GCGCAACCCG GCGCGCTACC TTTATGTTGC GGCGGACGGC CCGGTGGTGC TGTTTGAGTT CAGCGCCTGC CTGCATCTGG CAGCCAACGC CGAACTGCTC GACGAGGTCC GCCCCGCCCG GGCGGTGATG CCGCAATACA GCGGCCCGCG TTGCGCGACG CACACTCAGG CCTTCGTCGA CGACATTCGC GGTCTGTTCG CGCGCTCGCA TGCCCAAGGG CAGCGCCTGG GGATCGAATC CGCGCCCACC GGCGCCATCG AGGCACTGGG CCAGGCCGGC TTCGAACTGA CCGACGCCGC CACCGTGGTC GAGGGCGCCA AGTCGATCAA GAGTGTCGAC GAGCTGGCGC TCATCCATCG TTCCGTGACG CTGACCGAAG CCGCGATGAC GCACATGGAG GCCGCCCTGC GCCCCGGCAT GAGCGAAAAC GAGCTGTGGT CGATCTTCAA CCAGCATGTG CTGGCGACCG GTGGCGAATA CGCCGAGACC CGCCTGCTCA GTTCCGGCGC GCGCACCAAC CCGTGGTTCC AGGAATGTTC GGACAAGATC ATCGCCCCGG GTGAGCTGGT CGCCTTCGAT ACCGACATCG TGGGATGCTT CGGCTACTAC ACCGACTTCT CGCGGACCTT CCACGTGCCC GGCGCAGGTG CGCCCAATGC CGAGCAGAAA GCGCTGTATC GCATGGCCGC CGACCAGCTC GAGCGCAACA TCGCGCTGCT GCGCCCCGGC ATGAGTTTCC GCGACTACTC GCGCCAGGCA TGGCCGATCC CCGACGGCTA TCGCGAGAAC CGCTACCTGG ATATCGTCCA TGGCTGCGGC ATGACCGGCG AATGGCCGTT GATCGCCCAC GACATGGACT GGGACGACGT CGGCTACGAT GGTGACATCG AGCCCGGCAT GACGCTGTGC GTCGAGGCCT ACATCGGCCA CCGCGACGGC ACCGAGGGCG TCAAGCTGGA AGAGCAGGTG CTGATCACCG AAGATGGTAT CGATCGCCTC TCGACCTACG GCTATTCCGA GACGCTGATG GCAGAATGA
|
Protein sequence | MGDIVSLAGS VPTGGPYTDG SFRHATLDAE LLSRVERYRL QRLRQQMRHH ELDAVILFDP INIRYACGVR NMQVYSQRNP ARYLYVAADG PVVLFEFSAC LHLAANAELL DEVRPARAVM PQYSGPRCAT HTQAFVDDIR GLFARSHAQG QRLGIESAPT GAIEALGQAG FELTDAATVV EGAKSIKSVD ELALIHRSVT LTEAAMTHME AALRPGMSEN ELWSIFNQHV LATGGEYAET RLLSSGARTN PWFQECSDKI IAPGELVAFD TDIVGCFGYY TDFSRTFHVP GAGAPNAEQK ALYRMAADQL ERNIALLRPG MSFRDYSRQA WPIPDGYREN RYLDIVHGCG MTGEWPLIAH DMDWDDVGYD GDIEPGMTLC VEAYIGHRDG TEGVKLEEQV LITEDGIDRL STYGYSETLM AE
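The relationship between the two sequence rows and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and it assumes the whitespace between 10-character blocks is purely cosmetic. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base); '*' marks a stop.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


# First two blocks of the gene sequence; the trailing partial codon is ignored.
print(translate("ATGGGCGATA TCGTTTCACT"))  # MGDIVS
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence row if the record is internally consistent, and `gc_fraction` on the same input can be compared with the GC content field.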
|