Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1105 |
Symbol | |
ID | 4029043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1254174 |
End bp | 1255757 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637966282 |
Product | hypothetical protein |
Protein accession | YP_573160 |
Protein GI | 92113232 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCGCT GGTGGTTAAC GCTACTGCTC GTCAATGTCA CGCTCATCGC CACAGCACTC CCGCGTACGG GGGTCACGGC AATGCCCTGG CTGGCCGTGG ACGGCTTGAT GCTGATCGCC GCCCTGAGCC TGTGGCCAGG GCATCGCCAC GCCCGGCGCG GGCTGGGTTA CGCCGCCGGC GCCCTCATCG CCATCGTCGT GCTGGGCACC GTGGGCAACA CGGTCACCCA GGAAATACGC GGCCGCCTCT TCAACCCCTA TCTCGACCCC GCGCAGTTGC CGATCTTCCT GGAATTGCTG CGCGACAATC TGGGGCTGGC GACGAGCCTG GTGCTGGTAG CGGTGGCGAT CGTCGGCCTG CTCGCCATCG GCGGCGGGCT CGGTCATCTG CTCGCCACCC TGCCCCGCCC TCGCCGACCG TCCGCCCGTC TCACTCTCGT CGGGGTGCTG GTCGTGAGTG CGGGACTGGT GCTGCTCGGC CCCACCCGAC ACACGACCTG GCTCGGCACG CCCGGCGTCG GCCTGCTCGC CGACCAGGTC ACGCGCGCCT ACGCCACCCA CGCGGCTGTG CGCGATTTCG ATGCCCAGCT GACACGCAGC GAGACGCCGC TCGATGCCCC CGAGGGGCAC CCCCTGCCCG GGCTCGCGGG GCGCGACGTC ATCCTGGCGT TCGTCGAGTC CTACGGCATG GCCGCGCTGG AACGCGCGCC CTTCGCGGCA CCGGTGAACC AACGTCTCGA CACCATGGCC GAGACGTTCG AACGCGCCGG CCTCAGCGTC GCCAGCGGCC GGCTCGTCTC GCCCACGCTG GGCGGCCAGT CGCGCCTGGC CCACGCCAGC GTGCTCAGCG GGCTATGGAT CGACTCCACG CTACGCTACG ACCTCCTGCT GGAAAGCCCG CGCGCGACTC TCGTCGACGA CTTCGAACGC AGCGGGCACA CCAGCGTGGC GATGATGCCG GCCATCTATC GCGACTGGCC GGCGGGACGG CGCCTGGGCT ATGACACCAT CCACGACGAC CCCCGACTCG ACTATCGCGG GCCGCGCCTG GGCTGGGTGA CGATCCCCGA CCAATTCGTC TGGCATCGCC TGCGCCAGCT GCGCGACGCG CATGCCGAGC CGGTATTCGC CGAACTGGCG CTGATCAGCA GCCACGCCCC CTGGACACCG GTGATCGAAC CGCTGCCCTG GGACGCCATC GGCGACGGCG AGGGCTTCTC GCGCTGGGAA GACGCCGGAA GCGACTTTCT CGAAAAATGG GGAGACAACG CGGGCATGCG TCAGCGCTAC GGCCCCTCGC TGGCGTACTC GCTGGCGGTC GCCGCGGAGT ACGCCGTGCA TGACGTGGAC GCCGACAGCC TGCTGATCCT GCTCGGCGAT CATCAGGCAG CCCCGGGAAT GCTCGGTTTC ACGCCCGACC GCGAAGTGCC GGTGCACGTT GTCAGCGGCG ACCCCGACCT GATCGCCCCC TTTCTCGAGC ATGGCTTCGT GCGCGGCACA CAACCGCTCC GCGATACGCC CGCGCGCTCG ATGGCCGATC TACGCGACTT GCTGCACAGG CTGTACGGGG GCGACGCCTC CTGA
|
Protein sequence | MRRWWLTLLL VNVTLIATAL PRTGVTAMPW LAVDGLMLIA ALSLWPGHRH ARRGLGYAAG ALIAIVVLGT VGNTVTQEIR GRLFNPYLDP AQLPIFLELL RDNLGLATSL VLVAVAIVGL LAIGGGLGHL LATLPRPRRP SARLTLVGVL VVSAGLVLLG PTRHTTWLGT PGVGLLADQV TRAYATHAAV RDFDAQLTRS ETPLDAPEGH PLPGLAGRDV ILAFVESYGM AALERAPFAA PVNQRLDTMA ETFERAGLSV ASGRLVSPTL GGQSRLAHAS VLSGLWIDST LRYDLLLESP RATLVDDFER SGHTSVAMMP AIYRDWPAGR RLGYDTIHDD PRLDYRGPRL GWVTIPDQFV WHRLRQLRDA HAEPVFAELA LISSHAPWTP VIEPLPWDAI GDGEGFSRWE DAGSDFLEKW GDNAGMRQRY GPSLAYSLAV AAEYAVHDVD ADSLLILLGD HQAAPGMLGF TPDREVPVHV VSGDPDLIAP FLEHGFVRGT QPLRDTPARS MADLRDLLHR LYGGDAS
|
| |