Gene Information |
Locus tag | Csal_2154 |
Symbol | |
ID | 4026494 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2423631 |
End bp | 2424815 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637967359 |
Product | tetratricopeptide repeat protein |
Protein accession | YP_574204 |
Protein GI | 92114276 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase |
TIGRFAM ID | |
|
|
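The coordinate and length fields above are consistent with each other, which the short sketch below illustrates. It assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(2423631, 2424815)
print(length, protein_length_aa(length))  # 1185 394
```

Both values match the Gene Length and Protein Length fields in this record.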
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00078802 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGATC CACTGCTGCT TGGCCTGCTG GTCGCGGCGA TTGCCATCGG TTGGTGGCTG GGACGGCGCG AGCGCCGGCG CGCTCGCGAA CAACCCGCTC CCGGCGTGTC CCTGCCCCGG GACTATTTCA TCGGCCTCAA CCACTTGCTC AATGAGCAGC CGGATCGTGC CATCGAGACG TTCGTGCAAG CGCTCGAGGT CAACAGCGAC ACCATCGAAA CCCATATCGC GCTGGGCAAT CTGTTCCGCT CTCGCGGCGA AGCCGATCGT GCCGTCAAGA TTCATCAGAA CCTCCTGGCC CGCCCTACCC TGACCCCCCA TCAGGGCGGC CTGGTGCAGC TGGAACTCTC GCGCGACTTC CTCCACCTCG GCTTGCTCGA CCGCGCCGAA CGCTTGCTGC AGAGCCTCGT CAAGGACGTC CCCGATGAAG GCCTTCGCGA TGCCGCCAAG CGCTTGCTGG TCGACCTCCT GCAGCGCGAG AAGGAATGGC AGCAGGCGCT GGATGTCGCC ACGCCGCAGC TCGTCCGCCA GGACGCGGAC ATCCAGCGCG CCGCAGCGCA CTGGCTATGC GAACTCGCCC AGCGCGACCG GCACAATGCC AGCCCCGGCA TGGCACGCAA GCGCCTGCGC CAGGCCCTGT CGATCGACGA AAGCTGCGTA CGGGCCACTT GGCTGCTGGC CGAGTTGGAA CGGGATACCG GCCACTACAA GGCCGTGATT CGCCATCTCA AGCGCATCCC CGAGCAGGAC CCGGGCTTTC TGCCGGTCAT CCTCGAGCCC TTGCATGAGG CCTATCGACG CCTGGAAGAC GAGTCGGGCT GGCGCGCATT TCTCGAGACC AAGCTGCAGG AAACGCACTT CACCAGCGTC GTGATCATGC TCGCGGCGTC CCGACTCAAT ACCGAAGGCC AGGACGCGGC CATCGAATTG ATCAGCGAAC AGCTCCAAGC GCACCCCAGC CTGCGCGGCG TCGATTACCT CATGGACCTG TACCTGATCG GCGAAAATCA CGGCGACCGC GAACGCCTGC TGCTGCTCAA GCAACATACT CAGAAACTGC TGGCAGCACG CCCCCGTCAT CGCTGTCATC GCTGTGGCTT CGGCAGCGCA CAACTGCATT GGCAATGCCC GCGCTGCCAG AGCTGGGGCA CCACCAAGCC GATCACGGGA CTGGAAGGCG AGTAG
|
Protein sequence | MLDPLLLGLL VAAIAIGWWL GRRERRRARE QPAPGVSLPR DYFIGLNHLL NEQPDRAIET FVQALEVNSD TIETHIALGN LFRSRGEADR AVKIHQNLLA RPTLTPHQGG LVQLELSRDF LHLGLLDRAE RLLQSLVKDV PDEGLRDAAK RLLVDLLQRE KEWQQALDVA TPQLVRQDAD IQRAAAHWLC ELAQRDRHNA SPGMARKRLR QALSIDESCV RATWLLAELE RDTGHYKAVI RHLKRIPEQD PGFLPVILEP LHEAYRRLED ESGWRAFLET KLQETHFTSV VIMLAASRLN TEGQDAAIEL ISEQLQAHPS LRGVDYLMDL YLIGENHGDR ERLLLLKQHT QKLLAARPRH RCHRCGFGSA QLHWQCPRCQ SWGTTKPITG LEGE
|
| |
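The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the tool used to produce this record: it builds the standard codon assignments, which translation table 11 shares for all internal codons (table 11 differs in the start codons it permits, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCTTGATC CACTGCTGCT TGGCCTGCTG"))  # MLDPLLLGLL
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence through the same function is expected to stop at the terminal TAG codon.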