Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0226 |
Symbol | |
ID | 4027309 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 256873 |
End bp | 258120 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637965377 |
Product | HipA-like protein |
Protein accession | YP_572289 |
Protein GI | 92112361 |
COG category | [R] General function prediction only |
COG ID | [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAGT TCGAAGTACA TATCGACCTC GGTGGGAGCA CCTTTCCAGT CGGTCTGGCC AGGTGCAATC GCGTGCGTGG CACGGAGAGG ATTCTTTTCG AATACGACGC CGAATGGCTC GCGTCGCCGG ATCGCTTCTC ACTGGAGCCT GCCCTTGCAC TCACGCGGGG AGCCTTCGCT CCACCTGCAG GGATGACCAG CTTCGGCTCG ATTGGCGACT CGGCGCCCGA CACCTGGGGC CGTCGTCTGA TGCAACGTGC CGAGCGGCGG CGTGCCGCGC GGGAGGGTCG CAGTGTTCGA ACGCTTGCCG AGAGCGATTA CCTGCTGGGA GTGTCTGATG AGACTCGGCT TGGTGCCCTG CGTTTTCGCC GAGTGGGCGA GACGTCATTT CAGGCGCCGA TTCAGGCTGG CGTTCCTGCC CTCGTCGAGC TGGGTCGACT GCTTCAGGTC ACAGAGCGTG TCTTGCGTGA CGAGGAAACG GATGAAGACC TCCAGCTCAT CTTTGCCCCG GGCTCGTCCC TGGGTGGGGC CCGCCCGAAA GCCTCGGTCA TCGATCAGTA CGGCCACCTC GCGATCGCCA AGTTCCCGAA GGATACCGAC GAGTACAGTG TGGAGACCTG GGAAGAAATC GCGCTCCGCT TGGCCAACCA GGCTGGCATC GTGACGCCGC AGCACGAACT GGTCGACGTG GGCGGTCGGG CCGTCATGCT GTCGAGACGA TTCGATCGAG ATGGCACTAT CCGCATTCCG TTTCTTTCCG CGATGGCAAT GATGGGCGCC AGGGATGGCG AACCTGGCAG CTATCCCGAA ATGGTCGATG CTCTGACCGC GCATGGCGCT CAAGGAAAGA CCGACGCGCA CGCACTGTAT CGGCGCGTCG TCTTCAGCGT GATGATCTCC AATGTCGACG ATCATCTGCG CAATCACGGC TTTCTGTGGT TGGGGAACGC TGGATGGTCA CTCGCTCCGG CCTATGACCT CAACCCGGTG CCCAGTGACC TCAAGGCGCG GGTGTTAACC ACGAACATCG ATCTCGACGA GAGCACCTGC TCGGTCGATC TGCTGGAGGC GGCATCCGGT TATTTTGGGC TCACGCTGGC CAACGCCCGG CGAATCATCA AGGACGTTGC GTCAGTAACC GCAACCTGGC GGAACACTGC CAGGGCCGTT GGCGCACGTC CGGCCGAGAT CGAGCGTATG GCCAGCGCCT TCGAGCATGA TGACTTACAG CATGGGCTGG CGTTGTAA
|
Protein sequence | MTEFEVHIDL GGSTFPVGLA RCNRVRGTER ILFEYDAEWL ASPDRFSLEP ALALTRGAFA PPAGMTSFGS IGDSAPDTWG RRLMQRAERR RAAREGRSVR TLAESDYLLG VSDETRLGAL RFRRVGETSF QAPIQAGVPA LVELGRLLQV TERVLRDEET DEDLQLIFAP GSSLGGARPK ASVIDQYGHL AIAKFPKDTD EYSVETWEEI ALRLANQAGI VTPQHELVDV GGRAVMLSRR FDRDGTIRIP FLSAMAMMGA RDGEPGSYPE MVDALTAHGA QGKTDAHALY RRVVFSVMIS NVDDHLRNHG FLWLGNAGWS LAPAYDLNPV PSDLKARVLT TNIDLDESTC SVDLLEAASG YFGLTLANAR RIIKDVASVT ATWRNTARAV GARPAEIERM ASAFEHDDLQ HGLAL
|
| |