Gene Information |
Locus tag | Csal_2722 |
Symbol | |
ID | 4028211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 3051095 |
End bp | 3052054 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637967930 |
Product | ectoine utilization protein EutC |
Protein accession | YP_574768 |
Protein GI | 92114840 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2423] Predicted ornithine cyclodeaminase, mu-crystallin homolog |
TIGRFAM ID | [TIGR02992] ectoine utilization protein EutC |
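The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are hypothetical, introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming an unspliced CDS whose final
    codon is a stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Csal_2722.
length_bp = gene_length(3051095, 3052054)
length_aa = expected_protein_length(length_bp)
print(length_bp, length_aa)  # prints "960 319"
```

Both results agree with the Gene Length (960 bp) and Protein Length (319 aa) fields. The strand does not affect this check, since the length depends only on the coordinate span.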
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.667084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACTCC ACCAACGCGA GGCGATCGAG GCCGCCGTGT CCCTGGATAC GGCGGCCCTG GCCGCCATCG AGTTGGGCTT CGCGGCCCTG GGACGCGGCG AGGTCGTGCA GCCGCCGATC CTGTCGATGG CCATCGAGGA GGCCAACGGG GAGGTCGACG TCAAGACCGC GCACATTCGC GGCTTCGAGC GCTTCGCCAT CAAGGTGAGC CCGGGCTTCT TCGACAATCC CAAGCAGGGG CTGCCCAGCC TCAACGGGCT GATGATGGTG TTCTCGGCCC GGACCGGGGT GGTGGACGCC GTGCTCTTCG ATGAAGGCTA TTTGACGGCG GTGCGTACTG CCTTGGCCGG TGCCTTGTCG GCCAGGTACC TGGCGCGCGA GAACAGTCGC CGCGTGGCGG TGCTCGGCGC GGGCGAGCAG GCCGAGCTGC AGATCGAGGC GTTGCGCCTG GTGCGAGACA TCGACACCGT CGACGTCTGG GCGCGCCGCC GCGAGGCCGC CGAGGCGTAT GCCGAGCGCC TGCGGCAGCG CGGCTTGACG GTCAACGTGC ACGACGATGT GCATGCGGCC TGCCGCGCGG CGGATATCAT CGTCACCGCC ACGCCCTCGA CGGCGCCGAT TCTGGAAGCC GCCGACCTGC CCGAAGGCGT GCACGTCACG GCGATGGGCT CGGATAGCCC GGACAAGCGC GAGCTCGCCG ACAGCGTGAT GACGCGTGCC GACGCCTTCG TCTGCGATAC TCGCGCGCAA AGCGAGTGCA ACGGCGAACT CAAGGCCTTC GTCAAGGCCG GCGAGACACG CGCCGAGGTT CCCTTCAAGG TATACGAGCT CGGCGAGGTC ATCGACAAGC GACTGCCGCT GCGCTTGTCG GAGGCCAGCA TCACCGTCTG CGATCTCACC GGTACCGGGG TCCAGGATAC GGCGATCGCG AATTACGCGC TGCAGCGTTT GACGACCTAG
Protein sequence | MQLHQREAIE AAVSLDTAAL AAIELGFAAL GRGEVVQPPI LSMAIEEANG EVDVKTAHIR GFERFAIKVS PGFFDNPKQG LPSLNGLMMV FSARTGVVDA VLFDEGYLTA VRTALAGALS ARYLARENSR RVAVLGAGEQ AELQIEALRL VRDIDTVDVW ARRREAAEAY AERLRQRGLT VNVHDDVHAA CRAADIIVTA TPSTAPILEA ADLPEGVHVT AMGSDSPDKR ELADSVMTRA DAFVCDTRAQ SECNGELKAF VKAGETRAEV PFKVYELGEV IDKRLPLRLS EASITVCDLT GTGVQDTAIA NYALQRLTT