Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0079 |
Symbol | |
ID | 4027258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 99140 |
End bp | 99988 |
Gene Length | 849 bp |
Protein Length | 282 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637965230 |
Product | hypothetical protein |
Protein accession | YP_572142 |
Protein GI | 92112214 |
COG category | [R] General function prediction only |
COG ID | [COG3257] Uncharacterized protein, possibly involved in glyoxylate utilization |
TIGRFAM ID | [TIGR03214] putative allantoin catabolism protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACAAC GCAGCCACGA CGCACCTGAA AGAACCTACT ACGCCCCCCA TGGCGGTTTG CCGCCGCAGA GCCAGTTGAT TCATGGTCGC GCGGTCTTCA CCGAGGCCTA TGCGGTGATT CCCAAGGGGG TGATGAGCGA TATCGTCACC AGTTTCCTGC CGCACTGGGA GAAGACCCGG CTGTGGGTGC TGTCGCGGCC GCTTTCCGGC TTCGCGGAGA CGTTTTCGCA GTACATCATG GAGGTCTCGC CCGGTGGCGG CAGCGAGCGG CCGGAGCCGG ACGAAGGCGC CGAGGGCGTG CTGTTCGTGG TCGAGGGCGA GATGACGCTG ACCATCGCGG GGGAAGCGCA CGCGATGGGA CCGGGAGGCT ATGCCTATCT GCCGCCGGGC TGCGACTGGC AGCTGCGCAA CGGCAGCGAC GCGCCGGTGC GCTTCCACTG GATACGCAAG GCCTACGAGT TCGTCGAGGG GCTGGCAGTG CCCGAAGCCT TCGTCACCAG CGACAACGAC ATCGCGCCGA TTGCCATGCC GGATACCGAC GGCCGCTGGG CCACGACACG CTTCGTCGAT CCCCAGGACG TGCGCCACGA CATGCACGTC AACATCGTTA CCTTCCAGCC CGGGGGCGTG ATCCCGTTCG ATGAAACCCA TGTCATGGAG CATGGGCTCT ACGTGCTGGA GGGCCGTGCG ATCTACCACC TCAATCAGGA TTGGGTCGAG GTCGAGGCCG GCGACTACAT GTGGCTGCGG GCCTTCTGTC CGCAATCCTG CTACGCCGCC GGGCCGGGGC CGTTTCGCTA TCTGCTCTAC AAGGATGTGA ACCGGCACAT GAAGCTGCGC TTGAGCTGA
|
Protein sequence | MSQRSHDAPE RTYYAPHGGL PPQSQLIHGR AVFTEAYAVI PKGVMSDIVT SFLPHWEKTR LWVLSRPLSG FAETFSQYIM EVSPGGGSER PEPDEGAEGV LFVVEGEMTL TIAGEAHAMG PGGYAYLPPG CDWQLRNGSD APVRFHWIRK AYEFVEGLAV PEAFVTSDND IAPIAMPDTD GRWATTRFVD PQDVRHDMHV NIVTFQPGGV IPFDETHVME HGLYVLEGRA IYHLNQDWVE VEAGDYMWLR AFCPQSCYAA GPGPFRYLLY KDVNRHMKLR LS
|
| |