Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1620 |
Symbol | |
ID | 4028224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1840099 |
End bp | 1841298 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 637966809 |
Product | hypothetical protein |
Protein accession | YP_573672 |
Protein GI | 92113744 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.583527 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTTGT CTCATTGGAG CTATGCAGTG GTGGTGGGCC TGTGCCTGGG GGCCGCCGGC AGCAAGGCGC TGGCGGCAGG GCCGCCGCTG GCGCCGGACA TGGTGAACGC CCTGGAGTCG CTGCAGCAGC GCCTGCAGGA CGACGGGGCC AGCCAGGACG AGATCGACGA CGCCAAGGCG GCGGCACGCC GACTGCAAGG GGGCAATGCG GCGGACCGCT GGGCGCGGGC GCTATTCCTT CAACTGGCGG CCACCGGTGA AGCCCGGCGG GGGCGTGACG GCGCCGCCGC CGATCTGTAT CGCCAGGCGC GGCGCATCGA CGGTGTCGAC GGCGACTCGC GCCGGCGCTG GCTCGATCAG GAAGCTCGCC TGCGGCTGCG TGCCGGCCAG ACCGCGCAAG GCGCCGAGCT GCTCGGCGAG TGGATCGAGC GTCACGGTGG CGACCGTGAC AGCCTGTGGT TGATGGCGCA GGCGCAGGCG ACGCTGGAGC ACTGGTCACA GGCCGCGAAC TGGGTCGACC GTGCACGCCG CGCCGGTGGC ATGAACGATA CGCGTCGTGC CCTGGCGGCG AGCGTCTATC AGCATGCCGA GCGCTATGAG GCGGCGCTGG GATTGCTCGA CACGGCGCTC GAGGGGAAGG GCGACGACCC CGATGCCTGG CGGCGCGCGG CGGCCCTGGC GCAGCGCATG CAGCGTCCCG GCCTGGCCGC CGCGTTGTGG GAAGCCGGTT GGCGGCGGGG GGCGCTGCAA GGGCGCGAGG CGCTCGAGCA GTTGATACGC CTGCACGTGG CGGGCGGTAC GCCGGCACGC GCCGCCGAGT ACCTGCAGGC CGCATTCGAG GACGGTACCT TGCCCCGCGA TGTCGAGCAT CAGCGGCGCC TCGCCGAAGC CTGGACGGCG GCGCGCGCCC ATCGCGAGGC GCTGTCGGCG TGGCGCACGC TGGCCGAACA GACCCGAGCC GCCGAGGACT GGCGGCAGCT CGGCGAACTG GCCTATGGAT GGGGCGAATG GTCGACGGCG GTGGAGGCAC TGCGCGAGGC TCGCGAGCAG GGCGCCGATC CCGCGCGAAC CTGGCTGCTC GAAGGTGTGT CCCTGCTGGA ACTGTCTCGC GAGGAAGACG CACGGCAGGC TTTCGAGGCG GCCCGCGACG CGGGCGCGTC GCAGGCCGAG GACTGGCTGG CATCGCTCGA CGCCGACTGA
|
Protein sequence | MRLSHWSYAV VVGLCLGAAG SKALAAGPPL APDMVNALES LQQRLQDDGA SQDEIDDAKA AARRLQGGNA ADRWARALFL QLAATGEARR GRDGAAADLY RQARRIDGVD GDSRRRWLDQ EARLRLRAGQ TAQGAELLGE WIERHGGDRD SLWLMAQAQA TLEHWSQAAN WVDRARRAGG MNDTRRALAA SVYQHAERYE AALGLLDTAL EGKGDDPDAW RRAAALAQRM QRPGLAAALW EAGWRRGALQ GREALEQLIR LHVAGGTPAR AAEYLQAAFE DGTLPRDVEH QRRLAEAWTA ARAHREALSA WRTLAEQTRA AEDWRQLGEL AYGWGEWSTA VEALREAREQ GADPARTWLL EGVSLLELSR EEDARQAFEA ARDAGASQAE DWLASLDAD
|
| |