Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1077 |
Symbol | |
ID | 4028092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1216481 |
End bp | 1218871 |
Gene Length | 2391 bp |
Protein Length | 796 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637966255 |
Product | hypothetical protein |
Protein accession | YP_573133 |
Protein GI | 92113205 |
COG category | [S] Function unknown |
COG ID | [COG3410] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGTCT ATAACGCCAC CAAGCAGCAG TTCATCGATG ATGTGCGTGC GAATGTAATA AGCGATGCCA TCGAGAACGA GGTGGCGCGC AGACTGAACC GAAACTCGCC GCGTGGTGAG GTGACGTCAT GGGAGAATTC CCTTCGCTTC ATGATGAGCG TTCTGCTCGA TGAGGGAATT CCCGCGTCGG CCGGGGTGGC CATCGAGTAC AACATTCCGC TGACCAATCG CCGCGTGGAC TTCATCCTGA CCGGCAAGAA TCACCAGCGA GACGATGCCG CCGTCATCGT GGAACTGAAG CAGTGGCAGT CCGTCGAGGT GACCAAAAAG GATGCCATCG TGCGTACTCA GCTTGGTGGC GGTGTCCGCG AGACCAATCA CCCGTCGTAT CAGGCATGGT CCTATGGCGC GCTGATCGAG GACTACAACG AGACGGTACG CTCCGAATCG ATCCGCCTGG TGCCCTGTGC CTACCTGCAC AACATGAAGC AGGCCGATGC CATCAATGAT CCCTTCTACG AGCACCACAC AAGCCGTGCG CCGGTCTTCA TCTCGCAGGA TGCCCTGAAG CACTCGGAGT TTCTGAGCCA GCACATCAAG TACGGCGACA GTAACGACAT CATGTATCGC ATCGAGCACG GTGTGATCAA GCCGAGCAAG AACCTGGCGG ATGCGCTGGC CTCGATGATG CAGGGCAATG CCGAGTTCCT GATGATCGAC GAGCAGAAGC TGGTCTATGA AACCGCCATC GACCTTGCGC ATCGTGCGGA AAAAGGCGAC AAGCAGTGTC TGATCGTCAA GGGCGGGCCG GGCACCGGCA AGTCGGTAGT GGCCATCAAC CTGCTGGTGG AGCTGACGAA GCGCGAGATG ATGACGCAGT ACGTCTCGCG CAACTCGGCT CCTCGGGAGG TCTTCAAGAA GAAGCTGACC GGCACGCGCA AGAAGACCCA CATCGACAAC CTGTTCAAGG GCTCGGGCAG CTACGTCAAC GCCGAGCCGG ATACCTTCCA TGCGCTGATC GTTGACGAGG CGCACCGCCT GAACGAAAAG TCCGGCATGT ATCAGAACCT GGGCGAGAGC CAGATCAAGG AGGTGATCGC CGCGTCACGC TTCTCGATCT TCTTCATCGA TGAAGCCCAG CGCGTGACAT TGAAGGATGT CGGGACGGTC GAGGAGATCC AGCGGCGTGC GGCAGAGTGC GGCGCCAATG TTCACGAGCT CGAGCTGGCC TCGCAGTTTC GCTGCAACGG CTCGGACGGT TACCTGGCAT GGCTCGACCA CGCTCTTCAG ATTCGCACCA CGGCCAATAC CGATCTGGAG GACATCGACT ACGACTTCCA GGTGTTCGAT GACCCGAGTG CCATGCGCCG GGCGATTCTC GAGAAGAACC GCAAGGCCAA CAAGGCACGC ATGGTGGCGG GCTACTGCTG GCCCTGGGCG AGCAAGAAAG ACAAGCACGC CATGGACATC ACGTTCCCCG AACATGGCTT TGCTGCCCAG TGGAACCTCC ATGACGACGG CATGCTGTGG GCCGTCGCCG AGGATTCCGT TGAACAGGTC GGCTGCATTC ACACCTGCCA GGGGCTCGAA TTCGACTATG TCGGCGTCAT CATCGGCGAC GACTTCGTGA TTCGTGACGG TCGCGTGGTG ACGGACGCGG GGAAGCGGAC AGGGCAGGAT CGTTCCGTGC ATGGCTACAA GAAGATGCTC AAGGAGCAGC CCGAGCATGC CCGCGCCCTT GCGGACCAGA TCGTCAAGAA CACTTATCGC ACGCTGATGA CGCGGGGCCA AAAGGGCTGT TACGTCTATG CCACAGACCC CGAGACTCGC GAGTACTTCG CGGCCTTTGC TCGCTCTCAA GCAGCGACAG AGCGACTGGA CGCATCTGAA GATGCCGAGA TTCTGGATGG CTTGCACCTT CCCATCGTGA CGCGTGACCA GGCAGCGCCG TTCGAGCGCC ATGTGCCCGT GTACGACTTG AGCATTGCCG CCGGCGAGTT CAGCGAGATG CAAATCGCCG AAGCCGAGCA CTGGGTCGAG CTGCCGGACT TCATGCGAGT GAGCCCCGAT CTGTTCGTCA GCCGAGTAGT AGGGGAGTCG ATGAACCGGC GCATCCCCAA TGGCGCCTGG TGCCTGTTCC GTATGAACCC AGGCGGTACG CGGCAAGGCA AGGTCGTCGT GGTGCAGCAT CGCGCCATTG AGGACCCGGA CCACGGCGGC AGCTTCACCA TCAAGCTATA CCAGAGCGAG AAGATTGAGG AGTACGGCGA GTTCGTGAAC CAGCGCATCG TGCTCAAGCC CCAGACCAAC GCCTTCGGCT ACAAGGACAT CGTGCTCGAG GACGAGCTTG AGGACCTGAA GGTCATCGGC GAATTCCTCT CCGTGCTGTG A
|
Protein sequence | MLVYNATKQQ FIDDVRANVI SDAIENEVAR RLNRNSPRGE VTSWENSLRF MMSVLLDEGI PASAGVAIEY NIPLTNRRVD FILTGKNHQR DDAAVIVELK QWQSVEVTKK DAIVRTQLGG GVRETNHPSY QAWSYGALIE DYNETVRSES IRLVPCAYLH NMKQADAIND PFYEHHTSRA PVFISQDALK HSEFLSQHIK YGDSNDIMYR IEHGVIKPSK NLADALASMM QGNAEFLMID EQKLVYETAI DLAHRAEKGD KQCLIVKGGP GTGKSVVAIN LLVELTKREM MTQYVSRNSA PREVFKKKLT GTRKKTHIDN LFKGSGSYVN AEPDTFHALI VDEAHRLNEK SGMYQNLGES QIKEVIAASR FSIFFIDEAQ RVTLKDVGTV EEIQRRAAEC GANVHELELA SQFRCNGSDG YLAWLDHALQ IRTTANTDLE DIDYDFQVFD DPSAMRRAIL EKNRKANKAR MVAGYCWPWA SKKDKHAMDI TFPEHGFAAQ WNLHDDGMLW AVAEDSVEQV GCIHTCQGLE FDYVGVIIGD DFVIRDGRVV TDAGKRTGQD RSVHGYKKML KEQPEHARAL ADQIVKNTYR TLMTRGQKGC YVYATDPETR EYFAAFARSQ AATERLDASE DAEILDGLHL PIVTRDQAAP FERHVPVYDL SIAAGEFSEM QIAEAEHWVE LPDFMRVSPD LFVSRVVGES MNRRIPNGAW CLFRMNPGGT RQGKVVVVQH RAIEDPDHGG SFTIKLYQSE KIEEYGEFVN QRIVLKPQTN AFGYKDIVLE DELEDLKVIG EFLSVL
|
| |