Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_3028 |
Symbol | |
ID | 4028994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 3371362 |
End bp | 3374124 |
Gene Length | 2763 bp |
Protein Length | 920 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637968234 |
Product | hypothetical protein |
Protein accession | YP_575071 |
Protein GI | 92115143 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGGAT TCTTCCGCCG TTTTTTCGCG CCACGCTGGC GACATCGCGA CCCCCGCGTT CGCCACATGG CCATCGCCCA ACTCGATAAG CGCAAGCCTG CCGACCACCA AGCCCTCGAA CAATTGGCAC GCGACGAGGA TGCCACGGTC CGGCGCGAGG CACTCGCCCA GTTCGACGAT CCCGCCTTGC TTCTTGCGTG GCTGGAGGCG CACGACAATG CCGAACTCCG GGCCCGGCTC AGCCAGTTGC TGTGCGGCAG CGTCCCCGAC GCTCCGCCCC TGAATCAGCG CCTCACCCTG GTCGACACGC TGGAAGACCA TGACCTGCTC GTGACGCTCA CCGAGGCGGG CGACAACCAG CAATTGCGCC TGGCCGCCCT GGCCCGGCTC GACGACGAAG CGGCCTTGAT CCGCCAGGCC TGCGACAACG GCATCGCCGC GGTGCGTCAC GCCGCCGCCG AACGCGTCAC GAGCCACGCA GGACTGCAAC GTCTGGCGCG CGAGGCGCGA CGCGACAAGG CCGTCATCCG CCAGGCGCGC GAACGCCTGC AGCGCCTGCG GGCCGACGCC GAGGAAACCC AGGCCCAGCA AGCCGCCCGG CAACGCATTC TGGAGGCCCT GGAGGCACAT GCACTGCATG CCTGGGAACC GCTCTACGGC GCCCGCTACC GCCACCTGAT CCGCGAGTGG GAGGCACTCG GCGATACGCC CAGCGACGAG CAGGAGCGTC GTTTCCAGGA AGCCAGCCAG CGGTGCCACA AGACGCTCTC CGATCACGAG ACCGAGGGAC ACGCCCAGCA TCAGGCGCTG CAGCGGCAGG AAGAAGCCGC ACATACCCGC GCCACCCTGG TCGAAGCCCT CGAGGACGCC GTTGCCAGCC TCGCCCAGGC CTCGAGCCTG ACCCATCAGG ACATCGACAG CCTGCGCGCT CAGCAGCGCC TGCATGGCGA GCGCTGGCTT GCGCTCTCCG ATCACTATCC GCCGGATGAC GACATCCAGG CCCGCTATGC CGCGGCACAA GCCGCCTGCC AGCGCGTCAA CGATGCCTGG GATCGCGCCA CGACCCATGC CAAGGCGCTC GAGGATGCCT TGCGCGACGA TCATGCCGCC ATCGACGATT GCCTGGCACG CATCGACTGG CCCAACGATC TGCCGCCGAC ACCGCTGATT CAGGAGGCAC GCCGGTTGCG TCAGGCCGAA ACGGCCGACG GCGATGCGCC CGTCGACCTG GCGGCACTGC AGGAAGAGCT CGACGCACTC GAACATCACC TCGACCGCGG CACCCTGAAA ACCGCCAGTC GCTCGTATCG ACAATTGCGC CAGCGCCTCG ACGCCCTGCC CAGAGCATCG CGCGGCGAGC TTGAAGCACG ACTCAAGCGC CTCGGCGCGC GCCTGGCCGA GCTGCGCGAT TGGCGAGGCT TCGTGGCGGG CCCCAAACGC ACCCAGCTCA TCGACAGCAT CGAGGCACTG GCCGAGGACG ATACCCTGCC CGATGCCGAC CGTGATCGCC GTCACCGGCA GTTGATCAAG GAGTGGGCGG CCCTCGGCGA CGCCGCCGCC ACCGCGGAGC TATCGCAGCG CTTCCGGAGC GCATCGCAAC GCATTCACGA CGCGCTGGCC GACTGGTACC AACGCCGTGA CGACGAACGC AAGCGCAATC TCGAAGCGCG CGAAGCGCTC TGCGAACAGC TGGAAGAGCT CATCGCGCAT CCCGATTCCC GGGCCGACCC GGATATCCTG CGCCAGATCC GCAACCGCGC GCGGGAGCAG TGGCAGCAGT TCTCTCCGGT GCCTCGCGAG CAGGCCAAGC CGCTCGGCCA ACGCTTCGGC CGTGTACGTC ACGAGCTGCA GGCGCTGATC GATCGCCGTG CCGGAGAAAT CGGCGAGGCC AAGCGACAGT TGATCGCCGA GGCCGAAGCC CTGGTGGAAG CCGACATGTC CGCCTCCCAA CGCAGCGAAG ATGCCAAGGC GCTGCAGGCG CGATGGCGCG CCCTGGGACG GGCCGCCAAG GGCGAGGAAC AAATGCTGTG GCGCCAGTTC CGGGGGCTCT GCGATCGCAT TTTTGCCGCA CGCGAGGCCG AGCGTGAAAA TCGCGCGCAA CAAGCCCAGG CACGCCTGGA CAGCATGCAG GCCCTGATCG ATCGCTTCGA TGCCTGGCAG CCTGAGCGGG CCGACGAGAG CGCCACGCTG GATGCAGGCA TTCGCGAAGC CGAGGCGCTG GAGCCCCTGC CCGGTGGACG TCGCAGCGAA GGCATGCGCC GTCGCTGGCA GGGTATCGTG CGGACACGTC GGGAACGCCT CGTGCATTTG GCGCTCGCTG AACAGGCCGA CCGTTGGCAG GTCTTGCGGC CCCTGGTCGA CGCCCATGTC GCGGCGGACG CCCAGATGCT CCAGGGTGAA GACGCCGGGG ACGTCGCCGC CCCCGAGGCC CTGCCGCGCG ACTGGCAACA GGCCCATGCG GCGCGTAACG CATCACGTCG GGAAGGGACA ACCGATACGG CAGAGCACCT GCTGGCGCGA CTCGTCGTTC AGGTGGCGTT GCTCGCCGAC GAACCGGTCG CGGCGGAGGA CGAGCCACTA CGCCTCGAGG TTCAGGTCGC GCGACTCAAC AACGGCCTGG GACAGGCACC GCAACCCGAT CAGGAACTCG CCGAGGTCTT GCGGCAGCTG TTGGCCACGG GCCCGGTGAC GCCGACCGCC TGGACCACCC TGGTTGCACG CTTCGACACG CTGTTCGACG CTCTCGCCCG CCGCGTGGCA TAA
|
Protein sequence | MSGFFRRFFA PRWRHRDPRV RHMAIAQLDK RKPADHQALE QLARDEDATV RREALAQFDD PALLLAWLEA HDNAELRARL SQLLCGSVPD APPLNQRLTL VDTLEDHDLL VTLTEAGDNQ QLRLAALARL DDEAALIRQA CDNGIAAVRH AAAERVTSHA GLQRLAREAR RDKAVIRQAR ERLQRLRADA EETQAQQAAR QRILEALEAH ALHAWEPLYG ARYRHLIREW EALGDTPSDE QERRFQEASQ RCHKTLSDHE TEGHAQHQAL QRQEEAAHTR ATLVEALEDA VASLAQASSL THQDIDSLRA QQRLHGERWL ALSDHYPPDD DIQARYAAAQ AACQRVNDAW DRATTHAKAL EDALRDDHAA IDDCLARIDW PNDLPPTPLI QEARRLRQAE TADGDAPVDL AALQEELDAL EHHLDRGTLK TASRSYRQLR QRLDALPRAS RGELEARLKR LGARLAELRD WRGFVAGPKR TQLIDSIEAL AEDDTLPDAD RDRRHRQLIK EWAALGDAAA TAELSQRFRS ASQRIHDALA DWYQRRDDER KRNLEAREAL CEQLEELIAH PDSRADPDIL RQIRNRAREQ WQQFSPVPRE QAKPLGQRFG RVRHELQALI DRRAGEIGEA KRQLIAEAEA LVEADMSASQ RSEDAKALQA RWRALGRAAK GEEQMLWRQF RGLCDRIFAA REAERENRAQ QAQARLDSMQ ALIDRFDAWQ PERADESATL DAGIREAEAL EPLPGGRRSE GMRRRWQGIV RTRRERLVHL ALAEQADRWQ VLRPLVDAHV AADAQMLQGE DAGDVAAPEA LPRDWQQAHA ARNASRREGT TDTAEHLLAR LVVQVALLAD EPVAAEDEPL RLEVQVARLN NGLGQAPQPD QELAEVLRQL LATGPVTPTA WTTLVARFDT LFDALARRVA
|
| |