Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2278 |
Symbol | |
ID | 4026431 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2565323 |
End bp | 2567218 |
Gene Length | 1896 bp |
Protein Length | 631 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637967482 |
Product | hypothetical protein |
Protein accession | YP_574327 |
Protein GI | 92114399 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.262759 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCCGG CGCTGTTCGT CCTCGACGAT CCCGCCGGGG TGACCAGTGA CCTCGCCTCA CTACTCGCGG CCCGGGAGAA TGAGTTCAAC GAGCAGGGAG CGATCAAGCG CCCCTTCGTC ACGGCGGGTG TCGTCAATGC CCTGCGCGAT GGATTTCGAG AGCGAGCGCG GCAGCAACGC GCCAACGAGA TCGTAGAAAA CCAGCTCAAG GAAACCTACT ACGGCGCGGA TAAGACCGGC GGACATCCGG GGTTTGAGCA CAGCATCAAG GCCAATTTCG AGCGACGGCA GAAAGAAGAC CCTGCGTTGC GGCAGGAGGT CGAGGGCGTT CGTAGGGAGA CGTTGCGGAA TATCAGCGAG AATGAACTCA AGGACGCCGC CGACGATGCC TGGGGCAAGT ATGGCGACAA GCTACGCCCC GGCGAGCCCG ACGACTGGAT GGAAAATACC TATCGCACGC AGTTGTCAGC ATACGATAAA GACAACATGC GGCCACTGGC ATCGGCCTAT GTGAGCTGGT TGACGGGCAA TCCGCTGCTG ATCTACCTCA ACAGCCGATT TGATGATGCA AATGTAGAGT CGGGTATCAA TTTCGTTTCC GTCATATCCT TCGTGCTGCT GGGTACCCAG AGCTACGCGC CCGCCTTCAC GCAGTATGCC AAATGGCTGT CGGCGAGCGA GATCAGCGAT GACAACCTGC TGCTGAGAGG CATGTGTCTC AATCAGCAGG GGCTGGTCGA GCAGATCATG GCAGCCAGCG ACGTCGACTT TGGCGCGCAA GCCAATAATC CCGCCACCCT GCCATGGGGG CCGTTGATAG CGGCTTATCT TGGATTGACG GAAGATTACC CCGATGAGGT TGCCTCGCCC GCCAGAATGC TCGGCCATGT GCTGGGGCCG CTGCAAAGTG TCGTGGCAAA AGAACAAAAC CCGACGCCAT TTCTGCTCAC ACTGGGGATG ATCTCGGGAG GCTCGGTCCA GATCAACCAA GTCAGCGTGA CGTTCGAGGA AGCCCTATCC CGAACGCGCG GGCCGCTCAA GCTGATCCAC CCCGACTGGG GCGAGAATGA GCTGCGCCAG TTCCAGCAGA AGGTTCGGGT CGAGGCGCGC TCGCTCATCA CGGTCAATGG GGAAACCCCG GTGACCTTCG ATATGCGCCA GATAGACCTC ACTCCGCTAG AGGACTTTAT GGCGGAGACG TCTTCGCCAT CCAGGGCTAT CAGCCTGACG GGACGAGAAA AACTGGAGTT TTCGGCAGGC ATTGTGGGAG TGGCGTTGGC CTATACCAGT CTGAATGATC AGCTTGGGAA GATGAACAGC ATTCTGGAAG CAGGACTCCG ATCTCAGGGG CGTTTACTCG GCATGAGCCT CGCCTTGGGC GGCGGAGCCG CCGAAGTGGT GGGAAAGACG CTGGAGCATG CACGTGGCGT ATTCTTCGCT GAGGCACGTT TTGTCGGTAC ACAGCGTGCA CTTCTGGCCG CCGGGCGATT CCTTGGGTTG GCCGGAGGTT TGATTCTTGG GGCGATGGAT CTATGGGAAG GAGTTGAAAA AATCGGTGAG GATAAAGTGG TGATTGGTTC TTTATATTTA ACCTCCGGTT TCTTTACTGG CACTGCAAGT TTCGTTTTGT TTTTGGCCAG TATCGGTTCC GCTGCTTCTA TTGCCGGTAG CACTGGTCTT CTCGCTACTC TGGCGATAAC CCTATGGTGG ACTGGGTTAA TGCTCGGCGT TATCGCGATT GCGGCGGCTG TTGCCATTTT GTGGTTCACC GAGGATGACC TGCAAGAGTG GCTGGCCGAA TGTGCCTTCG GAGAGCGTGG TGAACCTGGG CCTGGTACGG CAAAGTTGGA AGTCGAAATG CAACGCCTAG AAGCCATAAC TCACAAGGAA GAGTGA
|
Protein sequence | MTPALFVLDD PAGVTSDLAS LLAARENEFN EQGAIKRPFV TAGVVNALRD GFRERARQQR ANEIVENQLK ETYYGADKTG GHPGFEHSIK ANFERRQKED PALRQEVEGV RRETLRNISE NELKDAADDA WGKYGDKLRP GEPDDWMENT YRTQLSAYDK DNMRPLASAY VSWLTGNPLL IYLNSRFDDA NVESGINFVS VISFVLLGTQ SYAPAFTQYA KWLSASEISD DNLLLRGMCL NQQGLVEQIM AASDVDFGAQ ANNPATLPWG PLIAAYLGLT EDYPDEVASP ARMLGHVLGP LQSVVAKEQN PTPFLLTLGM ISGGSVQINQ VSVTFEEALS RTRGPLKLIH PDWGENELRQ FQQKVRVEAR SLITVNGETP VTFDMRQIDL TPLEDFMAET SSPSRAISLT GREKLEFSAG IVGVALAYTS LNDQLGKMNS ILEAGLRSQG RLLGMSLALG GGAAEVVGKT LEHARGVFFA EARFVGTQRA LLAAGRFLGL AGGLILGAMD LWEGVEKIGE DKVVIGSLYL TSGFFTGTAS FVLFLASIGS AASIAGSTGL LATLAITLWW TGLMLGVIAI AAAVAILWFT EDDLQEWLAE CAFGERGEPG PGTAKLEVEM QRLEAITHKE E
|
| |