Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0769 |
Symbol | |
ID | 4028479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 861223 |
End bp | 862734 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637965935 |
Product | hypothetical protein |
Protein accession | YP_572825 |
Protein GI | 92112897 |
COG category | [S] Function unknown |
COG ID | [COG4320] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.598571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGG AATCGGAACC GACGCGCAAG GCACTCTACG CGCAAGCGCA ACAGCAGGAG ATCTCGGGAC GCTCGCAGAT GAGCAAGGCG GAACTGGCGG CGGCGCTCGA GCGGCAGTCG TTCGAGAAGG CCGCCGCGCC GGTGGTGGCG ACGCGTTTCG ATACCTTCAA GCGCCTGGCC GAGGCAGTGG CGGCTGGCGA GTTCGTGCTG CGCCCGCGCG CACTGACCGG GTTCGAGCGT CGTCGGCACG TGCGCCAGAC GCTGCGCGAG GATCACCAGA CACGCATCGC GGAAGGTTGC GAGGAAGCCG GCGCCAAGTT CGACCAGCTC TCCGACTCGC TGTTTTCGTT CTTTCGCGGC ACGGCGCTGT TGTTCTACCG CGACATGGCC GGCGACGACG CCTGGATGCC CACCGTACTC GCGCTCGGCG ACGTTCACCC CGGCAACTTC GGCGTCATGC CCAACGTGGA CAACGTGCCG ATCTTCTCGG TCAACGATTT CGACGAAGCC TACTACGCGC CCTTCACCTG GGATATCAAA CGCGGCGCGG TCGGCTTCAT GATCGCCTCC GAAACCGAGG GCGAGCTCAA GCACAAGCAT CGCGTCAAGG TCGTGCGTCG TTTCGTGCAG GGCTACATCG AGACGATGGA ACGGCTGGCG CGCGAAGGCA CCGAGCAGGA TGAGGAAATG CGTCACGACA ACGCGCCGAA GCTCATCCGC AAGCTGTTCG AGGACGCCGA TGAAGACCGT GCCGAGTGGC TCGCCGACGA CTACCTGAAC GAGACACGCA GCGGCTTCCG CCCGACCAGG AAACTGGTGC CGATCTCGTC GCGCCGTGAC GAATTCCAGG CGATCACCGA CCGGCTGGTC GAGGAAAACG AGATCGACGT GCCGGCACGC GCCAAGGGAC GAGACAAGCA TGGCATGCAC GTGAAGGACG TGGCGATACG TCTCGGCCAG GGCACCGCCT CGCTGGGGCT CAACCGCTAT TACGTGCTGA TCGAAGGCCC CCGGCGCGAC GGCACGGACG ACTTGATCAT CGAGTACAAG CAGGCCCGGC GCTCCGCCCT GTCCGGCCTG GTACCGCCCT CCGCCTACGA AATGGACACC CTCGCCGAAC GCATCAGCCA TGCCCAGGCC GTGCACCTGA TACGCGGCGA TCTTTTCTAC GGCCACGTCA CGTTCGAGGG GCACAGCTAT CTATCCCGCG AGCGCGCGCC GTTCCGCGAC GACATGGACC TCGACGACCT CTCCAAGAGT GAATGGAAGG AATACGCCCA TATCTGCGGC GGCGTGCTTG CCACCGTGCA CGCCCTCTCG GACGAATCCG GCAAGCTCGA CTACGACATC GAGCCCGCGA TCCTCAATGC CATCGGCCCA CCGGCACTGT TCGCCGAGGA CATGGTGGAA TTCGCCACCG AGGCCGCCGA CCGCCTGCAT CGGGACCACG CAATGTTCCG CGAGGATCAC GCTCGGGGCG CGTTCGAGCA CCTGGACTGG GTACACCGGT GA
|
Protein sequence | MSQESEPTRK ALYAQAQQQE ISGRSQMSKA ELAAALERQS FEKAAAPVVA TRFDTFKRLA EAVAAGEFVL RPRALTGFER RRHVRQTLRE DHQTRIAEGC EEAGAKFDQL SDSLFSFFRG TALLFYRDMA GDDAWMPTVL ALGDVHPGNF GVMPNVDNVP IFSVNDFDEA YYAPFTWDIK RGAVGFMIAS ETEGELKHKH RVKVVRRFVQ GYIETMERLA REGTEQDEEM RHDNAPKLIR KLFEDADEDR AEWLADDYLN ETRSGFRPTR KLVPISSRRD EFQAITDRLV EENEIDVPAR AKGRDKHGMH VKDVAIRLGQ GTASLGLNRY YVLIEGPRRD GTDDLIIEYK QARRSALSGL VPPSAYEMDT LAERISHAQA VHLIRGDLFY GHVTFEGHSY LSRERAPFRD DMDLDDLSKS EWKEYAHICG GVLATVHALS DESGKLDYDI EPAILNAIGP PALFAEDMVE FATEAADRLH RDHAMFREDH ARGAFEHLDW VHR
|
| |