Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_2263 |
Symbol | |
ID | 4568485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2592471 |
End bp | 2594162 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 639766825 |
Product | sulfatase |
Protein accession | YP_912679 |
Protein GI | 119358035 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCTCC CTACTTGCAG GAGCGATGTG AACAAAGGCT CCCGTTCCGT TCGCCTTATT GCCACGTTTG CCATTGTACA ACTGGCGCTA CATCTCTATA CAGTCGTTGA ATCACCGGAC CCGCGTTTGT TTTTGCCGCA TTTTATAGCA TGGAGCCATG ACCTGCTGAT ACTCTCGATT TTGTTTTTCG TCTTCAGCAG AGCAATCGCC CTGTTTCCAT CGCGGTTTCG GAATTGCGCT GAACTCATGA CTCTGCCGAT CATCGCTCTG GCGCTTCTGC CACTGACGCT CTATCCCCGG ATGCTTCGCG AGTACCTCTC CTTTCCAGTG AACCTGTTTA CGGCAACCCC TGCATCAGCA TCTGCGATGC TGACCAAATA TCTTGGGCTT TCGAAACTGA TGCCGGTAGC CTTTGCGGTA GCTTCCGCTC TGGTTGCGCT GCTGATGCTG CCGTTTCCTT CGTGGTCGAA AAAGGGAAAG CTCTTGTTCA CGGTTTTCCG GGGAATTCTG CTTACCGTTG CAATCCTGAC ACTGCCGCGT TCACCCCATC CGGTTGTAAA CAGCCTTAAA GAAGAGATGT CGGCTGTACT TTCTCACGAA CGACGGGAGG TGCCTGCGCT TTTTTCCGCA CCACACCGAC AGGATAATCA GAAGCCTTCA GGTTCCGGCG TCTTGTCGTT GCAGGAAAAA CTGAAAGCGG ACCATATCTA TCTGATTGTG CTCGAAGGGG TGAGTGCAGA TCAGTTCGAG AACGCAATTT CCGGTACAGA GTCAAGGTTT TATCGTCGCA TCTCCAGACA TGCCAGATAT TTCGACCGGT ACTACACGAC CAATCTCGAC TCCTATACCA GCCTGATCGC CATGCTCACA TCCGAGCAGG TCCCGTACCG TTCTTATACC GATACCGGAT TGTACGATGC GGTCAACAAT GCTCCTAACC TTGCACGCAG TTTTAAAGAT ATCGGATTCC ATACTCTTTT TATCAGCATC TACGACGATC AGCCGTTCAT CCCTGTTCGT CGGGACTGGT CGAAAATCAT GCACCGACAT GACCTTCCTG CCGGAAAACA ATGGGTCTCC GTTGAATCAA GCCGCATGGA GTCCGCAACA GAGGACAGGG CTGCGCTTTC GACGCTGGGA AAGCTCCCCT CGCTGTATCC GAAGACTTTC GTTTTGCACG AACTGGCCTA TGGCCACACG ACGGAGTGGC GGGCAAAGAC AGGTATTCCA CAACTTGCTT ATTACGATAC CTATCTGAAT GAACTGCTTG ACCTGCTCAT TGCGAATGGA ACCTGGTCAA AAAGCCTTAT GGTAATCGTT TCGGACCATG GCGACCGGGC GAAAGGAGCG AATACCGAAA GCTATCGTGT GCCGTTGATG ATTGTTGGGC AGGATGTGGC GCAAGGCATC GATCATACGT TTCGCTCTCA TCTGGAGCTG CAGCAGATCA TGGTATCATC GCTAACCGGA AACACCATGC CTGAGCCAAA AAAGGAGGCG ATTGTTGTCG GTTCAACCGA GCGCTGGATA TATGGACTGA TCGATGCTCA CGGCGATCAT CTGGTTATCG ATGATCGTAC CGGCAAAGTT GTCGCATCGA ATGGAAAATT GAGTTCAAAG GCTGTTCACA ACAGATTTCA GGAAATAATC AACAATTTCG GAATGCGTTT TGGTCCGGAA AACGAAAAAT AG
|
Protein sequence | MHLPTCRSDV NKGSRSVRLI ATFAIVQLAL HLYTVVESPD PRLFLPHFIA WSHDLLILSI LFFVFSRAIA LFPSRFRNCA ELMTLPIIAL ALLPLTLYPR MLREYLSFPV NLFTATPASA SAMLTKYLGL SKLMPVAFAV ASALVALLML PFPSWSKKGK LLFTVFRGIL LTVAILTLPR SPHPVVNSLK EEMSAVLSHE RREVPALFSA PHRQDNQKPS GSGVLSLQEK LKADHIYLIV LEGVSADQFE NAISGTESRF YRRISRHARY FDRYYTTNLD SYTSLIAMLT SEQVPYRSYT DTGLYDAVNN APNLARSFKD IGFHTLFISI YDDQPFIPVR RDWSKIMHRH DLPAGKQWVS VESSRMESAT EDRAALSTLG KLPSLYPKTF VLHELAYGHT TEWRAKTGIP QLAYYDTYLN ELLDLLIANG TWSKSLMVIV SDHGDRAKGA NTESYRVPLM IVGQDVAQGI DHTFRSHLEL QQIMVSSLTG NTMPEPKKEA IVVGSTERWI YGLIDAHGDH LVIDDRTGKV VASNGKLSSK AVHNRFQEII NNFGMRFGPE NEK
|
| |