Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1863 |
Symbol | |
ID | 5733752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2196874 |
End bp | 2197896 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279007 |
Product | cysteine synthase |
Protein accession | YP_001544634 |
Protein GI | 159898387 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAACCC ACGGATTCAC CAAATTAGCC CGTTCACACG ACGGGATTGT TCAACTGATT GGTAATACGC CCTTAATTCC GTTACGTCGG ATTTTTGCTG AGCATCCGAT TCAGCTTTTT GCCAAGCTAG AAATGTTCAA TCCAGGTGGC AGCGTCAAAG ATCGGGTGGC CCGCTCGATT ATTACCAAAG CACTGGCTGA GGGCCGAATC GATCAAGCCA CCACAATCAT CGAATCAAGT TCGGGTAATT TAGGCATTGG CCTCGCTCAG GTCTGTCGTT ATTTTGGCTT ACGCTTTATT TGTATTGTTG ATTCACGCAC GACCAACCAA AATATTCAAA TTCTACGGGC TTATGGGGCT GAGGTTGAAA TTATTACCCA GCCCGACCCT GATTTGTTGA CGGCTCGGAT CAAGCGCGTG CATGCGCTCC AAGCCAGTAT TCCTAATAGC TTTTGGTGTA ATCAATACGC AAATTTGCAT AATCCAATGG CCCATTACGC CACCATGAGC GAGATTCTGG CAAGTTTGCC GAATGTTCCC GACTATCTGT TTTGTGCGAC CAGTAGCTGT GGCACGCTGC GCGGTTGTGC TGAATATATT CGCGAACAAG GTTTAGCGAC CAAGGTGATC GCGGTTGATG CCTTGGGCAG TGTGATTTTT GGCGGCCAAC CCGGCCAGCG CTTGATTCCA GGCCATGGCG CATCACGAAT TCCCGAATTG TTCCAACCCG ATTTAGCGGC TGGTGTAGTC TATGTTTCCG ATGCCGATTG TGTAGCTGGC TGCCATAGCT TGCTCGACCA AGAGGCGATT TTTGCTGGCG GCTCGTCGGG TGGCGTGATT CGCGCTATCG CCCAGATGTT GCCAAGTATG CCCGCTAATG CAACGTGTGT GGCGATTTTG TGTGATCGTG GCGAGCGCTA TCTTGATACC GTTTTCAACC ACGCTTGGAT CGATCAACAT CTCCCAAGTG TCCTGATTAA TCAATCGCAA CCACTGATCG AACGCGAAAA TGTTGTCGGC TAA
|
Protein sequence | MLTHGFTKLA RSHDGIVQLI GNTPLIPLRR IFAEHPIQLF AKLEMFNPGG SVKDRVARSI ITKALAEGRI DQATTIIESS SGNLGIGLAQ VCRYFGLRFI CIVDSRTTNQ NIQILRAYGA EVEIITQPDP DLLTARIKRV HALQASIPNS FWCNQYANLH NPMAHYATMS EILASLPNVP DYLFCATSSC GTLRGCAEYI REQGLATKVI AVDALGSVIF GGQPGQRLIP GHGASRIPEL FQPDLAAGVV YVSDADCVAG CHSLLDQEAI FAGGSSGGVI RAIAQMLPSM PANATCVAIL CDRGERYLDT VFNHAWIDQH LPSVLINQSQ PLIERENVVG
|
| |