Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4016 |
Symbol | |
ID | 5735877 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5122957 |
End bp | 5124075 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641281166 |
Product | cystathionine gamma-synthase |
Protein accession | YP_001546776 |
Protein GI | 159900529 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.683871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACCTG AAACCCTGTT AATTCATGCT GGTCGTAGCG TCGATGCTGC GACCAGTGCA GTTACGCCGC CGATTCATCT TGCCAGTACC TTTGAACGAG CGGCTGATGG TAGTTTGCCC CATGGGTTTG TCTATACACG TTTTGGTAAC CCCACCCGCC AAGCCTTGGA ACAAGCCTTA GCGGCACTTG AAGGTGGAGT TGAGGCGGCG GCATTTGGCT CTGGTTCGGC GGCTACAACC GCAGTGCTGC AAAGTTTAGC GCCAGGCCAG CGCGTGTTGT TGCCACGCGA TTGCTACAAC GGCACGGCCA ATTTGGTGCG TCAGGTATTT GCTCAACTTG ATGCTCAATT TGTCGATATG ACCGACTTGG CGGCAGTGCA AGCAGCGTTG GAGCCAGCCC CAGCCTTGGT TTGGCTCGAA ACGCCATCCA ACCCAACCTT GCGATTAACA GATTTGGCGG CGGTGAGCAA TTTGGCGCAT GCGGTTGGGG CATTGGTGGT TTGCGATAAT ACCTGGGCCA CGCCGCTTGG CCAGCGCCCA TTTGATTTGG GCGTGGATTT GGTGATGCAT TCGACCACCA AATATCTTGG CGGCCATAGC GATGTGCTGG GTGGTGCATT AATTACCAAA ACTGTAACAC CTTGGTGGCA ACGGTTGCAG CAAATTCATG TGCTGGCCGG AGCCGTGCCC TCGCCGTTTG AATGTTGGCT GATTTTGCGC GGCATGCAAA GTTTGGCCTA TCGGCTGCGC GGCCATTGTG CCAATGCTTT GGCGGTGGCT GAGTGGTTGG CACAACATCC CAAGGTGCAG GCTGTGCATT ATCCTGGCTT GACAAGCCAT CCTCAATTTG AATTGGCTCA ACGCCAAATG CTGCTGATGG GCGGTATGGT TTCGTTCGAG GTAGTTGGTG GTGCGGCTGA AGCGATTGCA GTAGCGGCCC ACGTTAAGTT ATGGACGCGG GCAACCAGCC TTGGTGGCCC TGAAAGTTTG ATCGAGCATC GGGCCACACT CGAAGGCCCA GATTCGCCAA CCCCGCCAGC CTTGTTGCGG CTTTCGGTCG GCCTCGAACA CCCCGATGAT TTGATTGCCG ATTTGGCCCA AGCCTTGGCA GTATTGTAG
|
Protein sequence | MKPETLLIHA GRSVDAATSA VTPPIHLAST FERAADGSLP HGFVYTRFGN PTRQALEQAL AALEGGVEAA AFGSGSAATT AVLQSLAPGQ RVLLPRDCYN GTANLVRQVF AQLDAQFVDM TDLAAVQAAL EPAPALVWLE TPSNPTLRLT DLAAVSNLAH AVGALVVCDN TWATPLGQRP FDLGVDLVMH STTKYLGGHS DVLGGALITK TVTPWWQRLQ QIHVLAGAVP SPFECWLILR GMQSLAYRLR GHCANALAVA EWLAQHPKVQ AVHYPGLTSH PQFELAQRQM LLMGGMVSFE VVGGAAEAIA VAAHVKLWTR ATSLGGPESL IEHRATLEGP DSPTPPALLR LSVGLEHPDD LIADLAQALA VL
|
| |