Gene Information |
Locus tag | Haur_3801 |
Symbol | |
ID | 5735665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4771917 |
End bp | 4772972 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641280953 |
Product | alpha/beta hydrolase fold |
Protein accession | YP_001546565 |
Protein GI | 159900318 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | |
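The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the terminal stop codon; neither convention is stated in the record itself. The helper names `span_length` and `expected_protein_length` are hypothetical and introduced only for illustration.

```python
# Values copied from the Gene Information table above.
start_bp = 4771917
end_bp = 4772972
gene_length_bp = 1056
protein_length_aa = 351


def span_length(start, end):
    """Length of a closed interval (assumes 1-based inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one (assumes a single terminal stop codon)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1056
print(expected_protein_length(gene_length_bp))  # 351
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields of the record.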
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCAATC CACCATTGTT AGCTGGATTA ACCGCCCGTG TTGTTCAAAT GCCGCGCTTG GCGATGCATG TGATTGAGCG TGGCGACCCA AGCAAGCAGC CAATTGTGCT CGTTCATGGC AATGTTTCAG CCGCCCGTTT TTGGGAAGAA TTGATGCTGG CCTTGCCCGA TGATTATTAT GTGATTGCCC CTGATCTGCG TAGTTATGGT CGCTCGGAGC GCTTGCCGCT TGATGCAACC CGTGGTGTGC GTGACTTCAG CGACGATCTC GATAGCCTGT TGCAAACCCT CAATATTCGC CGACCGCACT TGGTTGGCTG GTCGTTGGGT GGTAATGTGG TGCTGCAATA TGCCCTCGAT TATGCCACGA ATGTGCGTTC GCTGACGCTT GTTGCCCCAG GTTCGCCCTA TGGCTACGGC GGCACGCAAG GCCTCGACGG CCAACCCAAC AGCCATGACT ATGCTGGGAG CGGCGGTGGT ACGGCCAATG CAGCCTTTGT CGAAGCCTTG AAAAATGAAG ATCGCGGTGA TGGCCCAGTT TCGCCACGCA CCATTATGAA TAGCTTCTAC TTCAAACCGC CGTTCCGCTC AAGCCGCGAA GATGTGTTTG TTGATGAAAT GCTTCAGACC TATTGCTCGC CCCAAAATTA CCCTGGTGAT TTGGTGGCTT CAGCCAATTG GCCGATGGTT GCCCCAGGTA CAGGCGGCGT AAACAATGCG CTCTCGCCCA AATATCTGAA CCAAAGCAGT TTTGCCAGCA TTCAGCCCCA ACCGCCAGTG CTCTGGATTC GCGGCGATGC TGACCAAATT GTCTCTGATG CTTCGATGTT TGATTTGTGT ATGCTTGGGC AATTGGGCTT AGTTCCAGGC TGGCCTGGCG CGGAAACTCA TCCGGCTCAG CCGATGGTTG GCCAAATGCG GGCAGTGCTT GAAAACTATG CCGCCCACGG TGGCCAATAT CAAGAAACGG TGCTGGCAAA TTGTGGCCAC TCGCCACAGA TCGAACAGCC CAGTTTGTTC AATACGGCTC TACTCGATTT TCTGACCCAA GTTTAG
|
Protein sequence | MLNPPLLAGL TARVVQMPRL AMHVIERGDP SKQPIVLVHG NVSAARFWEE LMLALPDDYY VIAPDLRSYG RSERLPLDAT RGVRDFSDDL DSLLQTLNIR RPHLVGWSLG GNVVLQYALD YATNVRSLTL VAPGSPYGYG GTQGLDGQPN SHDYAGSGGG TANAAFVEAL KNEDRGDGPV SPRTIMNSFY FKPPFRSSRE DVFVDEMLQT YCSPQNYPGD LVASANWPMV APGTGGVNNA LSPKYLNQSS FASIQPQPPV LWIRGDADQI VSDASMFDLC MLGQLGLVPG WPGAETHPAQ PMVGQMRAVL ENYAAHGGQY QETVLANCGH SPQIEQPSLF NTALLDFLTQ V
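The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of plain Python. This is an illustrative example, not the tool that produced the record: it builds the codon table from the standard NCBI ordering (translation table 11 uses the same amino-acid assignments as the standard code and differs only in its permitted start codons), strips the 10-base spacing used in the record, and translates up to the first stop codon. The function names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first 30 bases of the gene are used here; the full sequence from the record can be pasted in the same way. Because the gene lies on the minus strand, the sketch assumes the record already lists it in coding orientation, which the leading ATG and trailing TAG suggest.

```python
from itertools import product

# Codon table in the standard NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between 10-character blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
gene_prefix = "ATGCTCAATC CACCATTGTT AGCTGGATTA"
print(translate(gene_prefix))  # MLNPPLLAGL
```

The printed peptide matches the first 10-residue block of the protein sequence in the record. Applying `gc_percent` to the full gene sequence gives a value that can be compared with the GC content field.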
|
| |