Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1594 |
Symbol | |
ID | 5733481 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1847174 |
End bp | 1848403 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641278733 |
Product | putative esterase |
Protein accession | YP_001544365 |
Protein GI | 159898118 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2382] Enterochelin esterase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.48746 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGAT TTATGAGTTT ATTCTTGATT TGTGCGCTTT TGGCAAGTTG CGGTGGGGCG GCGGTCAGCT ATGATCCAAC CCAGCCGATC ACTAGTTTTA ATTTGTTGCG CGAGCAATTG GCGAAAACCA ATGATCAAGC CGAGCAACTC AAATTAATCA ACGATTTTAA TGCGGTGTTT CCAACCAGCC CCTTGACCCA AGGCGATCAG GCCTTGTTTA TGGTCGAAAA AGATGCTCCA CGTATGGAAT TATCCAGTGA TCTGACTAAT TGGTTTCAGA CGACTTCGAT GCAACGACTT GGCTCAACCA ACTGGTGGGG CTTGGTACAA ACGATCAGCT CAACCGCACG AATCGATTAT CGCTTTGGGG TTGGCGGTGG TGGCGGTCTG ATGAATGACC CGCGTAATCC AACCTTGGTG CCAAGCAGCA TGGGCATGAA TTCAGAGTTG CGCATGCCCG AATATCTCAC GCCAACCGAA ATTATTTCGC GTAGCGACGT GCCCAAGGGT ACGCTAGAAG ATTTGGGTGA TTACTACACT GACATTAGCA AAACCACCCA CCGCTTGCAT GTCTATCTGC CGGCCAATTA CGATCCAGCA AAGCAATATC CGAGTGTTTA TTTTCAAGAT GGCGATGATT ACCAAAACTA TGCTTTTACG CCAACAATTT TAGATAATGC GATTGCTGAT CAGATTTTGC CGCCGTTGAT TGCGATTTTC GTCAAGCCCA GCCGTGAGCA AGGCCGCCAA CGCGATTATG ATCTCAACGA TGCCTATAGC GAATTTTTCG CCACCGAATT GGTCAGCCTG ATCGACAGCA AATACAGCAC GATCAACGAT CCTAAGCAAC GGGTGGTGGT TGGCGATTCA TATGGTGGCT TGATTTCGCT CTATTTGGCC TTACAATATC CTGAGGTATT TGGCGGTGTG GTCAATCAAT CGGGCTTTGT TTCACGCCAA AATGGCCGTC TGCTGACGCT CATGTCAATT CAGCCGCCAG TTAATGCGCG GATTGTGACC GTGGTCGGCA CATATGAGAC ATGTATTGGC GGGCCAGTGA CTGGCGATGA ATGCAATTTC CTTGAGGGCA ATCGAACTTT GCGCGATATT CTGGTTGGGG CTGGAGTGCA ACTCAACTAT GCTGAATATC CGCAAGGCCA TGCTTGGGCC TTTTGGCGCG ACCACATTGA TCGAGAGGTT GCATGGGCTT TAGATTGGCA AAAACCCTAA
|
Protein sequence | MQRFMSLFLI CALLASCGGA AVSYDPTQPI TSFNLLREQL AKTNDQAEQL KLINDFNAVF PTSPLTQGDQ ALFMVEKDAP RMELSSDLTN WFQTTSMQRL GSTNWWGLVQ TISSTARIDY RFGVGGGGGL MNDPRNPTLV PSSMGMNSEL RMPEYLTPTE IISRSDVPKG TLEDLGDYYT DISKTTHRLH VYLPANYDPA KQYPSVYFQD GDDYQNYAFT PTILDNAIAD QILPPLIAIF VKPSREQGRQ RDYDLNDAYS EFFATELVSL IDSKYSTIND PKQRVVVGDS YGGLISLYLA LQYPEVFGGV VNQSGFVSRQ NGRLLTLMSI QPPVNARIVT VVGTYETCIG GPVTGDECNF LEGNRTLRDI LVGAGVQLNY AEYPQGHAWA FWRDHIDREV AWALDWQKP
|
| |