Gene Information |
Locus tag | Haur_1996 |
Symbol | |
ID | 5733885 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2465135 |
End bp | 2466262 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641279140 |
Product | GDSL family lipase |
Protein accession | YP_001544767 |
Protein GI | 159898520 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2755] Lysophospholipase L1 and related esterases |
TIGRFAM ID | |
|
|
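The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions. The following is a minimal Python sketch of that check. The function names are hypothetical, and the assumption that the CDS ends in exactly one stop codon is based on the listed gene sequence, which ends in TAA.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2465135, 2466262)
print(length)                     # 1128
print(protein_length_aa(length))  # 375
```

Both values match the Gene Length (1128 bp) and Protein Length (375 aa) fields of this record.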
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGTA TGATGATGGG ATTATTCGTT GGCTTGATCG CAATCAGTAG CGGGCAAATT CAAGCCAAAC CACAGGTTAC TGTGATCGAT CCGCCAGTTG TAGGCTATCC ACAGCGGATG GTTGCGTTTG GCGATTCAAT TACCCAAGCC TTTCTGGCCG ATGGAAATAT TGGCCAGATT GGCGATAGAC CCCAGTATAG CTGGGCAACC GGCACGAATG CAACGGTCAA TAGTTTGGCC GAGCGCATCC GCAGTAGCAC TGGAGTAATC ACTGCCACCA ATGTTGCGGT CAGTGGCTCA AGCATGAACG CCTTACTAAG CCAAGTTAAT ACGGCCAATA GTGCCAATGC GCAATATGCA ACGATTTTGC TTGGGGCAAA CGATATTTGT CGTTCGAGTG AAAGTGCCAT GACCAGCGTC GCAACCTACC GAGCGCAACT CATCAGCGGC TTAAATCAAT TAACCAGCAA CGAGCCAGAA GCACGAATTT TTATTGCCAG CATTCCCGAT ATCTTTCAAG TCTGGCAAAC CTTCAAAGGC AATCCAACTG CTCGCGCGAT TTGGAACCAA TTTAATGTCT GTCAGTCGAT GTTTGAAAAT CCTGAGTCCA CTGCGCCAGC TGACGTAGAG CGTCGCCAGC GCGTTCGCCA ACGGATTATC GATTTCAACA GCCAATTGAG CGAGGTTTGT AACGACTATT TGCGTTGCCG CTTTGACCAA AATCTGTTGT TTAATGCGCC CATCTCACCA ACGTTGATTA CCGCCGATTA TTATCACCCT TCGATCGTTG GGCAACAGGT ACTAGCAACG AATTTAGCCC AAGCGTCGTT TGATTTTACT GATCAACAAG CTCCAGTATC CACGGTAACA TTTAGCCAGA CACATACCAC ATGGCAAGCT CGCTTGAGTG CCAGCGATGA TCAAGGAGTA CGCGGTTTAG AATATCGTCT CCCCAACCAA ACAACCTGGA CTCGTTATCA GCAGCCGTTT GAGTTGGCTG CTCAGGCCAC GCTGATTGTC CGCGCGGTTG ATATTAATGG CAATACTGAG GGGTCGCGTG CCTGGACAGC ACCGCCAATT AACCAACCTA CGTATAAACT ATTCTTGCCG TTTGTCATTC GTAACTAA
|
Protein sequence | MRRMMMGLFV GLIAISSGQI QAKPQVTVID PPVVGYPQRM VAFGDSITQA FLADGNIGQI GDRPQYSWAT GTNATVNSLA ERIRSSTGVI TATNVAVSGS SMNALLSQVN TANSANAQYA TILLGANDIC RSSESAMTSV ATYRAQLISG LNQLTSNEPE ARIFIASIPD IFQVWQTFKG NPTARAIWNQ FNVCQSMFEN PESTAPADVE RRQRVRQRII DFNSQLSEVC NDYLRCRFDQ NLLFNAPISP TLITADYYHP SIVGQQVLAT NLAQASFDFT DQQAPVSTVT FSQTHTTWQA RLSASDDQGV RGLEYRLPNQ TTWTRYQQPF ELAAQATLIV RAVDINGNTE GSRAWTAPPI NQPTYKLFLP FVIRN
|
| |
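The sequences above are printed in space-separated blocks of ten characters. The sketch below, in Python, shows one way to strip that grouping, compute GC content, and translate the gene sequence for comparison with the protein sequence. All function names are hypothetical. The codon table is the standard one, whose codon-to-amino-acid assignments translation table 11 shares, and the code assumes the gene sequence is already given in coding orientation (it starts with ATG even though the gene lies on the minus strand).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' marks a stop).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that groups the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 21 bases of the gene sequence listed above.
print(translate("ATGCGTCGTA TGATGATGGG A"))  # MRRMMMG
```

Passing the full Gene sequence field to `translate` and comparing the result with the cleaned Protein sequence field is a quick consistency check, and `gc_content` on the same input can be compared with the GC content field.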