Gene Haur_1996 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1996 
Symbol 
ID5733885 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2465135 
End bp2466262 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content49% 
IMG OID641279140 
ProductGDSL family lipase 
Protein accessionYP_001544767 
Protein GI159898520 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2755] Lysophospholipase L1 and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTCGTA TGATGATGGG ATTATTCGTT GGCTTGATCG CAATCAGTAG CGGGCAAATT 
CAAGCCAAAC CACAGGTTAC TGTGATCGAT CCGCCAGTTG TAGGCTATCC ACAGCGGATG
GTTGCGTTTG GCGATTCAAT TACCCAAGCC TTTCTGGCCG ATGGAAATAT TGGCCAGATT
GGCGATAGAC CCCAGTATAG CTGGGCAACC GGCACGAATG CAACGGTCAA TAGTTTGGCC
GAGCGCATCC GCAGTAGCAC TGGAGTAATC ACTGCCACCA ATGTTGCGGT CAGTGGCTCA
AGCATGAACG CCTTACTAAG CCAAGTTAAT ACGGCCAATA GTGCCAATGC GCAATATGCA
ACGATTTTGC TTGGGGCAAA CGATATTTGT CGTTCGAGTG AAAGTGCCAT GACCAGCGTC
GCAACCTACC GAGCGCAACT CATCAGCGGC TTAAATCAAT TAACCAGCAA CGAGCCAGAA
GCACGAATTT TTATTGCCAG CATTCCCGAT ATCTTTCAAG TCTGGCAAAC CTTCAAAGGC
AATCCAACTG CTCGCGCGAT TTGGAACCAA TTTAATGTCT GTCAGTCGAT GTTTGAAAAT
CCTGAGTCCA CTGCGCCAGC TGACGTAGAG CGTCGCCAGC GCGTTCGCCA ACGGATTATC
GATTTCAACA GCCAATTGAG CGAGGTTTGT AACGACTATT TGCGTTGCCG CTTTGACCAA
AATCTGTTGT TTAATGCGCC CATCTCACCA ACGTTGATTA CCGCCGATTA TTATCACCCT
TCGATCGTTG GGCAACAGGT ACTAGCAACG AATTTAGCCC AAGCGTCGTT TGATTTTACT
GATCAACAAG CTCCAGTATC CACGGTAACA TTTAGCCAGA CACATACCAC ATGGCAAGCT
CGCTTGAGTG CCAGCGATGA TCAAGGAGTA CGCGGTTTAG AATATCGTCT CCCCAACCAA
ACAACCTGGA CTCGTTATCA GCAGCCGTTT GAGTTGGCTG CTCAGGCCAC GCTGATTGTC
CGCGCGGTTG ATATTAATGG CAATACTGAG GGGTCGCGTG CCTGGACAGC ACCGCCAATT
AACCAACCTA CGTATAAACT ATTCTTGCCG TTTGTCATTC GTAACTAA
 
Protein sequence
MRRMMMGLFV GLIAISSGQI QAKPQVTVID PPVVGYPQRM VAFGDSITQA FLADGNIGQI 
GDRPQYSWAT GTNATVNSLA ERIRSSTGVI TATNVAVSGS SMNALLSQVN TANSANAQYA
TILLGANDIC RSSESAMTSV ATYRAQLISG LNQLTSNEPE ARIFIASIPD IFQVWQTFKG
NPTARAIWNQ FNVCQSMFEN PESTAPADVE RRQRVRQRII DFNSQLSEVC NDYLRCRFDQ
NLLFNAPISP TLITADYYHP SIVGQQVLAT NLAQASFDFT DQQAPVSTVT FSQTHTTWQA
RLSASDDQGV RGLEYRLPNQ TTWTRYQQPF ELAAQATLIV RAVDINGNTE GSRAWTAPPI
NQPTYKLFLP FVIRN