Gene Haur_1594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1594 
Symbol 
ID5733481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1847174 
End bp1848403 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content49% 
IMG OID641278733 
Productputative esterase 
Protein accessionYP_001544365 
Protein GI159898118 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2382] Enterochelin esterase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.48746 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGAT TTATGAGTTT ATTCTTGATT TGTGCGCTTT TGGCAAGTTG CGGTGGGGCG 
GCGGTCAGCT ATGATCCAAC CCAGCCGATC ACTAGTTTTA ATTTGTTGCG CGAGCAATTG
GCGAAAACCA ATGATCAAGC CGAGCAACTC AAATTAATCA ACGATTTTAA TGCGGTGTTT
CCAACCAGCC CCTTGACCCA AGGCGATCAG GCCTTGTTTA TGGTCGAAAA AGATGCTCCA
CGTATGGAAT TATCCAGTGA TCTGACTAAT TGGTTTCAGA CGACTTCGAT GCAACGACTT
GGCTCAACCA ACTGGTGGGG CTTGGTACAA ACGATCAGCT CAACCGCACG AATCGATTAT
CGCTTTGGGG TTGGCGGTGG TGGCGGTCTG ATGAATGACC CGCGTAATCC AACCTTGGTG
CCAAGCAGCA TGGGCATGAA TTCAGAGTTG CGCATGCCCG AATATCTCAC GCCAACCGAA
ATTATTTCGC GTAGCGACGT GCCCAAGGGT ACGCTAGAAG ATTTGGGTGA TTACTACACT
GACATTAGCA AAACCACCCA CCGCTTGCAT GTCTATCTGC CGGCCAATTA CGATCCAGCA
AAGCAATATC CGAGTGTTTA TTTTCAAGAT GGCGATGATT ACCAAAACTA TGCTTTTACG
CCAACAATTT TAGATAATGC GATTGCTGAT CAGATTTTGC CGCCGTTGAT TGCGATTTTC
GTCAAGCCCA GCCGTGAGCA AGGCCGCCAA CGCGATTATG ATCTCAACGA TGCCTATAGC
GAATTTTTCG CCACCGAATT GGTCAGCCTG ATCGACAGCA AATACAGCAC GATCAACGAT
CCTAAGCAAC GGGTGGTGGT TGGCGATTCA TATGGTGGCT TGATTTCGCT CTATTTGGCC
TTACAATATC CTGAGGTATT TGGCGGTGTG GTCAATCAAT CGGGCTTTGT TTCACGCCAA
AATGGCCGTC TGCTGACGCT CATGTCAATT CAGCCGCCAG TTAATGCGCG GATTGTGACC
GTGGTCGGCA CATATGAGAC ATGTATTGGC GGGCCAGTGA CTGGCGATGA ATGCAATTTC
CTTGAGGGCA ATCGAACTTT GCGCGATATT CTGGTTGGGG CTGGAGTGCA ACTCAACTAT
GCTGAATATC CGCAAGGCCA TGCTTGGGCC TTTTGGCGCG ACCACATTGA TCGAGAGGTT
GCATGGGCTT TAGATTGGCA AAAACCCTAA
 
Protein sequence
MQRFMSLFLI CALLASCGGA AVSYDPTQPI TSFNLLREQL AKTNDQAEQL KLINDFNAVF 
PTSPLTQGDQ ALFMVEKDAP RMELSSDLTN WFQTTSMQRL GSTNWWGLVQ TISSTARIDY
RFGVGGGGGL MNDPRNPTLV PSSMGMNSEL RMPEYLTPTE IISRSDVPKG TLEDLGDYYT
DISKTTHRLH VYLPANYDPA KQYPSVYFQD GDDYQNYAFT PTILDNAIAD QILPPLIAIF
VKPSREQGRQ RDYDLNDAYS EFFATELVSL IDSKYSTIND PKQRVVVGDS YGGLISLYLA
LQYPEVFGGV VNQSGFVSRQ NGRLLTLMSI QPPVNARIVT VVGTYETCIG GPVTGDECNF
LEGNRTLRDI LVGAGVQLNY AEYPQGHAWA FWRDHIDREV AWALDWQKP