Gene Haur_3690 details

Gene Information

Locus tag: Haur_3690
Symbol:
ID: 5735539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4642417
End bp: 4643742
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 49%
IMG OID: 641280842
Product: putative esterase
Protein accession: YP_001546454
Protein GI: 159900207
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2382] Enterochelin esterase and related enzymes
TIGRFAM ID:
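
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4642417
END_BP = 4643742
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


print(span_length(START_BP, END_BP))            # 1326
print(expected_protein_length(GENE_LENGTH_BP))  # 441
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.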


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0739014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTGGTTGA CCTTAATCTT ATTTGTGGTT GGGCTGGGGC TGGCATTTGG CTTAGGAGTG 
ATGGTTGGCC GTCAACGCGG CTTCGCGCTA TTTTCTGCGC CTGAAGCTAC GCCATTGGTT
AATTTTGAGT TTAAAATTCC TGCCGATACC GCGCCTGGTG ATCAACTGTA TCTGACTGGT
TCGTTTAATC AGTGGCGACC GAATGATCCG ACCTATGTGC TTTCGCGCAG TGCCGATATT
GCCTATGGCG CTTGGCCCTT TACCAATGGC TTGCGGCTTG ATTTCAAGTT AACCCGTGGT
TCGTGGTCGA ATGTCGAAAA AGCCGCTGAT GGCAGCGAAA TGCCCAATCG AACTGGAATC
GCCGCCAGCG GTGCGCAAGT TAAGGGTACA GTGGCGGCTT GGGCTGATCG TCAGCGTGAT
GCAGCTAAAA TTTATGATGA GCGGGTTGAA CGGGTCGATT TTTTCAGCCA GGCTTTGGGA
ATTACCCGCA CATTTTATAT TTATCTGCCA ATTGAAACTC GTAGCGATGA AAACCTGCGT
GTACCCAGCC TCTATCTTTT CCGTGGTCAT GAACGCGAAT GGATCAATAA AACTGAAGAT
GGGACGCGCG GTGGCAATCG CAATGTGATT GATGTCTACG AGGAATTACG TCGCCAAGAT
CAGATTGGCC CGATGGTGAT GGTGTTTCCA GGCATGACCA ACGCCAATGA TGGGATTCAT
AGCTTAGGCA TCAATCTCCA TTCACCAGAA TTAGTGGCTG ATCCTTCAAT TGGCACTGGC
TTATTCGAAG ATTTTATCTA TCGCGATTTA ATTCCCTATG TCGAAACCCA TTATCCGGTG
CTATTTGGCG GTGCGCATCG CTCGCTTGAT GGCTTTTCGT TGGGCGGCTT TATCAGTGTT
AATCAAGCTT TGCGCCATCC CAACGAGTGG GCTTCGGTCG GGGCTTACGA TGGCTTATTC
TTTTGGGACG ACCCTGAGAA TGCCGAAATT ATCGCTGCTC GTGATAGTGT TTTCGAACGT
AATTTATTTG ATGCCAATTT TGGCGTGCCG CGCGACCATA CTTTTGCAGC CCAACATAAC
CCATTGACCT TATTGCGAAT TGATGGAGCG CAGGCTTCAA AATTGCAATG GTTGATCGAA
TATGGCCCTG AATCCGCCGA GCCCAATGTT AATTATTATC GTGGGGCACG GCTCGATGAG
TTGCTGCGCG AAGTCGGGGC GCACAATCGG CTCAGCGGGG TTGTGCCAAA TGCCAATCAT
TCATGGCAAA TGGCCGATGA ACATATGCGG CGCAGTTTAC CCTATCACTA TCAACAAACC
CAATAA
 
Protein sequence
MWLTLILFVV GLGLAFGLGV MVGRQRGFAL FSAPEATPLV NFEFKIPADT APGDQLYLTG 
SFNQWRPNDP TYVLSRSADI AYGAWPFTNG LRLDFKLTRG SWSNVEKAAD GSEMPNRTGI
AASGAQVKGT VAAWADRQRD AAKIYDERVE RVDFFSQALG ITRTFYIYLP IETRSDENLR
VPSLYLFRGH EREWINKTED GTRGGNRNVI DVYEELRRQD QIGPMVMVFP GMTNANDGIH
SLGINLHSPE LVADPSIGTG LFEDFIYRDL IPYVETHYPV LFGGAHRSLD GFSLGGFISV
NQALRHPNEW ASVGAYDGLF FWDDPENAEI IAARDSVFER NLFDANFGVP RDHTFAAQHN
PLTLLRIDGA QASKLQWLIE YGPESAEPNV NYYRGARLDE LLREVGAHNR LSGVVPNANH
SWQMADEHMR RSLPYHYQQT Q
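
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. The following is a minimal sketch, not the method used to generate this record. It assumes the sequences are pasted in the space-separated 10-character blocks shown above, and it uses the amino acid assignments of translation table 11, which match the standard code for internal codons. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed above.
print(translate("ATGTGGTTGA CCTTAATCTT ATTTGTGGTT"))  # MWLTLILFVV
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the results to be compared with the protein sequence and the GC content field of this record.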