Gene Haur_3801 details


Gene Information

Locus tag: Haur_3801
Symbol:
ID: 5735665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4771917
End bp: 4772972
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 54%
IMG OID: 641280953
Product: alpha/beta hydrolase fold
Protein accession: YP_001546565
Protein GI: 159900318
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID:
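
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4771917
END_BP = 4772972


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints: 1056 351
```

Under these assumptions the computed values match the reported Gene Length (1056 bp) and Protein Length (351 aa).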


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTCAATC CACCATTGTT AGCTGGATTA ACCGCCCGTG TTGTTCAAAT GCCGCGCTTG 
GCGATGCATG TGATTGAGCG TGGCGACCCA AGCAAGCAGC CAATTGTGCT CGTTCATGGC
AATGTTTCAG CCGCCCGTTT TTGGGAAGAA TTGATGCTGG CCTTGCCCGA TGATTATTAT
GTGATTGCCC CTGATCTGCG TAGTTATGGT CGCTCGGAGC GCTTGCCGCT TGATGCAACC
CGTGGTGTGC GTGACTTCAG CGACGATCTC GATAGCCTGT TGCAAACCCT CAATATTCGC
CGACCGCACT TGGTTGGCTG GTCGTTGGGT GGTAATGTGG TGCTGCAATA TGCCCTCGAT
TATGCCACGA ATGTGCGTTC GCTGACGCTT GTTGCCCCAG GTTCGCCCTA TGGCTACGGC
GGCACGCAAG GCCTCGACGG CCAACCCAAC AGCCATGACT ATGCTGGGAG CGGCGGTGGT
ACGGCCAATG CAGCCTTTGT CGAAGCCTTG AAAAATGAAG ATCGCGGTGA TGGCCCAGTT
TCGCCACGCA CCATTATGAA TAGCTTCTAC TTCAAACCGC CGTTCCGCTC AAGCCGCGAA
GATGTGTTTG TTGATGAAAT GCTTCAGACC TATTGCTCGC CCCAAAATTA CCCTGGTGAT
TTGGTGGCTT CAGCCAATTG GCCGATGGTT GCCCCAGGTA CAGGCGGCGT AAACAATGCG
CTCTCGCCCA AATATCTGAA CCAAAGCAGT TTTGCCAGCA TTCAGCCCCA ACCGCCAGTG
CTCTGGATTC GCGGCGATGC TGACCAAATT GTCTCTGATG CTTCGATGTT TGATTTGTGT
ATGCTTGGGC AATTGGGCTT AGTTCCAGGC TGGCCTGGCG CGGAAACTCA TCCGGCTCAG
CCGATGGTTG GCCAAATGCG GGCAGTGCTT GAAAACTATG CCGCCCACGG TGGCCAATAT
CAAGAAACGG TGCTGGCAAA TTGTGGCCAC TCGCCACAGA TCGAACAGCC CAGTTTGTTC
AATACGGCTC TACTCGATTT TCTGACCCAA GTTTAG
 
Protein sequence
MLNPPLLAGL TARVVQMPRL AMHVIERGDP SKQPIVLVHG NVSAARFWEE LMLALPDDYY 
VIAPDLRSYG RSERLPLDAT RGVRDFSDDL DSLLQTLNIR RPHLVGWSLG GNVVLQYALD
YATNVRSLTL VAPGSPYGYG GTQGLDGQPN SHDYAGSGGG TANAAFVEAL KNEDRGDGPV
SPRTIMNSFY FKPPFRSSRE DVFVDEMLQT YCSPQNYPGD LVASANWPMV APGTGGVNNA
LSPKYLNQSS FASIQPQPPV LWIRGDADQI VSDASMFDLC MLGQLGLVPG WPGAETHPAQ
PMVGQMRAVL ENYAAHGGQY QETVLANCGH SPQIEQPSLF NTALLDFLTQ V
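
The protein sequence and the GC content field can be derived from the gene sequence. The following is a minimal sketch, not the method used to produce this record. It assumes the sequence is pasted as shown, in space-separated blocks of 10, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The names `clean`, `gc_percent`, and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon assignments in NCBI order (first base T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence above.
    first_line = ("ATGCTCAATC CACCATTGTT AGCTGGATTA "
                  "ACCGCCCGTG TTGTTCAAAT GCCGCGCTTG")
    print(translate(first_line))  # prints: MLNPPLLAGLTARVVQMPRL
```

The 20 residues produced from the first 60 bases match the start of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.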