Gene Haur_3168 details


Gene Information

Locus tag: Haur_3168
Symbol:
ID: 5735040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4000942
End bp: 4001955
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 49%
IMG OID: 641280311
Product: beta-lactamase domain-containing protein
Protein accession: YP_001545933
Protein GI: 159899686
COG category: [R] General function prediction only
COG ID: [COG0491] Zn-dependent hydrolases, including glyoxylases
TIGRFAM ID:
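
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4000942
END_BP = 4001955
GENE_LENGTH_BP = 1014
PROTEIN_LENGTH_AA = 337


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon (assumes an unspliced, complete CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # coordinates agree with Gene Length
print(computed_protein_length == PROTEIN_LENGTH_AA)  # codon count agrees with Protein Length
```

Under these assumptions, 4001955 - 4000942 + 1 gives 1014 bp, which is 338 codons, or 337 residues once the stop codon is excluded.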


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00770543
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGAGCAGC GTGTTTTACG ATTTGAAACA CATACTGGGG TGCGCATCTA CAGCCTTCCT 
GTAGAGGTTT TCCCTCAATT TTATGCCAAT GTGTATCTGG TTATCGCCGA TGGTTTGGTC
TGTTTGATCG ATACTGGCTC CGGTCATGGC AATTCGAATA CCGATCTGGT CGCTGGCTTT
GCCGCAATTC AAGCTCTATG GGGCGAGAAG CTGAGTTTGG CCGATGTTAA TCAGATTATT
TTGACCCACG CTCATATCGA TCATTTTGGC GGCTTAGGGT TTGTGCGCCA ATTTACCCAA
GCTCCGACTG CTATCCACAT TGCTGATCGG CGGGTGATTA CCAATTATGA TGGGCGCTTG
TTGTTTGCTT CCAAGGCATT AAGTCGCTTT TTGCGCCATG CGGGGGTTTC GGCTGAGCAA
CATCAGCATT TGATGGGTAT GTATTTGCAT AATAAAGATG CGTTTCGGCC TCAGCCGATT
GAGCAAACCT TTGAAGATGG CGATCGGCTT TGCAATTTGT TTGAGGTTAT TTACACACCT
GGCCATTGCC CGGGCCAAGT TTGTTTACGT TTGGATGAGG TGTTGTTTAG TGCTGATCAT
ATTTTGGCCC GCACCGCTCC GCATCTTGCG CCCGAAATGA TTACCCCAGG CACAGGCTTG
GAGCATTATA TTAGTGGCTT GCGCCAAGTT GCCAAACTCG ATGGCATTGA ATTGACCTTG
GGCGGCCACG AGCAGCCAAT CGAAAACTTA GCCCAGCGCT TAGAGCAAAT TTTCGCTTCT
AATCAACGTA AAATCGATCG GATTAGCGAT TTGCTCAAGG CCCAGCCGCA AACTGTTGCC
CAGCTTAGCG CCGCCATGTA TCGTGGTTTA CAGGGCTACG ATCAGTTATT GGGCATCGAA
AAAGTGGGCG CATTCGTCGA ATATTTGTAT CTGCGCGGCT ATGTGCGGCC AGTCAACATC
GCCCCCGACA ACGACGACCG CGACCCGATT AATGTGATGT ATGAATTGTT TTAA
 
Protein sequence
MEQRVLRFET HTGVRIYSLP VEVFPQFYAN VYLVIADGLV CLIDTGSGHG NSNTDLVAGF 
AAIQALWGEK LSLADVNQII LTHAHIDHFG GLGFVRQFTQ APTAIHIADR RVITNYDGRL
LFASKALSRF LRHAGVSAEQ HQHLMGMYLH NKDAFRPQPI EQTFEDGDRL CNLFEVIYTP
GHCPGQVCLR LDEVLFSADH ILARTAPHLA PEMITPGTGL EHYISGLRQV AKLDGIELTL
GGHEQPIENL AQRLEQIFAS NQRKIDRISD LLKAQPQTVA QLSAAMYRGL QGYDQLLGIE
KVGAFVEYLY LRGYVRPVNI APDNDDRDPI NVMYELF
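
The gene and protein sequences above can be compared by translating the DNA. The following is a minimal sketch, not the database's own pipeline. It assumes that translation table 11 uses the standard codon assignments and differs only in which codons may act as initiators, and that an initiator codon such as GTG is read as methionine. Only the first and last lines of the gene sequence are translated here, and `translate` is a hypothetical helper written for this illustration.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons assumed for translation table 11 (bacterial, archaeal and plant plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str, initiator: bool = True) -> str:
    """Translate DNA in frame 0, stopping at the first stop codon.

    Whitespace is ignored. If `initiator` is True and the first codon is an
    allowed start codon, it is translated as M regardless of its usual meaning.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and initiator and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record above.
GENE_HEAD = "GTGGAGCAGC GTGTTTTACG ATTTGAAACA CATACTGGGG TGCGCATCTA CAGCCTTCCT"
GENE_TAIL = "GCCCCCGACA ACGACGACCG CGACCCGATT AATGTGATGT ATGAATTGTT TTAA"

head_protein = translate(GENE_HEAD)                   # begins the protein sequence
tail_protein = translate(GENE_TAIL, initiator=False)  # ends the protein sequence
print(head_protein)
print(tail_protein)
```

The tail fragment can be translated on its own because the preceding 16 lines hold 960 bases, a multiple of three, so the last line starts on a codon boundary.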