Gene Haur_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1643 
Symbol 
ID5733527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1900091 
End bp1901485 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content53% 
IMG OID641278782 
Productbeta-lactamase domain-containing protein 
Protein accessionYP_001544414 
Protein GI159898167 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG0491] Zn-dependent hydrolases, including glyoxylases
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTTCC AACGGTTGTA TCACGATGGT TTAGCCCAAG CAAGTTATTT AGTCGGTTGC 
CAAAAAACTG GCGAAGCTTT GGTCATTGAT CCCCATCGTG ATAGTGCAGT GTATTTGAAT
GCAGCGCAAG CGGCTGGTTT GCGGATTAGC GCAATCACCG AAACCCATAT TCACGCCGAT
TTTGTCTCGG GCAGCCGCGA GCTGGCGCAA ATTACGGGCG CAAAACTGTA TCTTTCGGCA
GCAGGCACAG CAGCTTGGCA ATATGCTTTT GCCAGCAGCA ACCCCAATGT GCAGTTGATC
AACGATGGCG ATAGCTGGAT GGTTGGCAAT TTGAAGATCG AGGTCTTGCA TACCCCAGGC
CATACGCCAG AACATGTGAT TTTTATGCTC ACCGACACGC CTGCTGGCGA TGAACCAATG
GGTATTTTCA CAGGCGATTT GTTGTTTGCT GGCGATGTTG GCCGCCCTGA CTTGCTCGAA
AAAGCCGCTG GGATTCAAGG CACCAGCGAG GCCGGAGCCA AAGATTTGTT TAAATCGTTG
CAGCGCATCG CTTCCTTGCC CGATTTCTTG CAAGTTTGGC CAGGCCATGG TTCGGGCAGC
GCTTGTGGCA AAGCACTGGG AGCTGTGCCG CAAAGTACCT TGGGCTACGA AAAGAAAGTT
AATTGGGCCT TCAAACAGCA GAACGAGGCT GATTTTGTGG CCGCAGTGCT CGATGGTCAG
CCAACGCCAC CGCCCTATTT CGCCGCGATG AAGCGCATCA ACCGCGATGG TCCTGAGTTG
CGCCCGCAAC GTTTGCCAGC CCTCAGCCTT GCCGAAATTA AGCTTGATAA TCGGTTGGTT
GTTGATTTAC GTTCGGCCAA TGATTTTGCT CAAGCCAGCA TTCCAGGCGC ACTCAGCCTG
ACCAGCGGCG CGATGCTCAA TCGCTGGGCC GGTTGGTTTG TGCCGGTCGC AAGCAAGGTT
GTGTTAATTG GCACTGCCGA AGCGGCCAAA ACTGCCCAAA CCGAGCTAAG CATGATCGGC
ATCGACCAAA TCGAAGGCTA TATCACGCCC GATATGCTAA CCGAATGGCT CAAAACCAAC
CCAAGCCAAA GCTATCAACG GGTGCCAGCC GCCAATTTGA GCGAACAACT TGACCAGGCG
TTTGTGCTCG ATGTGCGCAC GCCCGAAGAA TATGCCAACG GTCACGGAGC TAAGGCGGTC
AATATTCCAT TGAATGAATT ACCCAAGCGC CTGGCCGAAA TTCCCAATGA TCAACCGTTG
ATCGTGCATT GTCAGGCGGG TGGGCGTTCG CCAATTGCCA TGAGTTTGCT CAAACCACAT
TTCAGCCAAG CCATGGTTGA GATGAGCGAC GGCTGGAATG GTTGGTATCA ATTGAAAAAG
GAGCGTCACG TATGA
 
Protein sequence
MFFQRLYHDG LAQASYLVGC QKTGEALVID PHRDSAVYLN AAQAAGLRIS AITETHIHAD 
FVSGSRELAQ ITGAKLYLSA AGTAAWQYAF ASSNPNVQLI NDGDSWMVGN LKIEVLHTPG
HTPEHVIFML TDTPAGDEPM GIFTGDLLFA GDVGRPDLLE KAAGIQGTSE AGAKDLFKSL
QRIASLPDFL QVWPGHGSGS ACGKALGAVP QSTLGYEKKV NWAFKQQNEA DFVAAVLDGQ
PTPPPYFAAM KRINRDGPEL RPQRLPALSL AEIKLDNRLV VDLRSANDFA QASIPGALSL
TSGAMLNRWA GWFVPVASKV VLIGTAEAAK TAQTELSMIG IDQIEGYITP DMLTEWLKTN
PSQSYQRVPA ANLSEQLDQA FVLDVRTPEE YANGHGAKAV NIPLNELPKR LAEIPNDQPL
IVHCQAGGRS PIAMSLLKPH FSQAMVEMSD GWNGWYQLKK ERHV