Gene Haur_4866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4866 
Symbol 
ID5736712 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6199364 
End bp6201346 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content53% 
IMG OID641282032 
Productamidohydrolase 
Protein accessionYP_001547624 
Protein GI159901377 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTTTAC AACAAATTGA TACCCTGCTG CTTAATGGCA TCGTCGTCAC AATGGACGCA 
GCAGGCACGA TCATTCGTGA TGGTGGGGTG GCAATTCAGG CTGGTGCAAT CGTCGAGGTC
GGGCCAAGCA GCCAGTTGCG CGAACGCTAT ACCGCCAGCC AAACGATCGA CTGTAACAAT
CATGCCATTG TGCCTGGCTT AATCAACGGC CATGCTCACG TTCCAATGAG TTTGTTGCGC
GGCATCGTCG CCGATCAGCA GCTGGATGTG TGGCTCTATG GCTATATGTT TCCGGTCGAG
AGTCGCTTTG TTAACCCCGA ATTTGTCTAC CATGGCACGC GCCTCTCGTG TGCCGAAATG
ATCAAGGGCG GCATTACCAG CTTCGTCGAT ATGTATTATT TCGAGGAAGA AGTAGCCCGC
GCCGCTGATC AAGCTGGGAT GCGGGCGATC TGTGGCCAAA CCGTGATGAA ATGGCCAACC
CCCGATGCTG CCTCTTACGA CGAAGGTCTA GAGCGCACTC GCCGCTTTTT CGAGCAATGG
AAAGATCATG GTCGGGTGAT TCCGGCGATT GCGCCGCACG CACCCTACAC CTGTAACAGC
ACCATCTATC GCGCTGCCGC CGAATTGGCT CGCGAATTCG ATGTGCCGCT AGTAACTCAC
CTCAGCGAAA CCGCCCGCGA AGTCGAAGAA GCCCGCCAGT TGGTCGATAA ATCGCCAATC
GCCTATGTCG CTGATTTGGA TGCCTTCACC GATAAAGGGA TTGCGGCCCA CTGTGTACAC
ATCGACAAAC GTGATATGCA ATTACTCAAA AAGCATAACG CAGGAGCCGT GCCCTGCCCA
ACCAGCAACC TCAAACTGGC GAGCGGCGTG GCCAAATATG GCGAAATGCT GCAAGCTGGT
GTGAATGTAG GGCTTGGCAC TGATGGCCCC GCCTCCAACG ACGACCAGGA TTTGTGGTTG
GAAGTGCACT TAGCAGCCAT CTTGCCCAAG GGTGTCACGG GCGATCCCAC CGTAGTCAAC
GCCAAGCAGG CTTTTGCTAT GGCTACGAGC ATGGGTGCTA AAGCTGTGCA CCTTGATCAT
TTGGTTGGCA GCGTCGAAGC GGGCAAACGC GCCGATATCA CAATTGTTGA TTTGGGTGGA
TTGCACGTCG TGCCAGCTCC AGCCTATAAC TACAGCAACG ATTCAATCTA CAATCACTTG
GTCTACTCAG CCCGTTCGGG CGATGTACGC CATGTGTTGA TCGATGGTGC TTGGGTACTG
CAAGATCGCC AATTGTTGAC GCTTGATGAA ACCGAAGTTC GGAGCAATGC CCTGCGGATC
GCCGAAACGA TCAATCAATT CCTCAGCCAA CGTGAAGTCA ATTTGTTGGA TAAAATTCTG
GCGATCGGTG GAGTTAAACA AGCCGAAATC TTCGAAGTTC AAGTCAAGGC GCGTTTGGAA
GATGCTAGCA GCGTTGAAGC TTTCCTCGAA TCAGAGGCGG TCGAAATTAC CAAGGCGAGC
GAACGCAAGC AATACGACAC CTACTTCAGC TTCGATGATC CTAGTCGTGG GCGGATTCGT
TACCGTGAAG ATCATCGGGT TGATGGCTCG CGGCTTGAGC CAAAATACAA TTTGACTTTG
ACCATGCCCA ATGAGCGTGA AGATTTGCCC TCGGCAGTCT TGCTTTCACG GGCACGCTAC
ACCGCCCCAG CCGATCGTTC GTTACGCTTC TATCGTGAAT ATTTCTCGCC CGATCATGTG
ATCGAAGTTG AGAAATATCG CCGCCGCTGG CGGATTATGT ATGGCGAGGT TGATTTTGCG
ATCAATATCG ACATGATCAC CAGTGGCCCC GCCAAAGGCA CCTATTTAGA AATCAAGAGC
CGTACTTGGT CGGCCCGTGA TGCTGAAGGC AAAACCCAAA TTATCAGCGA ATTGTTGCAG
ATGGCCGGAG TCAAGCCCGA CCACATTATC AAGCAAGAAT ACGTTGAACT AGCCCAAGCT
TAA
 
Protein sequence
MALQQIDTLL LNGIVVTMDA AGTIIRDGGV AIQAGAIVEV GPSSQLRERY TASQTIDCNN 
HAIVPGLING HAHVPMSLLR GIVADQQLDV WLYGYMFPVE SRFVNPEFVY HGTRLSCAEM
IKGGITSFVD MYYFEEEVAR AADQAGMRAI CGQTVMKWPT PDAASYDEGL ERTRRFFEQW
KDHGRVIPAI APHAPYTCNS TIYRAAAELA REFDVPLVTH LSETAREVEE ARQLVDKSPI
AYVADLDAFT DKGIAAHCVH IDKRDMQLLK KHNAGAVPCP TSNLKLASGV AKYGEMLQAG
VNVGLGTDGP ASNDDQDLWL EVHLAAILPK GVTGDPTVVN AKQAFAMATS MGAKAVHLDH
LVGSVEAGKR ADITIVDLGG LHVVPAPAYN YSNDSIYNHL VYSARSGDVR HVLIDGAWVL
QDRQLLTLDE TEVRSNALRI AETINQFLSQ REVNLLDKIL AIGGVKQAEI FEVQVKARLE
DASSVEAFLE SEAVEITKAS ERKQYDTYFS FDDPSRGRIR YREDHRVDGS RLEPKYNLTL
TMPNEREDLP SAVLLSRARY TAPADRSLRF YREYFSPDHV IEVEKYRRRW RIMYGEVDFA
INIDMITSGP AKGTYLEIKS RTWSARDAEG KTQIISELLQ MAGVKPDHII KQEYVELAQA