Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4866 |
Symbol | |
ID | 5736712 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 6199364 |
End bp | 6201346 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641282032 |
Product | amidohydrolase |
Protein accession | YP_001547624 |
Protein GI | 159901377 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTTTAC AACAAATTGA TACCCTGCTG CTTAATGGCA TCGTCGTCAC AATGGACGCA GCAGGCACGA TCATTCGTGA TGGTGGGGTG GCAATTCAGG CTGGTGCAAT CGTCGAGGTC GGGCCAAGCA GCCAGTTGCG CGAACGCTAT ACCGCCAGCC AAACGATCGA CTGTAACAAT CATGCCATTG TGCCTGGCTT AATCAACGGC CATGCTCACG TTCCAATGAG TTTGTTGCGC GGCATCGTCG CCGATCAGCA GCTGGATGTG TGGCTCTATG GCTATATGTT TCCGGTCGAG AGTCGCTTTG TTAACCCCGA ATTTGTCTAC CATGGCACGC GCCTCTCGTG TGCCGAAATG ATCAAGGGCG GCATTACCAG CTTCGTCGAT ATGTATTATT TCGAGGAAGA AGTAGCCCGC GCCGCTGATC AAGCTGGGAT GCGGGCGATC TGTGGCCAAA CCGTGATGAA ATGGCCAACC CCCGATGCTG CCTCTTACGA CGAAGGTCTA GAGCGCACTC GCCGCTTTTT CGAGCAATGG AAAGATCATG GTCGGGTGAT TCCGGCGATT GCGCCGCACG CACCCTACAC CTGTAACAGC ACCATCTATC GCGCTGCCGC CGAATTGGCT CGCGAATTCG ATGTGCCGCT AGTAACTCAC CTCAGCGAAA CCGCCCGCGA AGTCGAAGAA GCCCGCCAGT TGGTCGATAA ATCGCCAATC GCCTATGTCG CTGATTTGGA TGCCTTCACC GATAAAGGGA TTGCGGCCCA CTGTGTACAC ATCGACAAAC GTGATATGCA ATTACTCAAA AAGCATAACG CAGGAGCCGT GCCCTGCCCA ACCAGCAACC TCAAACTGGC GAGCGGCGTG GCCAAATATG GCGAAATGCT GCAAGCTGGT GTGAATGTAG GGCTTGGCAC TGATGGCCCC GCCTCCAACG ACGACCAGGA TTTGTGGTTG GAAGTGCACT TAGCAGCCAT CTTGCCCAAG GGTGTCACGG GCGATCCCAC CGTAGTCAAC GCCAAGCAGG CTTTTGCTAT GGCTACGAGC ATGGGTGCTA AAGCTGTGCA CCTTGATCAT TTGGTTGGCA GCGTCGAAGC GGGCAAACGC GCCGATATCA CAATTGTTGA TTTGGGTGGA TTGCACGTCG TGCCAGCTCC AGCCTATAAC TACAGCAACG ATTCAATCTA CAATCACTTG GTCTACTCAG CCCGTTCGGG CGATGTACGC CATGTGTTGA TCGATGGTGC TTGGGTACTG CAAGATCGCC AATTGTTGAC GCTTGATGAA ACCGAAGTTC GGAGCAATGC CCTGCGGATC GCCGAAACGA TCAATCAATT CCTCAGCCAA CGTGAAGTCA ATTTGTTGGA TAAAATTCTG GCGATCGGTG GAGTTAAACA AGCCGAAATC TTCGAAGTTC AAGTCAAGGC GCGTTTGGAA GATGCTAGCA GCGTTGAAGC TTTCCTCGAA TCAGAGGCGG TCGAAATTAC CAAGGCGAGC GAACGCAAGC AATACGACAC CTACTTCAGC TTCGATGATC CTAGTCGTGG GCGGATTCGT TACCGTGAAG ATCATCGGGT TGATGGCTCG CGGCTTGAGC CAAAATACAA TTTGACTTTG ACCATGCCCA ATGAGCGTGA AGATTTGCCC TCGGCAGTCT TGCTTTCACG GGCACGCTAC ACCGCCCCAG CCGATCGTTC GTTACGCTTC TATCGTGAAT ATTTCTCGCC CGATCATGTG ATCGAAGTTG AGAAATATCG CCGCCGCTGG CGGATTATGT ATGGCGAGGT TGATTTTGCG ATCAATATCG ACATGATCAC CAGTGGCCCC GCCAAAGGCA CCTATTTAGA AATCAAGAGC CGTACTTGGT CGGCCCGTGA TGCTGAAGGC AAAACCCAAA TTATCAGCGA ATTGTTGCAG ATGGCCGGAG TCAAGCCCGA CCACATTATC AAGCAAGAAT ACGTTGAACT AGCCCAAGCT TAA
|
Protein sequence | MALQQIDTLL LNGIVVTMDA AGTIIRDGGV AIQAGAIVEV GPSSQLRERY TASQTIDCNN HAIVPGLING HAHVPMSLLR GIVADQQLDV WLYGYMFPVE SRFVNPEFVY HGTRLSCAEM IKGGITSFVD MYYFEEEVAR AADQAGMRAI CGQTVMKWPT PDAASYDEGL ERTRRFFEQW KDHGRVIPAI APHAPYTCNS TIYRAAAELA REFDVPLVTH LSETAREVEE ARQLVDKSPI AYVADLDAFT DKGIAAHCVH IDKRDMQLLK KHNAGAVPCP TSNLKLASGV AKYGEMLQAG VNVGLGTDGP ASNDDQDLWL EVHLAAILPK GVTGDPTVVN AKQAFAMATS MGAKAVHLDH LVGSVEAGKR ADITIVDLGG LHVVPAPAYN YSNDSIYNHL VYSARSGDVR HVLIDGAWVL QDRQLLTLDE TEVRSNALRI AETINQFLSQ REVNLLDKIL AIGGVKQAEI FEVQVKARLE DASSVEAFLE SEAVEITKAS ERKQYDTYFS FDDPSRGRIR YREDHRVDGS RLEPKYNLTL TMPNEREDLP SAVLLSRARY TAPADRSLRF YREYFSPDHV IEVEKYRRRW RIMYGEVDFA INIDMITSGP AKGTYLEIKS RTWSARDAEG KTQIISELLQ MAGVKPDHII KQEYVELAQA
|
| |