Gene Information |
Locus tag | Haur_0024 |
Symbol | |
ID | 5736858 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 31252 |
End bp | 32151 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641277145 |
Product | amidohydrolase |
Protein accession | YP_001542804 |
Protein GI | 159896557 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
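The coordinate, gene-length, and protein-length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed 900 bp) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Haur_0024.
length = gene_length(31252, 32151)
print(length)                  # 900
print(protein_length(length))  # 299
```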
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATACCA ATGCCCATAC TCACTTAGAA CTTAGCGCCT TGGCCGAGGT TTGTCCCACT GGCCAAGAAT TCGGCTCATG GCTACAAACC ATGCTTGCGC GGCGCATGCA GCTTGATAAC GCTACCCTCG AACGTGGTAT CCACAAGGCT ATCGAACAAC TCCAGCAAAA CCAAACCACC ACAGTTGGCG ATATTTCAGC CACCCGATTA AGCATCGAAC CCTTGCTCAG CAGCGGCTTG GCGGGCGTGG TCTATCTCGA AGTGCTGGGC TTTGATCCAA CGGTGGCGCT GGAGCGATTG AGCGCCGCCC AACGCGAAAT CGACACATGG CGCAAACGGG AAACCGCCAT GAAAATTGGC TTGAGCATTC ATGCACCCTA TAGTTGTGCG CCAAGTTTGT TCGAAGCTGC GGCGCGTTGG TGTCGAGCTG AAGCTGTGCC TTGGGCCATC CACATTGCCG AATCGCCTGC CGAAGTAGCT TTTTTGCAAC AAGGCATTGG CTCATTACGT GAACTCAACC GCCGCATAAC CCCCCATATC GAATGGCAAG CTCCGCAATG TTCGCCAATC GCCTATCTCG AACGCTTGGG CGTGCTCGAA GCCCAGCCTG TGTTAGTGCA TGCCGTCCAA GTCGATGCCC ATGATCTGGC GTTAATTGCT CAATATGATT GTGCTGTCGT GCATTGCCCG CGCTCTAACC ACAACCTCTT ATGTGGGCGC ATGCCGCTTG AACAGATGCT GGCCCAAGGC ATTCGGGTTG GCTTAGGCAC TGATAGCCTG ACCTCCGCCC AATCGCTGGA CATGCGCGAT GAAATTAGCT TTGCTCAGCA ACTTCATGCC TACAAAGTTG CGCCAAACAT CATTGAACAA TTAGCAACTC AACCGAGCAT TTTGGGCTAG
Protein sequence | MYTNAHTHLE LSALAEVCPT GQEFGSWLQT MLARRMQLDN ATLERGIHKA IEQLQQNQTT TVGDISATRL SIEPLLSSGL AGVVYLEVLG FDPTVALERL SAAQREIDTW RKRETAMKIG LSIHAPYSCA PSLFEAAARW CRAEAVPWAI HIAESPAEVA FLQQGIGSLR ELNRRITPHI EWQAPQCSPI AYLERLGVLE AQPVLVHAVQ VDAHDLALIA QYDCAVVHCP RSNHNLLCGR MPLEQMLAQG IRVGLGTDSL TSAQSLDMRD EISFAQQLHA YKVAPNIIEQ LATQPSILG
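The sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how such a field might be cleaned, how a GC fraction could be computed from it, and how the gene sequence relates to the protein sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), and every name in it is hypothetical. Only the first 30 bases of the gene sequence are used in the example.

```python
from itertools import product

# Codon-to-amino-acid mapping in the conventional "TCAG" order.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the block-of-ten spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_30 = clean_sequence("ATGTATACCA ATGCCCATAC TCACTTAGAA")
print(translate(first_30))  # MYTNAHTHLE
```

The translated prefix matches the first block of the protein sequence. Passing the full cleaned gene sequence to `gc_content` would give a value to compare with the GC content field.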