Gene Information |
Locus tag | Haur_1138 |
Symbol | |
ID | 5733030 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1300871 |
End bp | 1301941 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278277 |
Product | DNA/RNA non-specific endonuclease |
Protein accession | YP_001543914 |
Protein GI | 159897667 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1864] DNA/RNA endonuclease G, NUC1 |
TIGRFAM ID | |
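The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one untranslated terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1300871, 1301941)
print(length_bp)                  # 1071
print(protein_length(length_bp))  # 356
```

Both values agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields in this record.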
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0536941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGGTC TGCGTCGGCT TCTTGGTTTA CTCGTTTTAG GATTGTTGGC GCTAGGCCTG CGTCAACCGA CTACTCAAGC GGCTCCCAAC GATAGTTTAC ACCTCACGCT GGGCAACCCT AGCAATGCTG TAGCTGATCC TTTAGTGCCA AATAATTATT TAATTGAACG TACCCAGTAT GCCTTGGCCT ATCAGCGCGA TGCGGGAATT CCAGCATGGG TCAGTTGGCA TTTGGAAGTG CAAGATTTGG GCAGCACTTC GCGCGGCGAT TTTGCGGTTG ATACCAGCTT GCCCAGTGGT TGGTATCGTG TCGCGACCAG CGACTATAGC GGTAGCGGCT ATGATCGTGG TCATATGACC CCATCGGGCG ATCGCACCAG TTCGCGGGCA GCCAACGATC AAACCTTCAT TATGTCGAAT ATTATTCCTC AAGCGCCCGA CAACAACCAA GGCCCATGGA ACGATTTGGA AAACGATAGT CGTACCTGGG CACGCGCTGG CAATGAGTTA TACATTATTA GTGGTGGCTA TGGTACGAAA GGCACGATTG CTAGTGGTCG GGTGTTAATT CCTGCGGTAA CGTGGAAGGT GATTGTGGTG CTGCCAGTGG GGAATGATGA TGCCAATCGT GTGGCCGCCA ATACCCGCGT GATCGGGATT TGGATGCCAA ATGATCAAGG AATTCGGAGT AATGCTTGGG AGCAATATCG GGTCAGCGTC GATTATATTG AAAGTATGAC TGGCTACGAT TTTCTCTCGA ATGTACCAAC TGCGATTCAA GCGATTGTCG AGGCGCAGGT TGATGGTTCG CCGGTAGTAA CCGTCACGCC TGGAACGGTT TCGCCAACTG CGACGCGCAC GCCAACCGCG ACTCCAACTG CTACCCGTAC CGCTACCCCT ACAGCAACGA CTAGCGCAAC CGCTACGCCA AGCAATACGC TTACCCCAAC TGCGACAGTA ACCACAATTC CTAGCGTAAC CCCATCGATC ACGGCCAGTG CTACGCCTTC AGTCAGCGCA ACTGTGGTTA TTCCTCAATA TTATTCGTTT TTGCCCTACG TCACTCAATA A
|
Protein sequence | MTGLRRLLGL LVLGLLALGL RQPTTQAAPN DSLHLTLGNP SNAVADPLVP NNYLIERTQY ALAYQRDAGI PAWVSWHLEV QDLGSTSRGD FAVDTSLPSG WYRVATSDYS GSGYDRGHMT PSGDRTSSRA ANDQTFIMSN IIPQAPDNNQ GPWNDLENDS RTWARAGNEL YIISGGYGTK GTIASGRVLI PAVTWKVIVV LPVGNDDANR VAANTRVIGI WMPNDQGIRS NAWEQYRVSV DYIESMTGYD FLSNVPTAIQ AIVEAQVDGS PVVTVTPGTV SPTATRTPTA TPTATRTATP TATTSATATP SNTLTPTATV TTIPSVTPSI TASATPSVSA TVVIPQYYSF LPYVTQ
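The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch in Python. It assumes the sequence is given 5' to 3' on the coding strand, as shown in this record, and uses the amino acid assignments of translation table 11, which match the standard genetic code (table 11 differs in its permitted start codons, which this sketch does not handle; the gene here starts with ATG). The helper names are illustrative, and only the first 30 and last 21 nucleotides of the gene are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-nt blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of the sequence."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


head = "ATGACTGGTC TGCGTCGGCT TCTTGGTTTA"  # first 30 nt of the gene
tail = "TTGCCCTACG TCACTCAATA A"           # last 21 nt of the gene (in frame)

print(translate(head))   # MTGLRRLLGL
print(translate(tail))   # LPYVTQ
print(gc_content(head))  # 0.5
```

The two translated fragments match the start and end of the protein sequence above. Running `gc_content` on the full gene sequence gives a value that can be compared with the GC content field of this record.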
|
| |