Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3058 |
Symbol | |
ID | 5734930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3863914 |
End bp | 3865581 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280202 |
Product | hypothetical protein |
Protein accession | YP_001545824 |
Protein GI | 159899577 |
COG category | [S] Function unknown |
COG ID | [COG5267] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.417569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTGT CGCGTCGCCA GTTATTGTTG GGCGCTGCGC TAGGGGCCAC AGGCGCAGTC GTTCATGAGG ATGTTGGGGC AGCGCCGCTG CAACCCTCGA CAACTGAAGC CATGGCAACT CCGCCATTCG AGGTCATCGC GCTCTCGCGC ATGGCCTACG GAGCACGTTC GGGCGATTTC GCCCGAGTTC GTAGTATGGG TTTAACTGCC TATGTCGATG AGCAGCTTAA CCCTAATTTC AACAACGACA CCGATTGTAA CACCCGTATC GCCAACGCCA CCTTGCGCAT TGTCTACGCC GCTGGCACGG GCTTTCCGGC CATGGATGAA ATGCGTGGCC TTGTTACCCT CAACAAAACC CAGCCCGAAT TATGGGAATT GCGCGTACAT CCAGCCAACG CTGAGCGCAT TCGCCCAATC GATGAAGTGG TAGCCGCCAA TTGGATTCGC GCAATCTACA GTAAATGGCA ACTATTCGAG ATTATGACCG ATTTTTGGCA TAATCACTTC AATGTCTGGG CCTATAGCGA TACGCGCATC TCTTCGCTTT GGCCCTATTA CGACAAAAGC GTGATTCGCG CCAATTGTTT TGGTAACTTC CGCAGCTTTC TCGAAGCGGT GGCCACCAGC CCAGCCATGC TGTATTACCT TGATAATGCG ACCAGCCGCG ATGGCCCCGC CAACGAAAAT TATGCCCGTG AACTATTTGA ATTGCACACC TTTGGTTCGC AAAATTACCT CAATAATATC TACGACAACT GGCAAGAAGT ACCGCGCGAT TCGCAAGGCC GCCCAATCGG CTATATCGAC CAAGATGTGT ATGAGGCGGC CCGCGCCTTT ACAGGCTGGA CGGTGGCCGA TGGCACAGGC GGCATTCCCA ACACGGGTTT ATTCCATTAT CTCGATACAT GGAACGATAA TGCCCAAAAG ATTGTGCTGG CCAACTTCTT AAATGCCAAT GCTGGCCCCC AAGCTCATGG CAAAAAGGTT TTGGATCTTG TAGCCCAACA TCCGGCCACA ATTCGCAATC TTTGTACCAA ACTGTGTCGC CGTTTGGTCA GCGACAATCC ACCAAGCACG CTGGTAGATA AAGCAGTCGC CACATGGACA GCCAACTATT CAGCCCCTGA TCAGATTAAG AAAACCATTC GCACAATTTT GCTAGCTCCC GAATTTTTGA GCACATGGGG TGGCAAGATT CGCCGCCCGA ATGAAGTTGT TGCCGCCTAT TTGCGCTCAA CTGGAGCCGA AGTTAAACCG AGCGCCGAGC TATTTAGCTG GGTTACCTTG GCGGGGTATC GCATGTTCAA TTGGGCTACA CCCACCGGCC ACCCCGACGA AAGCGGCTAT TGGAGCAGCA GCAACGCCCT GCTCAATACC TGGAACTTGT TATTCCACTT GCAGCAAAGC TACTTCCCGC CTGCAACCTT CGATTTGCAA GGCCAAATGC CGGGCAGCGT CACCACCGTG CGCCAAATCG TCGATTTCTG GATTATGCGC ATGCTTGGCT ATCAACCTTC AGCCTTGGTC AAAACCAAAT TGCTCAAACT GATGGGCCAA AATGGCAATC TCGATCAACC ACCAACTGGA ACCGCCAATG ACGTTAAATT ACGCTTGAGC AGCCTTGTGC ATATGATCGG CATGCTCCCT GAATTTTATA CGCGCTAG
|
Protein sequence | MSLSRRQLLL GAALGATGAV VHEDVGAAPL QPSTTEAMAT PPFEVIALSR MAYGARSGDF ARVRSMGLTA YVDEQLNPNF NNDTDCNTRI ANATLRIVYA AGTGFPAMDE MRGLVTLNKT QPELWELRVH PANAERIRPI DEVVAANWIR AIYSKWQLFE IMTDFWHNHF NVWAYSDTRI SSLWPYYDKS VIRANCFGNF RSFLEAVATS PAMLYYLDNA TSRDGPANEN YARELFELHT FGSQNYLNNI YDNWQEVPRD SQGRPIGYID QDVYEAARAF TGWTVADGTG GIPNTGLFHY LDTWNDNAQK IVLANFLNAN AGPQAHGKKV LDLVAQHPAT IRNLCTKLCR RLVSDNPPST LVDKAVATWT ANYSAPDQIK KTIRTILLAP EFLSTWGGKI RRPNEVVAAY LRSTGAEVKP SAELFSWVTL AGYRMFNWAT PTGHPDESGY WSSSNALLNT WNLLFHLQQS YFPPATFDLQ GQMPGSVTTV RQIVDFWIMR MLGYQPSALV KTKLLKLMGQ NGNLDQPPTG TANDVKLRLS SLVHMIGMLP EFYTR
|
| |