Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3941 |
Symbol | |
ID | 5735802 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4937690 |
End bp | 4940035 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281092 |
Product | hypothetical protein |
Protein accession | YP_001546703 |
Protein GI | 159900456 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.254734 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCGCA ATAGGAAGCG CAGACGAGTA CTATCCTCGT TTCTTATTAC TCTTACACTT TTGGTCACCT TTATCTCTTC GGTTTCGGCA ATCTCACTCA ATGCTACCGT TTCACTCAGC TCCGGTGATT TTGCCAGCGG CTATTTTGGT CTAACCGGCC TAACCCAACA AGATCAGGTG ATCGGTGATG TTACCTATGG CGGGGTGCAG TTGATCCCTC AGGGTGCGTT GGCGCAATGG AGTGATGCCA GTAATCAACT TTGTCGGACT TTGGCCGATA TGGGGACTGT TAGTTTCAAG CAGCACCTCT ATGCGATTGG TGGCTCGACC GCATCAAGCG GGAATGTAGC GGTTGTGTCT GAGGTCTGTC GAGCAACCGT CACTGATACT GGGGGTGAAA CGACCGAGTG GACTGAACTT CCTCAAACCT TACCAGTTCC CCTGACGCGG ATGAGTACGG TGGTGGTGAC GAACACCAGT GATCCTACTA AGGGGATTAT GTATACCTTT GGTGGCCAAA GCGCCTCAAC CGGCGATATT GAGTATACCG ATAAGATTTA TTCGAATGTG ATCAATGCTG ATGGTTCATT GAACACTTGG CAAACTCAGG CGTTGACCAC TGGCGAAAAA CTGATCAATA CCACAGCAAC CGCCTACACC ACACCCAATG GTCAAACCTA TATCTACCTA ATTGGTGGTA AAACGCGCGA TACATCGGCA CTCTTCTCGC CAATTTATGT GCGCCGATCA GTTCGACGAA CCTTGGTAGG GCCAAATGGG GTTTTGGGGC CATGGCAGTC AATGCCCGAT TTGCCAATTA CCCCCGATAT GTTTACCCCA ACCAATGGCT GTGATGAGAA CGTTGGTTTG CATAGTATGG ATGTTGCCAA CTTTGATGCA ATTACGCTTA CCAGCACCTA TCGGGCCTTT TTAGTGGTTG GTGGCACTTT CGAGTTGGGC ACAGGCCATG TCGCTGTTGG TTGTACCCGC ACGGTCGAAG GTTCTGCTCA AGCGATGTTG GGCAAACTCG ATACGAATGG GATGCTGACT TGGGAAACCC AGCGCTATAT CTTGCCTGAA CCGCTTTCTA GCCCCCGCGT GATTGGGGTC AACCAAAAGA TCTATGTGGT TGGGGGCCGC CAAGGCAGTG CTGGTGATCC CACCCATCGG ATTTATACTT CCTATATCAA TATCGATAAC TTTACCTTAC CTGTGTTCGG CCAAAGCAAC TTCCGCGTTT CGGAAAATGC GCTTTTGACC TCGCAAGCAC GCTCAGGTCA CGGTCTTGAG TTAATCTATA TCAACTTCCG ACCAGTTGCC TACATGTATG GTGGGATTCG GGTTGGGAAT ACCTATCAAC AAGATGTGTT GTTTGGCTTT GTTGGGACAG ATGCCGATAT CGACTTGACG GTTGGGGGTT ATCCTTCACC AGGGGTTTAT CGCTCATCAC CACTCCAACT CCGTGCTCCA GCCATCATTG AGCAAATGCT GTGGGATGCG ACGCTGCCAA ATCCACCTAT CAACACTGAT ATTCAAATGC AATATAAGCT AGCGGCAACC CGCGGTGCAC TGGAAACTGC CCAATGGCAA ACTGTTGATG CCTCGCCTGG GAATGATAAC TACTCGGTGC AAGGGCCGAA TGTTGCTAAT GGGACTCCCG CCGTCCAAGG CCAGTGGTTC CAGTATCAAG CATTGATGAC CACCCAAAGT CCGACTGAAG TTGGGGCAAC TCCCATTTTA CGCAATGTGC GGATTAAGTA TAAGGTTGAT GGTCACCCAA GCTTATACGT TGATTCGGCT ACGATGTCAA CTGTAAGCAC AACTGGCATT ACAGCCTTTA CGGCAACCTT TAAAAATGGG ATCAAACCTG GCTCAAATGA CACCGAAAAT GTGCTCGATG CCGATATTGA AAGCCAAGGC ACCTTCTTTG TGGACATGTA TCTCTTGCCA CCTGGCTCTG CCGATGTGCC ACCAGCTCGT GATCCTGATA GCGGTGCCTA CCCACTAGGG ATGGTCTTTA CCGAGATTAA TCGCTTGAAT TTGCCCCAAG ATGGTGAATT TACGCTTGAT GCAATTTCGG ATAATACCAT TTGGCGCAGA ACATGTCCCG CAGCCACTGT CGATTGTCCG TTAGTCGTCT GGCAGGCACT CTTCAATAAA ACGGGGACAT GGAAGGTCTA TTTGGTGATT GATAGTGGTA ATTATGTAAC CGAGGCTGAA ACACCAGCCG GGCAACGTGA GTTGGATAAC GTTTATTCGT TCAATGTTAA CTCAACGGTT GTTGGGAGCA CAATTCACAT GCCGGTGGTC GGGATTAACT TCTTGGCTAC GCCACCGCAA CCATAA
|
Protein sequence | MVRNRKRRRV LSSFLITLTL LVTFISSVSA ISLNATVSLS SGDFASGYFG LTGLTQQDQV IGDVTYGGVQ LIPQGALAQW SDASNQLCRT LADMGTVSFK QHLYAIGGST ASSGNVAVVS EVCRATVTDT GGETTEWTEL PQTLPVPLTR MSTVVVTNTS DPTKGIMYTF GGQSASTGDI EYTDKIYSNV INADGSLNTW QTQALTTGEK LINTTATAYT TPNGQTYIYL IGGKTRDTSA LFSPIYVRRS VRRTLVGPNG VLGPWQSMPD LPITPDMFTP TNGCDENVGL HSMDVANFDA ITLTSTYRAF LVVGGTFELG TGHVAVGCTR TVEGSAQAML GKLDTNGMLT WETQRYILPE PLSSPRVIGV NQKIYVVGGR QGSAGDPTHR IYTSYINIDN FTLPVFGQSN FRVSENALLT SQARSGHGLE LIYINFRPVA YMYGGIRVGN TYQQDVLFGF VGTDADIDLT VGGYPSPGVY RSSPLQLRAP AIIEQMLWDA TLPNPPINTD IQMQYKLAAT RGALETAQWQ TVDASPGNDN YSVQGPNVAN GTPAVQGQWF QYQALMTTQS PTEVGATPIL RNVRIKYKVD GHPSLYVDSA TMSTVSTTGI TAFTATFKNG IKPGSNDTEN VLDADIESQG TFFVDMYLLP PGSADVPPAR DPDSGAYPLG MVFTEINRLN LPQDGEFTLD AISDNTIWRR TCPAATVDCP LVVWQALFNK TGTWKVYLVI DSGNYVTEAE TPAGQRELDN VYSFNVNSTV VGSTIHMPVV GINFLATPPQ P
|
| |