Gene Information |
Locus tag | Haur_2707 |
Symbol | |
ID | 5734588 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3460627 |
End bp | 3462438 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279850 |
Product | hypothetical protein |
Protein accession | YP_001545473 |
Protein GI | 159899226 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
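The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Both are assumptions inferred from the values in this record, not documented properties of the database. The function names are hypothetical.

```python
# Hypothetical helpers; the coordinate values are copied from the record above.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3460627, 3462438)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1812 603
```

Under these assumptions the computed values agree with the Gene Length (1812 bp) and Protein Length (603 aa) fields listed for Haur_2707.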
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0029913 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATACGAA CACATGCTTC AATCGAATGG CTGGCCAGTT TACCTACCAG CGAACGCCAT GTGCTGGCAC ATTTATGGCA AGTTGCCGAT GCTGCTCAGC TTGGGCAACC AAGTGCCATC CAAGCCTTGG TTGCGCGTTT ATCCACGCCT GAACGCTCAG CGCTTGATCG GGTGATTGCG GCTGGTGGTA AGCTCGCGGC CAAATCGCTT GAACGTGAGT TTGGCAAAAT TCGCTCACAT CGGGATATTG TGACTCCTCG GGCCTATTTA TTGGCGCTGC ACGGCCAAGC TTCGGTGCTT GAACGTCTCT ATATTTTGGG CTTGCTTCAG CCACTCAAAA CGCTGGACGG CGAGTATTAC GTGATTTTTA GCGATTGGTT GCAGGCGTTG CCAGCAGTTT CGGCCCCCAG TTTGCCAACC TGGCAGCAAC ATCCCAGCCC AAGCAACTGG ATCGAAGCAG ATCTCAATCA AACCGAAACG CTGCTGACTA CGATCTTGGC GCTGTGTTAT CAGCAACCGC TCCAGCTGAC CCGCCAACTT CAACTTGAAC GCGATGGTTT GAAAGCAATC TGTCAACGCA TAGCAGTTTC TACGCCAGCA AGCGAACGCC AATTTCCTCA ATTAGCCTGG CTGCGAAATT TGGCGCTTGA GGCTGGCTTA TTACACATCC AACATCAACA GCTTCAGCTC GCTGGCAACC CAATTAATTG GCTAGAAGCT ACGCCCAAAC AGCGGTTAGA GCGCTTATTT AATGCCTGGT TGGTTTGTGA TTTTGATGAG TTTAGCTTGA CTGAGTTGCA GCCTCAGACT CCATTTACCC TCCAAGCTGC TCGCCAAGCC TTATGGCAAG TATTAACAAC CGCGCCACCC GATCAATGGT TGGCCTTCGA CGATCTACTG GCACAAATCC AAGCCTTACA TCCCGAACTG TTGCGCAGCG ATTTCGAGCA GCCCGTAATT CACAATCAAT CCAATGATTC ATTCGTTGGT TGGCAACATT GGGCCAAAGT TGAGGGCGCA TGGATCAAAG CCGCCTGCCA AGGGCCATTG TTCTGGCTCG GCTTGCTTGA TGTCGATCAA CTTAACCATC CACAGGCTTT GCGTTTAACT CAATGGGCAA GCTGCTTACT CGATCCAGCG CACGAGCCAA GTCAATTTGC TGGGCAACTA CAACTAAGCA GCGATGGCCT GATTCGGGTT CCACCAACGG TTGAGCCGCT GCCGCGCTTT CAAATCCAAC GCATCACCGA ATGGCAATCA ACCGACAGCC ATGGCACCAT GCTCGTGCGC TTGACCGCCC ATTCGTATAG CCAAGCCTTG CAACGTGGCA TTCAGGCCAG CCAAATGCAA ACATTTTTGC AACGCTGGTG TGACCGACCA GTGCCAAACG ATTTGCAAAG CTTATTTCAG CAATGGCAAA ACGATCGCCA GCACTTATTG GCTCGTCCGG CTTTATTGCT GGAAGCCGAT GATCCCAGAT TACTCAACGA GCTGGCTAAA CTGCCTAACT TACCACCCTA CGCCGAGCTT AATCCCCAAC TTTGGGAATT GGAAATAGCT GATAGTGCCG CATTAACCAA TCTGTTGCAT ACAGCAGGCT ATGCAATCAA CCCAGTCAGC GAGCCAGATC AACGGATCAG TGACCATGAT CTTAAACAGT TGATTACGGC CTTATTGACG GTTCAGCGTT TAGCGCCAAC TGTGGTCAGC CAAGCAGTGA TTGAGCGGGT GGTGCAGGCC TTGCCCAGCA GCGAACGCCA ACAGCTCACA GCCAACGTCA ACCAATGGCT ATCAATCATT 
AATCGAAGCT AG
|
Protein sequence | MIRTHASIEW LASLPTSERH VLAHLWQVAD AAQLGQPSAI QALVARLSTP ERSALDRVIA AGGKLAAKSL EREFGKIRSH RDIVTPRAYL LALHGQASVL ERLYILGLLQ PLKTLDGEYY VIFSDWLQAL PAVSAPSLPT WQQHPSPSNW IEADLNQTET LLTTILALCY QQPLQLTRQL QLERDGLKAI CQRIAVSTPA SERQFPQLAW LRNLALEAGL LHIQHQQLQL AGNPINWLEA TPKQRLERLF NAWLVCDFDE FSLTELQPQT PFTLQAARQA LWQVLTTAPP DQWLAFDDLL AQIQALHPEL LRSDFEQPVI HNQSNDSFVG WQHWAKVEGA WIKAACQGPL FWLGLLDVDQ LNHPQALRLT QWASCLLDPA HEPSQFAGQL QLSSDGLIRV PPTVEPLPRF QIQRITEWQS TDSHGTMLVR LTAHSYSQAL QRGIQASQMQ TFLQRWCDRP VPNDLQSLFQ QWQNDRQHLL ARPALLLEAD DPRLLNELAK LPNLPPYAEL NPQLWELEIA DSAALTNLLH TAGYAINPVS EPDQRISDHD LKQLITALLT VQRLAPTVVS QAVIERVVQA LPSSERQQLT ANVNQWLSII NRS
|
| |