Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4296 |
Symbol | |
ID | 5736155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5485030 |
End bp | 5486958 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281456 |
Product | hypothetical protein |
Protein accession | YP_001547056 |
Protein GI | 159900809 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAAC GTTGGATGTT ATTCGTGCTG ATGGTTGCAA GTTTGGGCTT GCCGCAAGCC AGCAGGGCCG CCGAACAACA ATGTTTCGAG CAAACGGGCT TTTGTATCGA AGGCCGCTTC GCCGAATATT GGCAGCAAAA TGGCGGCTTG CAGGTCTTTG GCTATCCACT GAGCAGCGCC CAAGATGGCT ATAATCACGA TAGTCAAAAA GCCTTTTTAA CTCAGCAATT TGAGCGTGCT CGCTTTGAAT TGCACCCTGA GTTTGCTGCA CCCTACGATG TGCTACTTGG TCGTTTGGGC GATGATCTGC TGCGCTATCG CAATATTGAT AGTCCAATGT TGCCACGCGA GGCTGGCGCA ACATCAGGCT GTTTGTGGTT TGAAACAACT GGACACAACG TCTGTAACCA AGCCAATGGC CTCGGTTTTA TGAGCTATTG GCAAAACCAT GGCCTCAACG ATCCCAAACT TGATGCCTTT GGCCGTTCAT TGCAATTATT TGGCTACCCG CTGACTGAGC CAGCCATAGA AACCAACGCC AATGGTGATC GCGTGCTCAC CCAATATTTC GAACGTGCCC GTTTCGAGTG GCATCCAACC CAGCCCGATC AATTTAAAGT GCTGCTAGGT TTAGTCGGCA AAGAATCGCT ACAATTGCCC TATGGCGCAA GTGCTGAACC ATTAGAATTG ACCTTGGTTG GCGATACGCT CTTTTTCACC GCCGACGATG GCGTGCATGG CCGCGAATTA TGGACAAGCG ACGGCACCGA AGCTGGCACA CGCTTAGTCA AGGATCTGAA AGTTGGCGCT GAATCAAATT GGCCATATCA ACTAGCAGCA GCCAAGTACG GTGTCTTTTT CTTGCAATTG AACGAGACGG CCTATGAATT GTGGTTTAGC GATGGTAGCG CAGCGAATAC CCGAATGGTT AAGTCAATTC AAGCTAATTC CAACCCATAT ATAACCAACT TGATCGCTTT AGGTGATGGC GTGCTCTTTT TCGCCAACGA TGGGCTGAGT GGCCTTGAGC CATGGTATAG CGATGGCACC GCGGCTGGCA CTCGTTTGCT CCGCGACATT AATCCAGGTG CAGGCCACTC AAATGTTGAA ACTGGCTATA GCTATTTTTG GACAGAATAT ACGCCCATGG CGGGCGGAAT GGCCTTCTTG GCGAGAAACG CTCAGGCGGG TGCACAAGTT TGGTGGACTG ATGGAACTGA GGCAGGCACA CGCCAAATCA GCAATTTTGC TGGTGAAATC CATAGTGTGT TTGAGTTAGA GCCTTTGGAT GCCCAGCATT TGATCGTGGC AGCCAGCAAT GAAGGCACAT TCGGCATCTG GAAACTGAAC ATGAGCACTG GCGAGCAACA AAACTTGGCT ACTTACCCGT GGATTACCGG CAGTATTCGT AATCCAATTC CGGCTTCGCA ACTCACTCAG GTTAACGGCG AGGTATTTTA TACGGTGATA AGCCAAGGCG TTGGCACAAG CCTCTGGCGC ACCAATGGTC AAAAAGCCCA AGTAGTTGAT TTAGCTGGCA AGAGTCCTTA TCGCCTGCTA AGCGCCAATA ATACATTCTA TCTCCAGCTT TACCAAGGCA CTGAACAGGC TGGCTGGTGG ATACTGAATG CAGATGCCAG TTTGCGCCAA TTCAGCTCAC TCGACCTCAT GCTTGCGGAT ACTGGCCAAG GCTTGATTGG CTGGGAATTG CTCAGCAATC GGGTGCGGGT TTACCGCAGC ACCGCCGATT ATCAAGCGAT GCACTATCAA GGCTCAGTCT TGAGTCAATC CAGCTTTACC TTCTATCCAT CAGATACGGC CAGTAATTCC CAACGCAGCT TTTTTAGCTT ACCTGATCAG CAGCATGGCA CTGAGCTTTG GGTCAGCGAT GCTCAAGGCC TGCGCTTAGT CAAGGATATT CGGCCATAA
|
Protein sequence | MMKRWMLFVL MVASLGLPQA SRAAEQQCFE QTGFCIEGRF AEYWQQNGGL QVFGYPLSSA QDGYNHDSQK AFLTQQFERA RFELHPEFAA PYDVLLGRLG DDLLRYRNID SPMLPREAGA TSGCLWFETT GHNVCNQANG LGFMSYWQNH GLNDPKLDAF GRSLQLFGYP LTEPAIETNA NGDRVLTQYF ERARFEWHPT QPDQFKVLLG LVGKESLQLP YGASAEPLEL TLVGDTLFFT ADDGVHGREL WTSDGTEAGT RLVKDLKVGA ESNWPYQLAA AKYGVFFLQL NETAYELWFS DGSAANTRMV KSIQANSNPY ITNLIALGDG VLFFANDGLS GLEPWYSDGT AAGTRLLRDI NPGAGHSNVE TGYSYFWTEY TPMAGGMAFL ARNAQAGAQV WWTDGTEAGT RQISNFAGEI HSVFELEPLD AQHLIVAASN EGTFGIWKLN MSTGEQQNLA TYPWITGSIR NPIPASQLTQ VNGEVFYTVI SQGVGTSLWR TNGQKAQVVD LAGKSPYRLL SANNTFYLQL YQGTEQAGWW ILNADASLRQ FSSLDLMLAD TGQGLIGWEL LSNRVRVYRS TADYQAMHYQ GSVLSQSSFT FYPSDTASNS QRSFFSLPDQ QHGTELWVSD AQGLRLVKDI RP
|
| |