Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4284 |
Symbol | |
ID | 5736143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5470500 |
End bp | 5472068 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281444 |
Product | hypothetical protein |
Protein accession | YP_001547044 |
Protein GI | 159900797 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAACCT ACCCTTGTAA GCTAACCTCT CCCGCACGCG AGCGAGGGGG AATTGATACC ATCATCACAG GTGGAATGCC CCTCTCCCGC CGCAGTGGGC GAGGGGTCGG GGGTGAGGGA AAGCCATTAC ACATTGAGGA ATCTAGGCCC ACTATGAAGC GTCTTTTATC TCAAGCCGGG CCGGTGGTGA TTGTGGCCCT GTGGCTGGCG CTGCTGCCAA TTAGTTTGTT TCGGCTGTAT GCCACCGATG AAGTGCAATA CTTTGCCTAT TTGCGCTCGG TCTATTTCGA TGGCAATTTG GATTTCGCCA ACGAATATGG CTATTTTGCC GATCTCGGCA TGCAAAAGGG CGATCCAGCA GTCTATAATG CGCTGCTCAA AGATCGTTCA AGCGATCCGC CGCTCAACCC TATCACAGGC TTATATCGCA ATGTTGCGCC TGTTGGCTCA GCAATCTTGT GGTCGCCGTG GTATGTGGTT GCCGATGGTT TGGTTGGCGT GGGTATCTTT GGCGATGCGC CACGCGATGG ATTTAGCCAG CCCTACATCA TTGCCGTGTG CTTAGCATCG GCCTGCTACA CGCTGTTTGG TTTGCTGCTT TCCTATCGTT TGGCGCGGCG TTGGGTTGGT ATGTGGGCAG CCACATTAGC CACCTTGAGC ATTTGGCTAG CCTCGCCATT GATTTGGTAC ACCTACATTC AAGTGCCTTG GTCGCATGGC GCGGGCTTTG CGATGGTGGC CTTGTTTATC ACGATCTGGC TTGGGCCTAC CGATCAACCG CTGTTAGCCC AAGGTTCGCA GCGTTCATGG GTGCGTTGGC TGGCCTTGGC CATCGTCGGC GGCTTGATGA CACTGACGAG GGAACAGCTT GGCCTATTTT TGCTCTTGCC AGCAGTTGAA GGTTTAGTCG CCTATGCTAG CTTAATTCGC CAAGGTCAAT GGCTGCAAGT GCGCCAACTT TTGGCTAAGC ATGTATTTTT TGTGTTGATA TTTGCCCTGA GCCTTGCTCC ACAGTTGATC AGCTACAACA TTTTGTATGG CCAGCCCAAG CCGTCAGGCA CGGTTTCGGG CAAATTGAAT CTGATCAGCT ATAAATTTTT GCATACCTTG TTCGACCCAC GGCGGGGAGC GTTTATGTGG CATCCGCTGT TGCTGGTCGG CTTAGCTGGC TTGATTTGGC TCTGGCGCAA GGATCGGCTG CTGACTGGAT TGCTCAGTTT AGGCCTATTT GCCCAAATTT ATCTGAATGG GGCGTTTGGC TCGACATGGC ATTTGCAAGG CTCGTTCGGC TTTCGGCGCT TGATCGAATG CACGCCAATT TTTATTATTG GTTTGGCATT ATTGATCGAG CGAATTCGCT GGCCCAAAGC GGCGATTGCC AGCCTAGCAC TCGTATTCAT TGTTTGGAAT GGCGGCTTAA TTTTTCAAGC GGCGACTGAC CGCGAGATTC GTGGGCCAGG CTTGCGCTGG AATACCATGC TCGCTGATCA GCTTAAAGTG CCGCAATTGG TTTGGCAAAA AGCCGATCAA CTGCTGTTTA ATCGCTGCGA AGTCGTTAAA AATTGCTAA
|
Protein sequence | MQTYPCKLTS PARERGGIDT IITGGMPLSR RSGRGVGGEG KPLHIEESRP TMKRLLSQAG PVVIVALWLA LLPISLFRLY ATDEVQYFAY LRSVYFDGNL DFANEYGYFA DLGMQKGDPA VYNALLKDRS SDPPLNPITG LYRNVAPVGS AILWSPWYVV ADGLVGVGIF GDAPRDGFSQ PYIIAVCLAS ACYTLFGLLL SYRLARRWVG MWAATLATLS IWLASPLIWY TYIQVPWSHG AGFAMVALFI TIWLGPTDQP LLAQGSQRSW VRWLALAIVG GLMTLTREQL GLFLLLPAVE GLVAYASLIR QGQWLQVRQL LAKHVFFVLI FALSLAPQLI SYNILYGQPK PSGTVSGKLN LISYKFLHTL FDPRRGAFMW HPLLLVGLAG LIWLWRKDRL LTGLLSLGLF AQIYLNGAFG STWHLQGSFG FRRLIECTPI FIIGLALLIE RIRWPKAAIA SLALVFIVWN GGLIFQAATD REIRGPGLRW NTMLADQLKV PQLVWQKADQ LLFNRCEVVK NC
|
| |