Gene Information |
Locus tag | Haur_5106 |
Symbol | |
ID | 5737064 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 139311 |
End bp | 141884 |
Gene Length | 2574 bp |
Protein Length | 857 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641282271 |
Product | hypothetical protein |
Protein accession | YP_001547862 |
Protein GI | 159901616 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.141867 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGGGTTT ATAACTATGT GCTTGTTTTG ATGCTGCTCG TGGCGGTGCC GCGTGCCGCG CTGGCAGTGG CGACACCGCT CACCGTGACC ACGGCAGACC AAACCGTGGT GTTGGCGGGT GTGCCGTCCC AACCGACCAC CGTGGCCGTG CGCGGCCTGC ACGCTGCCGC GCCATCCCTG ACGATCACCA TGGACACAAC CACGCTGCCT GAGGCCGATC CGGCACGATG GGCCGACCTA CCCACCAGTC CCGTGACCCT GCTGCGCAGC GGGCGCTATC GCGGCTATGG CGTATGGGTC TATCTGGTGC AGCCGATGAT CCAGCAACGC GGCCAGATCC AGCAGGTGAC GCAGCTGCAC GCCGTGCTCG ATGGGGCGGT GCTGGTGGCA TCCCCTGCCG ATCTCCAAGC GTTGCCGCGT GTGCCGTTTG GTGATCCCGT GCCGCCGACC AACCCATTGG CTTTGGCCGA TCACACCTGG ACGCTGACCG TGACCGAGCC GGGCATGCAG CGGGTCACGG GGGCGATGCT GGCCGCAGCG GGCATCGACC TCACGACCCT CACTCCGGCG ACGGTGCAAG TGCAGCACCA TGGCGTAGTG CTCCCGCTCG ATTGGCGTGG GGTTGGCGAT GGCGTGGTCG ATGCGCAGGA CGAAGTGCGC TTGTGGGTGG ATAGCGTCGG CGACCGCTGG AACCGTGCTT CGACCCTGTG GCTCACGACC GCTCCCGCCA CCCCGTCGCC GCCCATGGCC TCGCGCCTTG CCCTCGCCAG CACCGCGCCG TTGACTGATA CGGTCTGGAT GACGCAAACG TGGGATGATC CGCAGATCCT CGACAGTCGC CATGCGGGCA TGCGCGGCTG GCACACCTTC AGCACCCGCC TGAGCAGCCT CGCGGGTGGC GATGCGCAGA CAATGACGAT CCCAGTGACC GCAACGTTGC CATTGGCCAG CGGCCTGATG ACCGTTACCC TGCGCGGCGC GACGGCGACC AACCTACCCG TGCCGCTCGT GGTGAATACC GTGCCGCTGA CCGTGCCTGC GACTGCGGCA TGGCAAACGA CGCTGGCCGT CTCCACGAGC GCGGCCATCA CGGTGACGCT GCCTGCTCCG GCCATCGGCG CGGCCAGTGT GCTGCTCGAA ACCATCACCG TGACCCGTCC GACGCGGCTC GCGACCATGC CGACTGACCC ATGGGAGAGT GGGTCAACGC CTGCGCGGTA TGCCCTGCCG GACGCGCCGC CGCTGCGCAC GCTCTATGAT GTGACCGAGG TACAGTCGCC GCAGATCGTG ATCCTGCCTG CCGGACCCAC GCCCGTGCTG GCTGATCCGT TGGTCAATCG GCGGTATCTG CTGGTTGGTG CGTCGCCACT GCCCACGCCA ACGCTTACCC GCCATACCCC TATCGTGCTG CTCACCGTCG GATCGGACGT GATCATCGCC CCACGCGCCT TCCTGCCCGC CCTCGACCCG CTGCGTACAC CGACCACGGT GCTGGTGGCG CGGGAGGATC TCGATGCGGC ATGGGCGTTT GGCCACGTAT CCCCGATGGC GATTCGCACC TTTCTCCAGC ATGCAGCGGC GACGTGGCCA AGCCCGCCCA CCAGTGTGCT GCTGGTCGGC GATGGCACGA CTGACCCGCG TGATGTGCTT GGCTATGGTC AGCCACCGCT GATCCCGCCT TATCTGGCCG AGGTCGATTT ATGGCTGGGC GAAACCGCCT GTGAAGCCTG TTATGGCCAA TTGGACGGCG CTGATCCACT CAGTGACCTG CTACCCGATC TGCCCGTGGG CCGCTGGCCC 
GCCACAACGG TGGAGGACGT GACCGCCCTG ATCGCCAAAC AGCAGCGCTA TGCGGCGGCC CCGTGGGGGG CGTGGCAAAG CACAGTCGGG AGTCTTGCTG ATAATGCCGA AGGAGCGCTC GACTTTCCGC AACTGGCGGC GCAGAGTGAG GCCGTCTATC CCCTGACGAT GACCTTGCAT CGCGCCTATT ACGCCCCACA GGCCACGAGC ATTGCCCCCG CGTGGCATGA AGCGGATGCT CGTGCCGTGC GCGAGCGCGT GCTGGCGATC TGGCAGGCGG GGGCGATGCT GATGCAGTAC ACGGGCCATA GCCACGCCTA CCAATGGGCC GTGACTGATC CCGTGGTCGA GCCGCGTGGG TTGCTCGATC TGAATGCGGT GGGCGACCTG CACAATGGCG AACGCCTGCC GCTGCTGCTG GCCTTGACCT GCCTGACCAG TGCCTTTCAT CAGCCCAGCC CCCGTGGAAC CACGCTCGAT GAGGCGCTGG TGCTGCATCC TGACGGCGGG GCGCTGGCAA CCTGGGGATC AAGTGGTTTG GGGGTCGCCC ATGGCCATGA TCACCTTCAG CATGGGCTGG TAACGGCGGC GCTGACGATG CCTCGGCCAA CCTTGGGGCA GGTGACGGAG GCGGGAGTGC TTGAACTGGC GCTCACGGGG CACTGTTGTA CCGATGCGCT GCGCACCACC CTGCTCTTGG GGAATCCGGC CACGGTACTG CGGGTTGCGC CTACGCCGCA GCAGGTGTGG CTGCCATTGG TCGGGTGGGA ATAA
Protein sequence | MRVYNYVLVL MLLVAVPRAA LAVATPLTVT TADQTVVLAG VPSQPTTVAV RGLHAAAPSL TITMDTTTLP EADPARWADL PTSPVTLLRS GRYRGYGVWV YLVQPMIQQR GQIQQVTQLH AVLDGAVLVA SPADLQALPR VPFGDPVPPT NPLALADHTW TLTVTEPGMQ RVTGAMLAAA GIDLTTLTPA TVQVQHHGVV LPLDWRGVGD GVVDAQDEVR LWVDSVGDRW NRASTLWLTT APATPSPPMA SRLALASTAP LTDTVWMTQT WDDPQILDSR HAGMRGWHTF STRLSSLAGG DAQTMTIPVT ATLPLASGLM TVTLRGATAT NLPVPLVVNT VPLTVPATAA WQTTLAVSTS AAITVTLPAP AIGAASVLLE TITVTRPTRL ATMPTDPWES GSTPARYALP DAPPLRTLYD VTEVQSPQIV ILPAGPTPVL ADPLVNRRYL LVGASPLPTP TLTRHTPIVL LTVGSDVIIA PRAFLPALDP LRTPTTVLVA REDLDAAWAF GHVSPMAIRT FLQHAAATWP SPPTSVLLVG DGTTDPRDVL GYGQPPLIPP YLAEVDLWLG ETACEACYGQ LDGADPLSDL LPDLPVGRWP ATTVEDVTAL IAKQQRYAAA PWGAWQSTVG SLADNAEGAL DFPQLAAQSE AVYPLTMTLH RAYYAPQATS IAPAWHEADA RAVRERVLAI WQAGAMLMQY TGHSHAYQWA VTDPVVEPRG LLDLNAVGDL HNGERLPLLL ALTCLTSAFH QPSPRGTTLD EALVLHPDGG ALATWGSSGL GVAHGHDHLQ HGLVTAALTM PRPTLGQVTE AGVLELALTG HCCTDALRTT LLLGNPATVL RVAPTPQQVW LPLVGWE
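
The coordinates and lengths in this record are related by simple arithmetic, and the sketch below shows one way to cross-check them. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`clean_sequence`, `gene_length`, `protein_length`) are hypothetical helpers written for illustration and are not part of any database API.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split())


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(139311, 141884)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values from this record, the computed lengths agree with the listed Gene Length (2574 bp) and Protein Length (857 aa). `clean_sequence` can be applied to the Gene sequence and Protein sequence fields before measuring them with `len`.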