Gene Information |
Locus tag | Haur_3849 |
Symbol | |
ID | 5735714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4832750 |
End bp | 4835197 |
Gene Length | 2448 bp |
Protein Length | 815 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281002 |
Product | hypothetical protein |
Protein accession | YP_001546613 |
Protein GI | 159900366 |
COG category | [S] Function unknown |
COG ID | [COG4485] Predicted membrane protein |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGCCA TGCAACCAAG CCTATTTTTG CGCCAGTGGC AACGCTGGTG GCCGTATCTC AGCATTACCT TTGTCGCCCT GCTTTTGCTG TGGCGGGTGG TGCTGGGCAA CATTTTCTTG CCGCTGGATA TCGTGGCCCA CCTGCATCCT TGGCGCTTTT CCTACGAACG GGTGGCGGTC AATAATCCAA TCAATAGCGA TCTGGTTACC CAAATTTACC CGCGCCGCTT GGTAACCAAC CAGATTCTTG AGCAAGGCGC GTTGCCCCTA TGGAACCCGA CGATTTTAAC TGGCACGCCG TTGTTGGCCG ATGGTCAGTT GGCCTTTTTC TACCCTTTGA GTTGGCTGTT TGTGCTGCTG CCAGTTGGCT ATGCCTTTGG GATTTATACG CTGTTGAATG TGTGGTTGGC GGGGATTGGC ACGTTTAAAT TTGCTCAACG CATGCAACTT GAGCCAATGC CAGCAACGCT CGCTGCCGTG GGCTATATGC TCAGTGGCTT TTTACTCAAT TGGCTGCATT TTCCCGAGTT TAGTGCGGCT TGTGCCATGC TGCCGTGGTG TTTTTGGGCG GTGCTGCGGG CCTGCCAAAG CCAACGTTGG CACGATTGGC TGCTCGCCAG TTTGGTGCTG GCCTTGCCGT TGGTCAGCCA AATTCAACTA GCCTTCTATG TGTATGTCGG GGTTGGCTGT TTGCTGCTGG CTCAACTGTT GGCGTTGCCA ACTTGGCGCT TACGCTTCCA ACAAATCGGC CAATTTAGCA GTGCAATTGG CTTGGCGCTC GGATTGAGTG CGGTGCAGTT GTTGCCACAA ATTGCCCTTT CGGCTCAGGG CCAACGCCTT GATATTGGCT CAGGGCTTGG CTCGGCCAGT TCAATCATGG TGTGGCTGCT GCGCTTGGCG TTGCCGATTG TTGATGGAGC CGCCCGCGAA ACTGCTAGCG CATGGCAACC ACATTTGTTG CAGGGCATTC AACCCTATGC AGGCATCGTA AGTTTGGCCT TGGCAGGCTT GGCGATTTGG CGCAGCAAAC AGCCTGGCGT GAGGCTGTTT GCCTGCTTGG CGCTTGGCTC GTTTGCGGTG GCGATTGGCA CACCGTTGCT CCAATTGCTG CTTTGGTTGG TTCCGCCCTA TCGCCAATTT GCTGATCATC AGCGGTGGTT TAGCCTGTGG GGTTTTGCGA TAGCCCTGTT GGCAGGCTTT GGCTTACAAC GTTTGCAGCA ACCGAGCAAC AAGCCAAATC GGGCGCTCTG GGTGCAACGC GGCCTCTTGC TGCTTGGATT AATTGGTGTG GCTGGCTGGG CTTTGCAACA TATCGCCTTA TTCACTGTCG ATTCGCGTTA TGCCCAATAT AGCACCATGC TGCGTTTGGC ACTCAACCCA ACCAGCCTTG CTATTCTGGG CTTGAGTGGT TTGGCATTGG TAGGGTTGTT GATCAAACGC ATTCCACGGC GTTGGAGCAA TCTTGCTGTT TTGTTGATTC TGCTGGGCGA TTTGCTTTGG TATGGTGGCA GCTACAACAC CAGCATTGAT CCTGCAATCT TCCAGCCAAC CGCTGATCAG CAAGCCAGTT TGGCTGCCGA GCCAGCCTTG CAAGATCCGG CGATTCTGTA TCCGCCAACT CGCCAGATCA ATTTTCTGCT GAGCCAACCC GGGGTGTTTC GCGTGTTTGG AGCCGATTAT CAAGCCATGC CGACGAATGT GTTTAGTGCT TTTGGACTTG AGGATATTCG CGGCTATCAA TCGCTCTATT TAGCCCAATA CAATCGGCTA ACGCGCCTAA TGGATGGCAA GGATTATCAT 
AAACTTGGCG AGGGTGGCAA CAGCCTACAC GCGTATTTCA CCATGGCCTA CAACCAGCGG CGTTTGCTGA ATATGCTGAA TGTGGAATAT CTGATTTTTA CGCCTAATAG CCCCAACCCT GAGTTGTATC AACCCTTAGA ATTGGTGCAA CGTAACGATG AAGGCACGAT TTATCGCAAT CCTGAGGTGT TGCCACGCGC CTGGATGGTC TATCAAACCG AGGTAATTAG CGACGAATTA GCCCAACTTG ATCGGCTGGC AGCTAACGAT TTTGACCCAG CCAAGCAAGC AATTGTGGCC GAGCCAATTC CGGCGCTTGG CCAAGCACCG AGCCAAACGC TGACTCCTAC GGTGAGCTAT GAGCCAAATC GGGCGCTGGT GCAGGTCGAA ACTTCAGCGG CAGGTTTGTT GGTTTTGGCT GATGCCTACA CCAACGATTG GCAAGTAAGC GTTGACGGCC AAACAGCTCA ACTCTATCGC ACCAATTATG CTTTGCGCGG CGTATGGGTT GATGCAGGCC AACACACAGT CGAATTCAGC TATCGACCCA AGAGCTTAAT CGTTGGTGGT TGGGTTAGTG GCCTAAGTTT AGCCCTGATT TTGCTTGGCC TAGCTTTGAG CTGGTATAAA ACGAGAAAGG CTGCATAA
|
Protein sequence | MSAMQPSLFL RQWQRWWPYL SITFVALLLL WRVVLGNIFL PLDIVAHLHP WRFSYERVAV NNPINSDLVT QIYPRRLVTN QILEQGALPL WNPTILTGTP LLADGQLAFF YPLSWLFVLL PVGYAFGIYT LLNVWLAGIG TFKFAQRMQL EPMPATLAAV GYMLSGFLLN WLHFPEFSAA CAMLPWCFWA VLRACQSQRW HDWLLASLVL ALPLVSQIQL AFYVYVGVGC LLLAQLLALP TWRLRFQQIG QFSSAIGLAL GLSAVQLLPQ IALSAQGQRL DIGSGLGSAS SIMVWLLRLA LPIVDGAARE TASAWQPHLL QGIQPYAGIV SLALAGLAIW RSKQPGVRLF ACLALGSFAV AIGTPLLQLL LWLVPPYRQF ADHQRWFSLW GFAIALLAGF GLQRLQQPSN KPNRALWVQR GLLLLGLIGV AGWALQHIAL FTVDSRYAQY STMLRLALNP TSLAILGLSG LALVGLLIKR IPRRWSNLAV LLILLGDLLW YGGSYNTSID PAIFQPTADQ QASLAAEPAL QDPAILYPPT RQINFLLSQP GVFRVFGADY QAMPTNVFSA FGLEDIRGYQ SLYLAQYNRL TRLMDGKDYH KLGEGGNSLH AYFTMAYNQR RLLNMLNVEY LIFTPNSPNP ELYQPLELVQ RNDEGTIYRN PEVLPRAWMV YQTEVISDEL AQLDRLAAND FDPAKQAIVA EPIPALGQAP SQTLTPTVSY EPNRALVQVE TSAAGLLVLA DAYTNDWQVS VDGQTAQLYR TNYALRGVWV DAGQHTVEFS YRPKSLIVGG WVSGLSLALI LLGLALSWYK TRKAA
|
| |
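The length fields in the Gene Information table can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4832750
end_bp = 4835197

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced ("Is gene spliced | No") and its
# final codon is a stop codon that does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2448 0 815
```

Under those assumptions the computed values agree with the reported gene length (2448 bp) and protein length (815 aa).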