Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5091 |
Symbol | |
ID | 5737049 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 116616 |
End bp | 118904 |
Gene Length | 2289 bp |
Protein Length | 762 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641282256 |
Product | hypothetical protein |
Protein accession | YP_001547847 |
Protein GI | 159901601 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCGG TATCCCCTCT ATCGAGCACG CCACAGCATG GGTGGTTTGA ACACCTTCAT TTGCGTCGCT ATTCGCGAAC TATGCTGTTT GATATCATGG TATTAATCGC CTGTATCATG CCGGGCATTA TGCTCTCACA CCAGCAACTG CAAGAACCAT GGAATGCGTT TCGTGTCGCA CGGGCACAAC TGTCCACGAA TCAAGGGTCC GTTGCCGACT ATACGACTAG TGCAGGGGCG TATCGAGCAG AACATGTAGG CAGTGACCTC ATTCATACGG CGCTGATTAA TGCAACGGAT TGGCCACTCG ATGGACTCGT CTTAGCCCCC ATTGGGGCAT TGATGTTAGC ACTGGCCTAT TATGGGATTG CGATCTTGTT AAGCCCGTCA CGACGCTGGG CCGCAGGGAT TGCCTTATTT AGCAGTTGGT ATTATCCCGG TATCTATAGT CAATTTGGGA CACAAACCTA TGTCTGGGTG TATAGTCTGC TGATCGGCTT TCTGATTGTT TTTTTTCAAT GGTTGAATCA TCCAAAACGG AGCTATGAAT GGCTGCTCAC GGCGATCTTT CTAGCAACCT TTTTGCATTA CCATACAACC CCACTCTGGA TGATTGGCTT AATGATCATT GGGACTGCTG CTAGCAAACG TGCCCAACGA ACAACCTCCT CGTCCTATCG CTATTCATGG CTGCTCCCGC TCAGTTGGTT GGCGTGGGAT ATCACGTTTG AAAGTGTCTT ACGGAATGGC TGGCAGCAGC TTCAAGCCGT TAATACGAAT ACGATTGGTC AAAGTTTTCT GAGTAAAGTC TTTGGGCCGT TGCTCCAACG CACACCCGCA GGGCTTGATC CGTTTGAAAT TGCCCCCATC AACCCACCAC TTGCAACATG GAGCACCTTG CTCTGTTTGC TGATCCTCAT CGTGCCCGTG GGATGGTGGA TGATCACACA GCTACACGTT GCCCGTAAAA CGCGTGATTT CCATGCCCTC ATCGCAACAC CCCGGCACAT CTTTATTTGG GCCATTATCG GTATTGCGAT GGCGCATACC ATCATGTATT TGTTGTATGG AGCATTAAGT TTTCGGGTTA TCCCGGTATT GTTTCCCTTA TTAATTCCCA TGATTTGGCG CGATGCTCCT TGGAAATCAT GGAGCATGGG CTTGCTCATC CTGCTGGTAG GAAGCGCGAT AGTGGGCTTT TTCAGTTTCG CCCCAACGAT CCAGCCCGAT ATGACCGCCG CCCAAACGGG CCGTGCCAGT CGGTTGCTTG CGAAGAACAG TACAATACTC AGCGATGCGA ACCTCTACGG ATCGTTGGCG CTGCAACGAG CGATCGATAA TCAAACGATC AATCTTGCGT GGATTGATGC TCCGCTCTAT CGCGCCATGG TGGACGGACA ACCCCTTCCA ACATCGATTG ATGCCATTGC GATCGCCAAA ACCACCAAGC CGTTAATTAC ATCCCACTGG CAGTATTTTG CAGCGTGGAT GGTCTCGAAC CCCATGATCG AACACCATCC ATCGATGAAT CAAGTGTATG CAAGTTCGGT GTTAAGTGTC TTACAACCGA TCAATCATGC GCTTCCAATG ACCGTCCCAT CAGACCCACC AACAAAATCG CTCATGTGGT TTGTTCCAGC ACTGCGATTA CTGGGGGCGA TCCTTGGGCT GTTTCTCATC CCTGGCCTTG CGATCTTTTG GGTTGCCCAA CGGAAACAGA TCTTTGGTGC GGATCTCGAT ATTCGCACGA TCCTTGCCTG CACGATTGCG TTGTCCGTTT TGAACCTGAC CGTGATGGGG TATCTCGTGC ACTTAAGTGG GATGAACATT AATGGGATTG TCGTGCTAAC GATGATGCTC CCCGTTGCGA GTCTTATCGC ACTCCGGCTA AAGGGACAAC GGTTCATGAT CGCATTTCGC TGGTGGCGGT ATGGCATCAG CATTGTGATT GTTCTTGGGT GTTGGCTTGG ATTTGCAACC CATGGAGTGC TTGCGCAACC CACGGCTATG CCCGCACTTG AAGTCTTTGC AACCCAAGCG ACGACCGGAT CATTAACAAT TGAGGTCGCG AATAATACAG ATACGGCGAC TCCTGCAACG GTGGTTATTG AAACGCCGGA GGGGAGCGTG CTCATGACCA CCCAGCAAGT CCTTCCTGCC CATCGGGTCA CTGATATTCC ATGGGTGATT CCCGCAACCG CAGCGCATCA GCCAGCGGTC ATACGCGTGA TAACGGATCA GCAGCCTCCA TTGACCTTGT GGTTTGCTCA GGTACCAACA GGAAAATGA
|
Protein sequence | MSAVSPLSST PQHGWFEHLH LRRYSRTMLF DIMVLIACIM PGIMLSHQQL QEPWNAFRVA RAQLSTNQGS VADYTTSAGA YRAEHVGSDL IHTALINATD WPLDGLVLAP IGALMLALAY YGIAILLSPS RRWAAGIALF SSWYYPGIYS QFGTQTYVWV YSLLIGFLIV FFQWLNHPKR SYEWLLTAIF LATFLHYHTT PLWMIGLMII GTAASKRAQR TTSSSYRYSW LLPLSWLAWD ITFESVLRNG WQQLQAVNTN TIGQSFLSKV FGPLLQRTPA GLDPFEIAPI NPPLATWSTL LCLLILIVPV GWWMITQLHV ARKTRDFHAL IATPRHIFIW AIIGIAMAHT IMYLLYGALS FRVIPVLFPL LIPMIWRDAP WKSWSMGLLI LLVGSAIVGF FSFAPTIQPD MTAAQTGRAS RLLAKNSTIL SDANLYGSLA LQRAIDNQTI NLAWIDAPLY RAMVDGQPLP TSIDAIAIAK TTKPLITSHW QYFAAWMVSN PMIEHHPSMN QVYASSVLSV LQPINHALPM TVPSDPPTKS LMWFVPALRL LGAILGLFLI PGLAIFWVAQ RKQIFGADLD IRTILACTIA LSVLNLTVMG YLVHLSGMNI NGIVVLTMML PVASLIALRL KGQRFMIAFR WWRYGISIVI VLGCWLGFAT HGVLAQPTAM PALEVFATQA TTGSLTIEVA NNTDTATPAT VVIETPEGSV LMTTQQVLPA HRVTDIPWVI PATAAHQPAV IRVITDQQPP LTLWFAQVPT GK
|
| |