Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4641 |
Symbol | |
ID | 5736488 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5929015 |
End bp | 5931447 |
Gene Length | 2433 bp |
Protein Length | 810 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281805 |
Product | hypothetical protein |
Protein accession | YP_001547400 |
Protein GI | 159901153 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0331671 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATCTT TTTTTCGTCG GCAACGCCCG ACATCCCTCC GCGAACGAGG TAAAGGCAAA GTTCGCTGGA CAATCTGGGA GTTAACAGCG CTGGTTGTTA GCATCGTTAT GATGGCAACC CCTTTGTTAG CTGCGATTAG CAGTGCTATT TTTCCATTTA ATGCTCAGGC GCAAGCCACG ATTCGGCCAA CGTCAGCAGC AACACCGGTT ACGCCGTTAC CACCAACCGC CACCCAGACC TATCCGGTGC CAACCGAAAC ACCGCTTGTC ACGCCGTGTC CAACGTTTGA GCCAACCGAA ACACCAACCG AAACCGCCAC AACCTATCCT ACACCTGGCA CACCTGGCAC ACCGACCGAT ACGCCAAGTG TCACGGATAC GCCAACGACA ATTATTACGT CAACCAATAC ACCAGCTGTG ACCGATACGC CGAGCGTCAC CGACACCCCA AGTGTGACAG ACACGCCGAG CGCACCAACG GCTACGACGA TCATCACACC AACTGAAACC AGTGTTGTAA GCCCAACGAT ACCCCGTGGT GGGATGAGTG GAAGCACGCG TTTGCATCGG CCTTTAGCCC AAACGCCTGA TCCATGTGCA ACACCACCAA CGATTGTCAT TCCACCAACC GATACCATTC CACCTGATGA AACACCAACG GGCACAATCA CTGGTCCGGT TACACTGACC CCAGCAACGC CAACCATTGA TTGTTTGGTT GCAAGTTGTA CGCCAGTTCC AACCCTACCA ACTGATATTG CCACGGCAAC CGAAACCACC TTGCCAGCCG ATGAGCCAAT CGCGGTGGGC AAGAGTAGTT CGCGGGCGCT CGTGCAGCCT GGCGAAACAT TTAGTTATAT CATCACGGTA AGTTTCCAAG ATAATGGCGA TGGGCAAACT TCGCGCTCAG TCAGCATTAG CGACCCATTA CCAAGCCAGG TAACCTTTAT TGCAGTTCAA CAACTTGGCA CAGCGACCTG CGTTGGTGGC ACAACCGTCA ACTGTAATGG GACGGTCAGT GCAGGTAACC CAATTGTGGT CACGATTCAA GTCCAAGTTA ATGCCAGCGT GGCATTGGGC ACGAATATCG TTAATATCGT TAGTGCCACG GCGGCCAATC GCACGTTGCA AGCAAGCGAT ACCGTGATTG TGCCTGATAC CTTACCAACC AGCACGCTTG GCACAGCTGG CCCAAGCTTC ACGCCAATTG TTGTGACCAA TACACCAATT ACGCCAATTG TGACCAATAC TCCGATTACG CCAATTGGTG TAACCAATAC TCCACCTACC AGCACACCAG TCACACCAAT TGGTGCAACC AACACACCTG GTGGCTCAAC CAGCATACCT GTTACGACTG GTCCAAGCAA TACGCCACGG CCAAACCAGC CGAGCAATAC GCCACGGCCA AATCAGCCGA GCAATACGCC ACGGCCAAAC CAGCCGAGCA ATACGCCACG GCCAAATCAG CCAAGCAACA CACCACAGCC GAATCAGCCA AGCAACACAC CACGGCCTGA TATTACGCCA GTGCCAGCAA CCAATGTGCC TGTTGCCACG GTTGTTCCAC CAAGCAACCC AAGCGCAACG CCACGCCCAG GTGTGCCAGT GCCTTCGGCA ACTCAGCGCC CAGGCGGTGG TTCAAATCCA AGTGCAACGC CACGGCCAGG CGCACCAGTG CCTTCGGCTA CCAATGCACC AGCTGGCAGC CAACCAACCA ATGCACCCGC GCCAGCTACA GCAACGCCAA CCAATCCAGC TGGCTTCGTC ACCGATCCAA TCGTGACTGG CTTGCAGTTC CAAAAGAAGA GCGATTGGGG CAGCCGCTTT GCAGGCGAAA GTTTGATTTA CACAATCACG ATTATCAGCC CAACTAATTC GTTGAATGCT GGTACGATGC GTGATGTCGT GGTGGTTGAT CAATTGCCAA GCAACTTGGA AACCAATGGC CCAATCAAGG TCAGCGACCA AAATGCACGG GTTGAACAAC AAGGCAACCA AATTACCGTG CGGGTCGGGG TCTTGCCAGC AGGCCAAACC TTGACAATCA TGATTCCAGT CAAGATCAAA GATGGTGTGG CTGCTCAAAC GCGGATCGTC AACCAAGCTC AGTTGAATTT CACTGGCTTG GCCCAGCCAA TCTATTCGAA TATTTCGAGT GTTTTGGTGG TCGGCGAAGC TCCTGCAGTC AGCGCCACAG CAGTTCCTAA GGGCAATGTC GGCGGCGGTG CTGCAACTGC CAACCCAGCA ACCGTCACGC CAAATACTGG CATTGGCGGC GGCCAAGGTA GTGGCGATGG CACTGGCGCA ACCGATGTTG GGGTTAGCAA CCCAGCTACG AATATGGGTA TTCCAGCAGC AGGCTTTGTG CTCTTCGCCC TGACGATGTT CGTTCACGTT ATACGGGTTC GCCGCGAAAT GACGCGGATC TAA
|
Protein sequence | MKSFFRRQRP TSLRERGKGK VRWTIWELTA LVVSIVMMAT PLLAAISSAI FPFNAQAQAT IRPTSAATPV TPLPPTATQT YPVPTETPLV TPCPTFEPTE TPTETATTYP TPGTPGTPTD TPSVTDTPTT IITSTNTPAV TDTPSVTDTP SVTDTPSAPT ATTIITPTET SVVSPTIPRG GMSGSTRLHR PLAQTPDPCA TPPTIVIPPT DTIPPDETPT GTITGPVTLT PATPTIDCLV ASCTPVPTLP TDIATATETT LPADEPIAVG KSSSRALVQP GETFSYIITV SFQDNGDGQT SRSVSISDPL PSQVTFIAVQ QLGTATCVGG TTVNCNGTVS AGNPIVVTIQ VQVNASVALG TNIVNIVSAT AANRTLQASD TVIVPDTLPT STLGTAGPSF TPIVVTNTPI TPIVTNTPIT PIGVTNTPPT STPVTPIGAT NTPGGSTSIP VTTGPSNTPR PNQPSNTPRP NQPSNTPRPN QPSNTPRPNQ PSNTPQPNQP SNTPRPDITP VPATNVPVAT VVPPSNPSAT PRPGVPVPSA TQRPGGGSNP SATPRPGAPV PSATNAPAGS QPTNAPAPAT ATPTNPAGFV TDPIVTGLQF QKKSDWGSRF AGESLIYTIT IISPTNSLNA GTMRDVVVVD QLPSNLETNG PIKVSDQNAR VEQQGNQITV RVGVLPAGQT LTIMIPVKIK DGVAAQTRIV NQAQLNFTGL AQPIYSNISS VLVVGEAPAV SATAVPKGNV GGGAATANPA TVTPNTGIGG GQGSGDGTGA TDVGVSNPAT NMGIPAAGFV LFALTMFVHV IRVRREMTRI
|
| |