Gene Information |
Locus tag | Haur_5066 |
Symbol | |
ID | 5737024 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 79105 |
End bp | 81528 |
Gene Length | 2424 bp |
Protein Length | 807 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 641282231 |
Product | hypothetical protein |
Protein accession | YP_001547822 |
Protein GI | 159901576 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.493331 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCATCGAT GGCGTTGGTT TGGATTGGTT GGACTGCTGC TCGGACTGCT CAGTGGACTG CCCGTGGCCG CGATGGACAC GCAGGATGGC TTGCCCACGG TCGCAGTGGG GATTGCCAGT CCCACAGCAG GGCAGGTGCT GCATGCGCCG AGTACGACCG TGACCGGAAC CGCTACGGCG GGATCGCCCG TGACGGTGTG GCTCGATGGT CAGGTTGTTG AGACACTGGA GGCGGGCGCG AATGGCCAGT GGAGTCTGCC TGTCAGTGGC CTGACCCACG GGACGCATAC GCTATCGGCC ACCAGTACGC TGGATGCGGG CACATGGCTG TACATGCTCG ATCCGTCGTG TGCGGTTGGC AGGGAGACCT GTGTGCGGAT TGCTGATCCG GATACCGCGA CGATCCACAC GACGATTCAC ACCAGCCCGG ATTCGGCGGG TGGTGAGCTG CTGATTGCCC ATCCGACCAT GGATCGGCTG TATATCGGCC ACTCGACGGG GGTGGATGTC TATACCCACG ACGGAACCTT CATCACGCGG ATTGCGCTGC CCAAGACGGT GCGCATGGGT GCGTTGACCC CGAATGGCCG CGAGTTGTGG GTTCCGCAAG ATGCGAGTGA TGGGCGGGGT GGCATGGCGG TGATTGATAC CGCGACCAAC ACGGTGATCA CGATCTTTGA CACGGCGGTG TACGGCAGCG GCTCCACGCT CGTGACCGCC AGTGCCGCCC AAGATCTGGT CTTTTCACCC GATGGTCAGA CCGCCTACGC GGCGGATATG GGCGATTACA GCCTGACGGT CTTGGATGTG GCCAGCCGAA CGGTGCGCTC ACGGTTGATT CGCGAGGGTG ACGCGGCGGT CGGGCGACGG GTGCTGCTCA ATCAGGCTGG GACGCGCTTG TATTTGGCGA CGCGGCAAGG GAATGCGCTG TATGTCGTCG ATACGGCGAC TAGCAGTTTC ACCCGCGTGG CCGTGAGCAA CCCCTATCGC CCGCAGCTGG AGGGGATTGT CCTCAGTCCT GATGAGTCGA AACTCTACGT CGTGGTCTAC CGGATTGGCG ATAGCTTTAC GCCGCAAAAT CGGGCGTTGA TCTTGGACAC GGCGACCAAT CAGTGGTTGC CCACCGACCT CCGTTGGCCA CAGCCGAATC CGGTGTGGGG CGCACGGGCG GCAACCCGCC ATCCGGTGAC CGGTATGGTG TATATCGGCG GCGGCAATGG GGTGATGGTG TTTGATGGTG AGGAACGCCA GCCCGCGCTG GAGTTGGCAG CGGGCCTCGA TAACTCGGTC TACGAGTTTG ATTGGTTACG ACGCATTGCG ACCGCGACCG CCAGTGTGAC GGTGCGGGTG GATTTGTCCT CCGATCTTGG CGTGGAGAAA ACCCATGCGG GGGATTTGGT CGTGGGGCAG GAAGGAACCT ACACCATTGC GGTCACCAAC CATGGCCCAG CGGTGATGCC TGCCGGAACG ACGATCACGG ACGACGTGCC AGACGATCTG CGGGTTGTTG CGGCCAGTGG GGCACATTGG TCGTGTGCTA TCACGGGCCA AACCGTGACT TGTACCGCGA CCGTGGCCAT GCCAGCGCTC GAAACGGGCA CGGTGCAGAT TCGGGTTATT CCAGAGGCAG CGGCGGGGGC CAGCGTGATT AATCGGGCCT GTGTCGATAC CCTGATTGAT GCCAACCCGA CCAACGATTG TGATGATGAC CTGACAACCA TCCTGCATCC GGCCTTGGCC ATCGGGAAGC GCTCGACCCC ACCCAATGGC ACGGCGGTGG CGGCAGGCAA CACGATTACC 
TATTTCCTTG ACGTGACCAA TACCGGAACC GCCCCGTTGA CGGGGGTGAC GGTACGCGAT GCGATTCCCG AGGCAACGGC CTTGATCGCG GCTGATCCGG CAGTGACTCC CATCGACGGC GTGCTGACGT GGGAACTGGG TGATCTGGCC GTCGGAGCAA CGCGCACCGT GCAGTTTCAG GTGCGGGTGT TGCCGATCGG CACAACCGTC GCCATTCGCA ATGTGGCGCA GGCCGACAGT GACCAAACCA GCGAGCAAGA TTCGAACCTG CTGATTCACC CCTTCGACCC GACCAGTATC AGCCTGGTGT CCTTTGACGC GGTGGCCACG GGCGGCATGG TCGATCTGCG CTGGGTGACG GGCAGTGAAG TCAACACGTT GGGCTTTCAC CTCTACCGGA GCACCACCCC GAATCGCAAC GAGGCCACCC GCGTGACGAC CAGCCTGATT CCCTCACAGG GCGCGACGGG CGGCAGCTAC CGCCTGACCG ATGCCCATGC CACCGCACCG CTTGGCCAAT GGTCGTATTG GCTGGAAGAA GTCGAGCTGA ACGGCCAAAC CACCTGGTAT GGCCCGGTGA CGGTACGGAT GCATACGATC TATGGCCCCG CTGTGATGCG GTAG
Protein sequence | MHRWRWFGLV GLLLGLLSGL PVAAMDTQDG LPTVAVGIAS PTAGQVLHAP STTVTGTATA GSPVTVWLDG QVVETLEAGA NGQWSLPVSG LTHGTHTLSA TSTLDAGTWL YMLDPSCAVG RETCVRIADP DTATIHTTIH TSPDSAGGEL LIAHPTMDRL YIGHSTGVDV YTHDGTFITR IALPKTVRMG ALTPNGRELW VPQDASDGRG GMAVIDTATN TVITIFDTAV YGSGSTLVTA SAAQDLVFSP DGQTAYAADM GDYSLTVLDV ASRTVRSRLI REGDAAVGRR VLLNQAGTRL YLATRQGNAL YVVDTATSSF TRVAVSNPYR PQLEGIVLSP DESKLYVVVY RIGDSFTPQN RALILDTATN QWLPTDLRWP QPNPVWGARA ATRHPVTGMV YIGGGNGVMV FDGEERQPAL ELAAGLDNSV YEFDWLRRIA TATASVTVRV DLSSDLGVEK THAGDLVVGQ EGTYTIAVTN HGPAVMPAGT TITDDVPDDL RVVAASGAHW SCAITGQTVT CTATVAMPAL ETGTVQIRVI PEAAAGASVI NRACVDTLID ANPTNDCDDD LTTILHPALA IGKRSTPPNG TAVAAGNTIT YFLDVTNTGT APLTGVTVRD AIPEATALIA ADPAVTPIDG VLTWELGDLA VGATRTVQFQ VRVLPIGTTV AIRNVAQADS DQTSEQDSNL LIHPFDPTSI SLVSFDAVAT GGMVDLRWVT GSEVNTLGFH LYRSTTPNRN EATRVTTSLI PSQGATGGSY RLTDAHATAP LGQWSYWLEE VELNGQTTWY GPVTVRMHTI YGPAVMR
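The length fields in this record can be cross-checked against its coordinates. The sketch below is only an illustration, not part of the source database: it assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon (the gene sequence above ends in TAG). The function names `gene_length` and `protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len // 3 - 1


# Values taken from the record: Start bp 79105, End bp 81528.
length = gene_length(79105, 81528)
print(length, protein_length(length))  # prints "2424 807"
```

Both results agree with the Gene Length (2424 bp) and Protein Length (807 aa) fields listed in the record.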