Gene Information |
Locus tag | Haur_1524 |
Symbol | |
ID | 5733411 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1776332 |
End bp | 1777873 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641278664 |
Product | transcriptional antiterminator, BglG |
Protein accession | YP_001544296 |
Protein GI | 159898049 |
COG category | [K] Transcription |
COG ID | [COG3711] Transcriptional antiterminator |
TIGRFAM ID | |
|
|
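The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends in a single stop codon. Both are assumptions, though they agree with the values listed. The function names are hypothetical and exist only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 1776332
END_BP = 1777873
GENE_LENGTH_BP = 1542
PROTEIN_LENGTH_AA = 513


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from start to end gives the listed gene length, and the gene length in codons minus the stop codon gives the listed protein length.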
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000100792 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGTCGC TGACGACTGC TCAACGTGAT CTGTTGTATC TGTTGCTTAC CACTGAGACT CCAATCGGGG CTGCGGCACT CGGTCAGTGT CTCCATCTCA CACCACGCCA AGTTTCCTAT AGTTTACGGA GTATTAAGCT GTGGTTGGCT CGTCGCCATG CCAGTTTGCG CCAAGTGCCT GGGGTTGGCA TGCAGCTGAT CTGCTCGGCC CAGCAACGTG AACGGCTTTA TGCTGAGCTA GAATCGCATG CCAAATTTCA ATTAATTCTC ACGCCCGAAC AACGTGGTCA GCTTTTGGCC TTAGTGCTGT TGATTACCCC CGAACCCTTG ACCCTCAACC AACTCCAGCA AGATTTAGCA GTTGCTCGCA CGACGGTGCT TAAAGATCTC GATGTCATGG AGATTTGGTT GGCCAGTTTT GGTTTGCAAG TGGTGCGGCG ACAACATCGC GGCTGCTGGA TCGAAGGCGC TGAGTTGGCG AAACGCCAAG CCTTGGCAGC CTTGTTATGG GGCGATGTGC CCTTTAGTTT GCCAATTATC AGCGTGCAAG CGGGCCTTGG CTTTGATTTT GTGTTGCAAC AAGATGCAGC CCTCTTACCG ATTATTCAAC GGGTCAATAG CTTTTTGCAG GAGCTTGATC TGCCTAAAGC CCAACAGCAG GCGATCTGGG CCGAAGCTGC CTTGAATGCG CGGTTTAGCG ATCAGGCAAT TAGTTTGCTG GCTTTGGCCT TGGCTTTGCA ACAGCAACGG ATCAATGCTC AACAGTATCT CCATTGGCAT CCCGAAACAT TACATTGGCT GGAACAGCAG TTGGTTTGGT CAGTTGCTAC GCAATTTGAG CAACATTATG GTTTGCAGGC CAGTGCAACG ATCGATCTTG CTGAAATTGC TGGTGTAGCC TTGCAATTGG TGTGTGCTGC CCGCGAACGG CCATGGCCCA ATCAGCATGA GACCGATCAC GTAACCGTTC GGTTGATCAA TGCACTGATT GATTTGATTG CCAGTAGCTA CGATGTCGCA GAGCTTGCCA ACGATCAGCT TTTGGGTGAT GGTTTGGCCG CCTTGATTCC GCCAGCCTGC AATCGCCAGC GCTTTGGTTT GTGGATTCCG ACCCATCAAT CCAGCGAAAC GCAGAGTGAA CGTTATGCAA CTGAGCGGCG GGTGGCCGAT TTAATTGATC GCAAGCTGCT GGCAACGATT GGTGTAGCTT TGCCGATTGA TGCCCGTGAT GAGCTGATTT TGTTGTTGCG GGCAGCGGTG GTGCGGGCGC GGCCTGTGCA AACCCGCAAT ATTCTGGTGG TTTGCCCGAG CGGCATGGCC ACAACTCAAC TGTTGGTAGC ACGGCTCAAA GCACGGTTTC CCAAACTGGG TATTTTTGAA GTGCTCTCGA TGCGCGAGCT TTCGGCAGAA CGTTTAGCCA ATGCCGATTT GGTGATTACG ACTGCGCCTT TAGCATTGGC TAGTGTGCCA ATTGATGTCA TCCAAGTGCA TCCAATGCTG CACCCAGAAG ATATTGCGGC GCTGACCCAG TGGATGGTTT AG
|
Protein sequence | MLSLTTAQRD LLYLLLTTET PIGAAALGQC LHLTPRQVSY SLRSIKLWLA RRHASLRQVP GVGMQLICSA QQRERLYAEL ESHAKFQLIL TPEQRGQLLA LVLLITPEPL TLNQLQQDLA VARTTVLKDL DVMEIWLASF GLQVVRRQHR GCWIEGAELA KRQALAALLW GDVPFSLPII SVQAGLGFDF VLQQDAALLP IIQRVNSFLQ ELDLPKAQQQ AIWAEAALNA RFSDQAISLL ALALALQQQR INAQQYLHWH PETLHWLEQQ LVWSVATQFE QHYGLQASAT IDLAEIAGVA LQLVCAARER PWPNQHETDH VTVRLINALI DLIASSYDVA ELANDQLLGD GLAALIPPAC NRQRFGLWIP THQSSETQSE RYATERRVAD LIDRKLLATI GVALPIDARD ELILLLRAAV VRARPVQTRN ILVVCPSGMA TTQLLVARLK ARFPKLGIFE VLSMRELSAE RLANADLVIT TAPLALASVP IDVIQVHPML HPEDIAALTQ WMV
|
| |
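The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (table 11 differs in its permitted start codons, which this sketch ignores), and it strips the whitespace that separates the 10-character blocks in the record. The names `clean` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codons in TCAG order with their amino acids ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon.

    Raises KeyError if a codon contains characters other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence from the record.
    print(translate("ATGCTGTCGC TGACGACTGC TCAACGTGAT"))
```

Applied to the first three blocks of the gene sequence, this yields `MLSLTTAQRD`, which matches the first block of the protein sequence in the record.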