Gene Haur_1524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1524 
Symbol 
ID5733411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1776332 
End bp1777873 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content52% 
IMG OID641278664 
Producttranscriptional antiterminator, BglG 
Protein accessionYP_001544296 
Protein GI159898049 
COG category[K] Transcription 
COG ID[COG3711] Transcriptional antiterminator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000100792 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGTCGC TGACGACTGC TCAACGTGAT CTGTTGTATC TGTTGCTTAC CACTGAGACT 
CCAATCGGGG CTGCGGCACT CGGTCAGTGT CTCCATCTCA CACCACGCCA AGTTTCCTAT
AGTTTACGGA GTATTAAGCT GTGGTTGGCT CGTCGCCATG CCAGTTTGCG CCAAGTGCCT
GGGGTTGGCA TGCAGCTGAT CTGCTCGGCC CAGCAACGTG AACGGCTTTA TGCTGAGCTA
GAATCGCATG CCAAATTTCA ATTAATTCTC ACGCCCGAAC AACGTGGTCA GCTTTTGGCC
TTAGTGCTGT TGATTACCCC CGAACCCTTG ACCCTCAACC AACTCCAGCA AGATTTAGCA
GTTGCTCGCA CGACGGTGCT TAAAGATCTC GATGTCATGG AGATTTGGTT GGCCAGTTTT
GGTTTGCAAG TGGTGCGGCG ACAACATCGC GGCTGCTGGA TCGAAGGCGC TGAGTTGGCG
AAACGCCAAG CCTTGGCAGC CTTGTTATGG GGCGATGTGC CCTTTAGTTT GCCAATTATC
AGCGTGCAAG CGGGCCTTGG CTTTGATTTT GTGTTGCAAC AAGATGCAGC CCTCTTACCG
ATTATTCAAC GGGTCAATAG CTTTTTGCAG GAGCTTGATC TGCCTAAAGC CCAACAGCAG
GCGATCTGGG CCGAAGCTGC CTTGAATGCG CGGTTTAGCG ATCAGGCAAT TAGTTTGCTG
GCTTTGGCCT TGGCTTTGCA ACAGCAACGG ATCAATGCTC AACAGTATCT CCATTGGCAT
CCCGAAACAT TACATTGGCT GGAACAGCAG TTGGTTTGGT CAGTTGCTAC GCAATTTGAG
CAACATTATG GTTTGCAGGC CAGTGCAACG ATCGATCTTG CTGAAATTGC TGGTGTAGCC
TTGCAATTGG TGTGTGCTGC CCGCGAACGG CCATGGCCCA ATCAGCATGA GACCGATCAC
GTAACCGTTC GGTTGATCAA TGCACTGATT GATTTGATTG CCAGTAGCTA CGATGTCGCA
GAGCTTGCCA ACGATCAGCT TTTGGGTGAT GGTTTGGCCG CCTTGATTCC GCCAGCCTGC
AATCGCCAGC GCTTTGGTTT GTGGATTCCG ACCCATCAAT CCAGCGAAAC GCAGAGTGAA
CGTTATGCAA CTGAGCGGCG GGTGGCCGAT TTAATTGATC GCAAGCTGCT GGCAACGATT
GGTGTAGCTT TGCCGATTGA TGCCCGTGAT GAGCTGATTT TGTTGTTGCG GGCAGCGGTG
GTGCGGGCGC GGCCTGTGCA AACCCGCAAT ATTCTGGTGG TTTGCCCGAG CGGCATGGCC
ACAACTCAAC TGTTGGTAGC ACGGCTCAAA GCACGGTTTC CCAAACTGGG TATTTTTGAA
GTGCTCTCGA TGCGCGAGCT TTCGGCAGAA CGTTTAGCCA ATGCCGATTT GGTGATTACG
ACTGCGCCTT TAGCATTGGC TAGTGTGCCA ATTGATGTCA TCCAAGTGCA TCCAATGCTG
CACCCAGAAG ATATTGCGGC GCTGACCCAG TGGATGGTTT AG
 
Protein sequence
MLSLTTAQRD LLYLLLTTET PIGAAALGQC LHLTPRQVSY SLRSIKLWLA RRHASLRQVP 
GVGMQLICSA QQRERLYAEL ESHAKFQLIL TPEQRGQLLA LVLLITPEPL TLNQLQQDLA
VARTTVLKDL DVMEIWLASF GLQVVRRQHR GCWIEGAELA KRQALAALLW GDVPFSLPII
SVQAGLGFDF VLQQDAALLP IIQRVNSFLQ ELDLPKAQQQ AIWAEAALNA RFSDQAISLL
ALALALQQQR INAQQYLHWH PETLHWLEQQ LVWSVATQFE QHYGLQASAT IDLAEIAGVA
LQLVCAARER PWPNQHETDH VTVRLINALI DLIASSYDVA ELANDQLLGD GLAALIPPAC
NRQRFGLWIP THQSSETQSE RYATERRVAD LIDRKLLATI GVALPIDARD ELILLLRAAV
VRARPVQTRN ILVVCPSGMA TTQLLVARLK ARFPKLGIFE VLSMRELSAE RLANADLVIT
TAPLALASVP IDVIQVHPML HPEDIAALTQ WMV