Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0473 |
Symbol | |
ID | 5732372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 552446 |
End bp | 553780 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277599 |
Product | NusA antitermination factor |
Protein accession | YP_001543252 |
Protein GI | 159897005 |
COG category | [K] Transcription |
COG ID | [COG0195] Transcription elongation factor |
TIGRFAM ID | [TIGR01953] transcription termination factor NusA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000679634 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAGTG ATTTTTACGC AGCTATTTCA CAGATTGCCG CTGAACGTGG CATTCCCCGC GAGTCGGTGC AGGATGTTGT CGAACAAGCC TTAATTTCTG CCTATCGGCG CTATTTGGGC AGTAATCCAC CACCAGTTGA CGTTAAGATT GAATTAGAGC CAAATACTGG GCGGATTCGG GTTTACGCTG AAAAGCAAGT CGTCGATGAA GTGATGGATG ATCGCTTCGA AATCGATATT GAAGATGCCC GTAACGTTCG CGCCGATGTT GAAATTGGCG AAACGGTTTA TGTGGAAAGC ACGCCCGACG ATTTTGGGCG GATTGCCGCC CAAACCGCCA AACAGGTGGT ATTGCAACGG ATCAAAGAAG TTGAACGCGA CCATATCTAT GGCGAATACT TTGATCGCGA AGGCGAAATT GTCACTGCCA CCGTGCAGCG CACCGCCAAA GGCAACGTAA TTTTAGAAGT TGGGCGAGCC GAAGCGATTT TGCCCCAAAA AGAGCAAATT AGCCACGACA ACTATCGCCA TGGCCAACGC CTCAAAGTCT ATTTGATGGA AGCTCGCCGT GATGATCCGC GTGGCCCGCG CTTGGTCGCC TCGCGCACCC ACAAAGATTT GATCAAACGC TTATTTGAAA TGGAAGTGCC CGAAATCTAC AACGGCACGG TTGAAATTAA ATCGATCGCC CGTGAACCAG GTTTACGCTC GAAAGTCGCC GTCCATGCCC GTCAAGAAGG CATCGATCCG GTTGGCTCGT GCGTGGGGAT GCGCGGGATT CGGATTCAAA ATATTGTGAA TGAACTGAAC GGCGAGAAAA TCGACGTGGT GCAATGGGGT GCTGATATGC GGGTATTTAT TGCCAACGCC CTCAGCCCAG CCCAAGTCGT CGAAGTTCAT CTTGATGAAG GCGAAAAAAC GGCCACGGTG GTCGTGCCAG ATAAACAATT GTCGTTGGCA ATTGGCAAGG AGGGCCAAAA CGTTCGTTTG GCAGCCAAAC TGGTTGGCTG GCGCATCGAC ATCAAGAGCG CATCTTCACT CTTAGAGGAA GAACGGGCTG CTGCTGAAGC GCGTGAGGCT GCCGCGTCGG AACAAATGCT GCAAGAAGCA GCGCTCTCAA CCGCCAAAGT TGAAACCCGC AAGGTGCGGG TCGATTCCTT GGTCACCTAT CAAGGGCGAC AATATGGCCC CTTGCCAGTT GAACTAATTG GCGAAGAAGT AGCGTTGCGA GCCGCCGCCC AAAAACTCAA TATTTATTTC AATGACAAGC TGATTGCTAG CTATATCATC GATGATGAGG CTGGTGACAG CGACGAGACG GATACCGAGG CATAG
|
Protein sequence | MKSDFYAAIS QIAAERGIPR ESVQDVVEQA LISAYRRYLG SNPPPVDVKI ELEPNTGRIR VYAEKQVVDE VMDDRFEIDI EDARNVRADV EIGETVYVES TPDDFGRIAA QTAKQVVLQR IKEVERDHIY GEYFDREGEI VTATVQRTAK GNVILEVGRA EAILPQKEQI SHDNYRHGQR LKVYLMEARR DDPRGPRLVA SRTHKDLIKR LFEMEVPEIY NGTVEIKSIA REPGLRSKVA VHARQEGIDP VGSCVGMRGI RIQNIVNELN GEKIDVVQWG ADMRVFIANA LSPAQVVEVH LDEGEKTATV VVPDKQLSLA IGKEGQNVRL AAKLVGWRID IKSASSLLEE ERAAAEAREA AASEQMLQEA ALSTAKVETR KVRVDSLVTY QGRQYGPLPV ELIGEEVALR AAAQKLNIYF NDKLIASYII DDEAGDSDET DTEA
|
| |