Gene Haur_0473 details


Gene Information

Locus tag: Haur_0473
Symbol:
ID: 5732372
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 552446
End bp: 553780
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 51%
IMG OID: 641277599
Product: NusA antitermination factor
Protein accession: YP_001543252
Protein GI: 159897005
COG category: [K] Transcription
COG ID: [COG0195] Transcription elongation factor
TIGRFAM ID: [TIGR01953] transcription termination factor NusA
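
The start and end coordinates, the gene length, and the protein length are tied together by simple arithmetic. The following is a minimal Python sketch of that check, assuming inclusive 1-based coordinates and an unspliced CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(552446, 553780)
    print(length, protein_length(length))
```

For this record the sketch gives 1335 bp and 444 aa, in agreement with the Gene Length and Protein Length fields.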


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000679634
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGTG ATTTTTACGC AGCTATTTCA CAGATTGCCG CTGAACGTGG CATTCCCCGC 
GAGTCGGTGC AGGATGTTGT CGAACAAGCC TTAATTTCTG CCTATCGGCG CTATTTGGGC
AGTAATCCAC CACCAGTTGA CGTTAAGATT GAATTAGAGC CAAATACTGG GCGGATTCGG
GTTTACGCTG AAAAGCAAGT CGTCGATGAA GTGATGGATG ATCGCTTCGA AATCGATATT
GAAGATGCCC GTAACGTTCG CGCCGATGTT GAAATTGGCG AAACGGTTTA TGTGGAAAGC
ACGCCCGACG ATTTTGGGCG GATTGCCGCC CAAACCGCCA AACAGGTGGT ATTGCAACGG
ATCAAAGAAG TTGAACGCGA CCATATCTAT GGCGAATACT TTGATCGCGA AGGCGAAATT
GTCACTGCCA CCGTGCAGCG CACCGCCAAA GGCAACGTAA TTTTAGAAGT TGGGCGAGCC
GAAGCGATTT TGCCCCAAAA AGAGCAAATT AGCCACGACA ACTATCGCCA TGGCCAACGC
CTCAAAGTCT ATTTGATGGA AGCTCGCCGT GATGATCCGC GTGGCCCGCG CTTGGTCGCC
TCGCGCACCC ACAAAGATTT GATCAAACGC TTATTTGAAA TGGAAGTGCC CGAAATCTAC
AACGGCACGG TTGAAATTAA ATCGATCGCC CGTGAACCAG GTTTACGCTC GAAAGTCGCC
GTCCATGCCC GTCAAGAAGG CATCGATCCG GTTGGCTCGT GCGTGGGGAT GCGCGGGATT
CGGATTCAAA ATATTGTGAA TGAACTGAAC GGCGAGAAAA TCGACGTGGT GCAATGGGGT
GCTGATATGC GGGTATTTAT TGCCAACGCC CTCAGCCCAG CCCAAGTCGT CGAAGTTCAT
CTTGATGAAG GCGAAAAAAC GGCCACGGTG GTCGTGCCAG ATAAACAATT GTCGTTGGCA
ATTGGCAAGG AGGGCCAAAA CGTTCGTTTG GCAGCCAAAC TGGTTGGCTG GCGCATCGAC
ATCAAGAGCG CATCTTCACT CTTAGAGGAA GAACGGGCTG CTGCTGAAGC GCGTGAGGCT
GCCGCGTCGG AACAAATGCT GCAAGAAGCA GCGCTCTCAA CCGCCAAAGT TGAAACCCGC
AAGGTGCGGG TCGATTCCTT GGTCACCTAT CAAGGGCGAC AATATGGCCC CTTGCCAGTT
GAACTAATTG GCGAAGAAGT AGCGTTGCGA GCCGCCGCCC AAAAACTCAA TATTTATTTC
AATGACAAGC TGATTGCTAG CTATATCATC GATGATGAGG CTGGTGACAG CGACGAGACG
GATACCGAGG CATAG
 
Protein sequence
MKSDFYAAIS QIAAERGIPR ESVQDVVEQA LISAYRRYLG SNPPPVDVKI ELEPNTGRIR 
VYAEKQVVDE VMDDRFEIDI EDARNVRADV EIGETVYVES TPDDFGRIAA QTAKQVVLQR
IKEVERDHIY GEYFDREGEI VTATVQRTAK GNVILEVGRA EAILPQKEQI SHDNYRHGQR
LKVYLMEARR DDPRGPRLVA SRTHKDLIKR LFEMEVPEIY NGTVEIKSIA REPGLRSKVA
VHARQEGIDP VGSCVGMRGI RIQNIVNELN GEKIDVVQWG ADMRVFIANA LSPAQVVEVH
LDEGEKTATV VVPDKQLSLA IGKEGQNVRL AAKLVGWRID IKSASSLLEE ERAAAEAREA
AASEQMLQEA ALSTAKVETR KVRVDSLVTY QGRQYGPLPV ELIGEEVALR AAAQKLNIYF
NDKLIASYII DDEAGDSDET DTEA
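
The protein sequence is the translation of the gene sequence under translation table 11, which assigns amino acids to codons in the same way as the standard genetic code. Below is a minimal Python sketch of that translation and of a GC-content calculation, assuming the sequence is pasted in with the 10-base spacing used in this record. The helper names (`clean`, `translate`, `gc_content`) are hypothetical, and the sketch does not handle ambiguous bases or the alternative start codons that table 11 permits.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Assumes only A, C, G, T; any other symbol raises KeyError.
    """
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGAAAAGTG ATTTTTACGC AGCTATTTCA"))  # MKSDFYAAIS
```

The first ten codons translate to MKSDFYAAIS, matching the start of the protein sequence. Passing the full gene sequence to `gc_content` gives a value that can be compared with the GC content field.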