Gene Haur_5242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5242 
Symbol 
ID5737200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp9353 
End bp11116 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content56% 
IMG OID641282406 
ProductRNA-directed DNA polymerase 
Protein accessionYP_001547997 
Protein GI159901752 
COG category[L] Replication, recombination and repair
[V] Defense mechanisms 
COG ID[COG1403] Restriction endonuclease
[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.461209 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACACGA CCCCACTTGA TGAGCGCGGA ACCGCGCTCA CCGCTTGGAT CAATCAGACG 
CAAGAAGTGT TAGCCCATCG AAGTCTCAAC CAACAACCCT TTCATCGGGT ATTCAATCTC
ATGCGGACAC GGCGACTCGC CACGGTGGCA CTCAATCGGG TGCTTTCCAA CACGGGAGCA
CGCACCGCAG GGATTGACGG AATGACCAAG AAGCATATCG CAACGGATAC AGAACAACAG
GCATTGGTTC AGGAAATCTG GCACGACCTG ACAACCCATC AGTATCGGCC AGCTCCCGTG
CGTCGGGTGT ATATCCCCAA AGCCAATGGA CAGCAACGAC CGCTCGGTAT TCCCACGATC
AAAGATCGGG TGGTGCAAGA GATGGTACGG CTGATCCTCG ACCCGATCTA TGAAAGCACG
TTTTATCGCC ATAGTTATGG ATTTCGCCCC TATCGGGCAA CCCATCACGC GGTGGTACGA
CTCCGCGACC TGATCGGACG ACGAGGCTAC CAGATGGCCC TAGAAGGAGA CATCCGCGCG
TGCTTTGACC GAATTCATCA CACCACCTTA ATCCGGATTC TACGCCGGAC AATCAAGGAT
GAACGCCTGA TAACGGTCAT CCACCAGATG CTCAAGGCCG GAGTGATGGA CGATGGACAG
TGGCGCGTAA CGGAGGACGG AACGCCACAG GGCGGAATTG TCTCGCCACT GCTGGCCAAC
ATCTACCTGA ACGAGCTTGA CCAATGGGTA GCCAACCGAT GGGACACCTA CACCCCACTA
GAGCGCTATT ACCATCGGAA AGCCGGAACG GGGTATCCCT GTCAGATAAC CCGCTACGCG
GATGACTTTG TGGTATTGCT CCACGGCACA CACGCCGAGG CAACCACCTT GAAAACCGCG
CTCGCGACGT TTCTCGCTGA CCACCTCCAC TTGGAATTAT CAGCGGAAAA GACGCTGATA
ACGCCGGTGG AACAGGGCTT TGACTTTCTT GGATTTCACA TCCGGAAATA CCAAGACAGC
ACACGGATAA CCCCATCACG GAAGGCGATT GCGACCTTCA AACGCGAGGC GGCAGACCGC
ATCGGCAAAG GATTTCGGGA CAGTGACGAA GCGGGCATCG TAATGTTGAA CCACTACCTC
ACCGGATGGG GCCACTATTA CCGACGAGTG AGCAGCTCAA CCACGTTTCG CAGTCTCGAC
CACTACATTT GGTGGCGGGT GATGCGAACG ACGTTCCGGC TGCGACGCGG GCGGGGAGTC
CGGCACTTTG GCACACACTG CCGAAGCCAT CGCAAACGAT ACCGTGACGG TCTCAACCGA
AAACACGCAC ATCGACGAGG GGGCCATTAC GGCGTATGGG CAAACACCGC CCAAACGCGA
GCCTACATTG TCACGAGCTT GGCGTTCCTG CCGATTGAAT ATGTCGCCTT ACACCCACAA
CTGCATCCGT ACCGCAAAGC CGACCGGGCA AAACTCGACC AACGCAAACG CTTAGCGCTG
CTGTTGGCGC GAAATAGCCA TCCGGAACGG CCCGCGAACC CTGCCTATGG GAAGGCGTGG
GAACAGATAC GACAAGAGGT ACTCCAGATG AGCAACTACA CCTGCCAACA CTGCGGCACA
CGCGTGCATC GTAGCACGGC AGAAATTGAC CACCGGATAC CGTTGAAACG CTTCACGCGG
CGACAAACCG CGCACAAATT GGAAAATCTC CAATGCCTTT GCCGCGCATG CCATCTTCGG
AAGCATGGCA AAGAACCACG ATGA
 
Protein sequence
MNTTPLDERG TALTAWINQT QEVLAHRSLN QQPFHRVFNL MRTRRLATVA LNRVLSNTGA 
RTAGIDGMTK KHIATDTEQQ ALVQEIWHDL TTHQYRPAPV RRVYIPKANG QQRPLGIPTI
KDRVVQEMVR LILDPIYEST FYRHSYGFRP YRATHHAVVR LRDLIGRRGY QMALEGDIRA
CFDRIHHTTL IRILRRTIKD ERLITVIHQM LKAGVMDDGQ WRVTEDGTPQ GGIVSPLLAN
IYLNELDQWV ANRWDTYTPL ERYYHRKAGT GYPCQITRYA DDFVVLLHGT HAEATTLKTA
LATFLADHLH LELSAEKTLI TPVEQGFDFL GFHIRKYQDS TRITPSRKAI ATFKREAADR
IGKGFRDSDE AGIVMLNHYL TGWGHYYRRV SSSTTFRSLD HYIWWRVMRT TFRLRRGRGV
RHFGTHCRSH RKRYRDGLNR KHAHRRGGHY GVWANTAQTR AYIVTSLAFL PIEYVALHPQ
LHPYRKADRA KLDQRKRLAL LLARNSHPER PANPAYGKAW EQIRQEVLQM SNYTCQHCGT
RVHRSTAEID HRIPLKRFTR RQTAHKLENL QCLCRACHLR KHGKEPR