Gene Haur_5150 details


Gene Information

Locus tag: Haur_5150
Symbol:
ID: 5737108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 217654
End bp: 219408
Gene Length: 1755 bp
Protein Length: 584 aa
Translation table: 11
GC content: 45%
IMG OID: 641282315
Product: RNA-directed DNA polymerase
Protein accession: YP_001547906
Protein GI: 159901660
COG category:
COG ID:
TIGRFAM ID:
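
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 217654
end_bp = 219408

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the final codon is the stop codon
# and does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1755
print(protein_length)  # 584
```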


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTAC ATCTTTCGTC TGATCCCGAT ACCCTTCGCC AGCAATTCTT TCAGCTAAAA 
AGCCGCGATG ATCTCTTAAA CCTCTTAGAT ATTTCTCAAC AACAGCTTCT CTATTATTTA
TACATCTGCC CTGAAAATAA GCGCTATCGC CACCTCCGCT TACGAAAAAA AAGGGGCGGC
TATCGCACTA TTTACGCACC TGCAACACAT CTCAAAATAG TCCAACAAAA ACTTTCCTCC
ATTTTACAGT TGATCTATGA ACAAAAACCC GCAGTCCATG GGTTTGTCCC GCACAAAAGT
ATTGTCTCCA ATGCTGCTAT GCACCTCAAC AAAACCTACG TGCTCAATCT TGATTTACAA
GATTTCTTTC CATCCATCAA TTTTGGGCGG GTTCGAGGTC TTTTTATGAA TCAACCATAT
TATCTCAATG AGGAGGTCGC AACCATCCTT GCGCAAATCT GCTGCCATCG CAATACACTC
CCCCAAGGTG CGCCCACATC CCCCGTTATC TCGAATATGA TTTGCGCTAA ATTAGATCGG
GAATTACTCC GTTTTGCCCA AGCCAATCGC TGTGTTTATA CGCGGTATGC CGATGATCTC
ACCTTTTCAA CTAATACGCG GCAACCACCT TCCAAACTAG TGCGTCGTAC TGAGGCTACA
GCATCTATTG AACTTGGGCG TGATTTAGTT TCAATCATTA CAGCGAATGG CTTTCAGGTT
CATCCAGAAA AATCACGGCT TCAGGTCAAA GGTCGTCGCC AAGAAGTCAC GGGTCTTACT
GTAAATCACT TTCCCAATGT ACCGCGGAGA TTGATTCGGC AGATCCGGGC AATGCTCCAT
GCATGGCGAA AGTTTGGGTT GGATGCTGCG CAACAGCACT ACTATGCTCA CTATTGTCAT
CGTCAGTATC CAGTATTCAA ACCACGACCT CCCTTTCGGC AAGTCTTGAT TGGAAAAATC
GCATTTGTTG GTATGGTACG CGGCAAACAT GATCAACTCT ATCTCCGTCT TCGCGATCAA
TTGCTCAACC TTGACCCCAC CTATCGGGCA GCAGTGGAGA AAAAGGCCGA AGAAACATTC
ATCCTGAGTA CGCCACTTAT CAAAACCGAA GGAAAAACTG ATTGGAAACA CATCAAACAT
GCCTTACGGG TTTTGCAAGC ACAGGGGCTG TATGCGGGGT TATCCCTCGA TTTTGATGAA
AGTCTCACCG AGGGTGGAAG CAGTGAGCTA AAAAAAACGT GCTACTATCT TTCACGCGTG
AAACAAGCTC AAATTATTAT TGCGTTGTTT GACCGCGATG AACCGAATAT TATTCGCGAA
GTAGCAGATG GTGATAGATT CAAGGCATGG GGGAACCAGG TATTCTCCTT CGTGCTTCCT
ATTCCTGATC ATCGAACACA CACGCCAGAT ATTTGTATTG AATTCCTCTA TCCCGATAAC
AATCTCCTTC TTGTGGATGA GCATGGACGA CGCTTATACC TCAGCTCCGA GTTTCATGAA
ACATCAGGAC GGCATAAAGA AAACCCAGCG ATTTCCTGCC TGCTTTCGTC GAAATTCAAG
AAAGATGCCA AACTGTCGAT TATTGATGCT TCGGTCTTTG ATGCGAATCA TCGCAGTATT
GCACTCGCGA AAGATGCGTT CGCAGACTAC ATTCTTCATG ATCAAGTCCC ATTTAATCGC
ATGAATCTCG ATGGATTTAA GCCTATCTTT GATATGATTA TCGCAATTCT TCGTGCCGAG
ACAAACAAGA CATAG
 
Protein sequence
MSLHLSSDPD TLRQQFFQLK SRDDLLNLLD ISQQQLLYYL YICPENKRYR HLRLRKKRGG 
YRTIYAPATH LKIVQQKLSS ILQLIYEQKP AVHGFVPHKS IVSNAAMHLN KTYVLNLDLQ
DFFPSINFGR VRGLFMNQPY YLNEEVATIL AQICCHRNTL PQGAPTSPVI SNMICAKLDR
ELLRFAQANR CVYTRYADDL TFSTNTRQPP SKLVRRTEAT ASIELGRDLV SIITANGFQV
HPEKSRLQVK GRRQEVTGLT VNHFPNVPRR LIRQIRAMLH AWRKFGLDAA QQHYYAHYCH
RQYPVFKPRP PFRQVLIGKI AFVGMVRGKH DQLYLRLRDQ LLNLDPTYRA AVEKKAEETF
ILSTPLIKTE GKTDWKHIKH ALRVLQAQGL YAGLSLDFDE SLTEGGSSEL KKTCYYLSRV
KQAQIIIALF DRDEPNIIRE VADGDRFKAW GNQVFSFVLP IPDHRTHTPD ICIEFLYPDN
NLLLVDEHGR RLYLSSEFHE TSGRHKENPA ISCLLSSKFK KDAKLSIIDA SVFDANHRSI
ALAKDAFADY ILHDQVPFNR MNLDGFKPIF DMIIAILRAE TNKT
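
The protein sequence follows from the gene sequence by codon translation. The sketch below is a minimal illustration, not the pipeline used to produce this record: it builds the standard codon assignments (which translation table 11 shares for internal codons; table 11 differs mainly in the start codons it permits, which does not matter here because the gene begins with ATG) and translates the first and last blocks of the gene sequence. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Compact encoding of the standard codon assignments, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence: matches the start of the protein.
print(translate("ATGAGTTTAC ATCTTTCGTC TGATCCCGAT"))  # MSLHLSSDPD

# Last 15 bases of the gene sequence (in frame): ends at the TAG stop codon.
print(translate("ACAAACAAGA CATAG"))  # TNKT
```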