Gene Haur_0161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0161 
Symbol 
ID5732070 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp190394 
End bp195169 
Gene Length4776 bp 
Protein Length1591 aa 
Translation table11 
GC content53% 
IMG OID641277285 
Producthypothetical protein 
Protein accessionYP_001542941 
Protein GI159896694 
COG category[R] General function prediction only 
COG ID[COG2373] Large extracellular alpha-helical protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.330881 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGATTGC TCGAAAAACT AGCACGATGG TGGTGGCGTT GGCTTCTAAC AACAATTAGT 
TTGTTGGGCA CGGCGCTGCT TGGCGTAATC GTGGCGAGTA TCGCTGGCTG GGTTCCGCAA
CCAAGTGTTT TGCGCTTAAC GCCGTTGGCT GGTAGCCGCG AATTGCCGCC AGCCACGGCC
TTGGAATTTA ATTTCAATCT GCCGATGGAT CGTTCGAGCG TCGAGGATGC TTTGGTGATT
ACGCCGCCGG TGCGTGGACG TTGGCATTGG GAAGGCCGCG CCCGTGCCCG TTGGCAGCCT
GAATTTGGCT GGACACCAGG CACAACCGTG ACGGTGCAAT TACGCCCAAC CGCCCAATCG
ATGTTGCGCC AAACCCTGGC AACAACCGTG ACAACCCAAT TTACCGCCGC TCCTGCCCCG
CTGTTAGTGT TTCGCTCACC GCTACCCAAT GCGATTATTG CCCCCAATAC CCCGATTTTG
TTGCGTTTCA ATCGGGCGAT GATCGATTAT GCCAGCCAAA CTGAGCGGGG CTTAGCTGAA
TTAAGCATCG AGCCAAATGC TGCTAGCCAA GTGCGTTGGC TCGATGATCG TAGCGTTTTG
GTGCAGGCGC AATGGCAGAT TGGTCAGCAC TATCAACTCA AGCTGCAAGA TCTCAACGAT
CTGCTGGGAA TTCCGGTTCA GACCACCGAA TGGCAAGTGC AGGTGAGCGA GCCAAACCTG
ATTGCTCCGC CAACGACCCA AACCCTTCAG GCTGATCAAG CCTTGGAATG GGCCTTTGAG
GGTTTGCTTG ATCAAACCAC AACCCAACAA TTGATCAAAC GAATTCAATT TACTCCCGCT
GTGAGCACAA CCTGGCAGGT GCAATATCAG CCTGATCCCC AACCACAAAC CATCATTCAA
GCCTTGCCTG AGGCAGCTTG GCCAGCTCAA ACGCTGACTA CCAGCCTACA ACTCAGCCAG
CCAATCACCC AAGTTTGGCA AATTCAACCC AACTTGCAAT TAATTGGCTC AGTGCCAGGC
CGTGGTGGCA GTTTAGCCGT TGATGATCCG CTGCGCTTGC TGTTCAACCA AACGCCTGAG
CGCCAGCAAC TTGTCGAGCA ACTATTTATC GAGCCTGAAG TTGCGAATGT GGCGATTAAC
ATCAACAATC AGCAGGCCTT GATTAGTGCT GCATGGCAAC CAAGCACAGT TTACACGCTG
ACCTTGGCTG GCTCTGCCCC GCTGACCTTT CGCACCCAAA CCCAAGCGCA GCCACTGCAA
ATTGAAGGTG CGGCTTATTC GTTGTTGTTG CCGGAAACTA CCGCCGAATT GCGGTTGAGT
GGGCGCGAAA ATAGCCGACT CACGGCGAAT CTGTATGCGG TTGATCGGGC GGTTTTGGCG
GCGGCTTTGC AACAACCGCA AACCCCGCTT AATCCTCAGC GTTATAATTT GCAACCGCAA
CAGCAATGGC AAATTGCCGC TGGCCCTGAA CGAACCATTA GCATTACCCC AACTCAAGCG
GCGTTATTAC TGCAAGTGCA AGCTCCCAAT GCCGAACTTG TGCAACATGT GTTGATTTGG
ACTCCCTATC GAGCGCAATT GTTGGCGACA CCTGAGCAAG CCCAAGCGTG GATTGTTGAT
CTGGTCAATC GCCAACCAGT CGCCGCCGCA AATTTGAGCA TTTTGGCGGG TGCAAATCAA
CTTGCCACAG GCCAGAGCGA TTCGCAAGGG CTTTGGCAAA GCTCAATTCA GGGCAGTAGC
GGGCGTTTGG TCTTGCTTGG CGGCGAGCCG CAAGCGCCGA TTGTGGCTGA AACGATTATT
CGACCACAAC GCCAAGCACC AAGTTTGAGT AGCCAATTGC TGCTTGACCG CCAGAGCTAC
CTAGCCAACC AACAACTGAC GATTATCGGC AGCAGCGAGT TTGCTGCGGA TGGGATGACT
ACGCCAACCC TAACCTTGGC AGTGCTTGAT CCTAGTGGCC AAGCGATTGC GCCCGAACAA
ACCTTGGTGC TAAGCCAAAG TCTCTGGATA ACCAAGGTGC AACTGGCCGC CGATGTTCAA
CCTGGTTTGT ATGAAGTGCG GCTGCGCTAT GCCAACCAAG TGATTGCGAG CCAAAATTTT
GTGGTCAATC AGCCCGATTT GGCCTATCGC GTAGTTTTAC CCGAATTGCA AGCGACCAAT
GCGATCGTGT CAGCTGAAAT CATCAGCGAT TTACCGAGCC AACATGGCCT GTGGCGCTTG
CTTGATCAGC GTAATCGCCA AGTGAATGCT GGCAGTTGGC ATAGCAACGC CGCTGGAATT
GCCCAGTTCG ATTATCAACA GAGCCAAACG TTGTTGGGCG ATTATGTGCT CGAAATTCAG
CTTGGTGAGC AAGTTAGCTA TCATCCCAGC CAAATTCACT ATCCTGCGCA ATTAGCTGCC
AAGGCTGAGC AACAGTTGCT CAATCCCAAT CAACCAGCCA CGATCAACTT CCAGCTGCGC
GATCAACAGG GTATGGCTTT GAGCAATCGT GCTTTGCAAA TCGAAGTTCA AGGCGCAATT
AGCAGCACCC AAAGTTTGCG TACCAATGCT GCTGGCGAAG TTGTTTGGCG CACCAATGGT
TTGCGGGCGG GCCTGTGGCT GATTCGTGCC AGTAGCCCAG ATGCGGCCAG CGTGGAAGTG
CCTTTGTGGG TGTTGGGGGC CAAGAGCGAT CAATTGATTG GTCGGAGCCA AAGTAGCTTG
GTCGAAACAA CTGAACTCGC GCTTGCCCCG CTGACCAATC TGGCGGCTAC TAATGTGTTG
GTTAGTTGGT TTGAAGGTAG CAGCATTCGC CAGCAGATTC AGCCTTGGGT TGCGGGCCAG
AGCATCGCGA TTACGCCCAG TCTGAGCAAT ACTCAGCAAT TAGAGGTAGC TTTGGCTTGG
CTTGATGCTG AGCAGTTAAT CAGTAGCAGC AGCCAAATTC CCTTCCAAGC AGTTGCGCCA
CTCGATTTGA CGATGGTTGT GAGCGGTTCG CAATTGGTGC TGGCAACCCA ACGCGCTGGT
CGTCCTGTTG CCAGCATGGT TGGCTTGAGT TTTCAGTCAA TCCAAACGTC AGCTAGCCAA
CAACGGCTTG TGCGCACAGC CAGCAATGGC CAAGCAGTGC TCGATTTACA AGGATTTGGC
GAGGGTTGGC GGATTCAGGC CTTAGCCAGT GATGGCTTGA ATAGCGCCGA AGCCACGACG
TTGTATGCAG CAAATCCCTT GATCGATGCC CAAATGCTGT TGCCCGAACG CTTGATTGTG
GGCGATCAAC TGAGCGTGAC CGCTCGCTTG AATTTGCTAG CGACTCGCGG TGGGATTCCT
AGCGTTAGCC TGAGCGTCGA TTCGGCTTAT CTTGCGGCCT TGGCTGAGCC AACCACGGGC
AATAGTGGCC GAATTTATAC CACAACTCAA GCCTTGGTTG CCGCCAACCT TGGTACGACC
ACGCTCACCG CAACCATTGA AGTTAATCAG CAAACCCAAA TTATCACCAA AGCCCTGACG
ATTCAGCCAC CAAGCCAAGT GCGTTTGCAA CAAAGCCGTT TGTTGACCGA AGCGACCGAT
ATTACTTGGC AAGTGCCGAC GGCCAACCAA GCAGCCTCAA TTAATATTGT GATTGCTAGT
TTAGCCCAAG CCAGCCAAGC CTTGCCTAGC CAATTGCCGG GCGATGATCC CCTGACGTTG
GCTGGGCGAA TTGCGCTATC CCGCTATGCA GCGCAGCCTG CGCAGGCCGA GATTGAATTA
TTGGCCAAGC AGCAACTTAC CAATGGCAGC TGGAGTTGGG ATGCGCAAGC GGGCGATTTG
CAATTGACTG CCTTGATTGT GCAACTGATT GCGACCGAGC AACGTTCGCC CCAACTGCGC
CAACTCGAAG ATCAAGCCGC TCGTTGGTTA GCCTTGGTTA GTCCACTTGA TCCCGATTTG
CGCAGCGAAA TGCTGTTGGC CTTGGCGCAC AGCGGCAAAA CAGTTGAAGC TCAAATCACC
AGCTTGCTCA AAAATCGCCA ATTAACCACT GCCCAACGCT TGCGCTTGAT CTATGCCTTG
GTGCTGCTCG ATTCGCCTCA AGCCGATCAA TATTTGCTTG AAGTGAGCCA GATGTTGGCT
CAACCCCTAA CGCCAAGTGC TTGGATGAGT CGCACCCGCC AAGCGGCGCT GATCGGCTTG
ATTTTTGAGC GCAGCCAACC AAGTTCAGCC TTACGTGCCC AAGTGCTCAA ACTGATCGCC
GAGGGCTGGA ATGGCACGAC ATGGGAAGAT TTGCCAAGTA GCGTAGCGGT ATTTCAGCTA
ACCCAACAAG ATTTTCCAAC CCGTGGGGAT TATCGTTTGG GCTTGAATGG CGCAGCCTTG
AGCGATTATG ATCAAGCTAG CAGCCTAATG TTGCCAATTA CCAACCAACT TAACGTGCAG
ATCGAGCCGA ATGGCCCAAT CTTGTTGGCG AGTCAAGCTA GCTGGCAAAA CCCTGCTACC
CAAGCCTACT TGCTCCATTC CAGCAGCGAA GATCAATTGG CTTATGGTGA GCAATGGTTG
TGGCAAGGCT ATTTGGTGCT TGAACAAGAT CTGTCATTGT TACAACTACG TTTGGCTGCG
CCGAGTGGTG TGGAGTGGCA GCTTGGTTCA GCTACAGGCT TGAATTGGCA TGATGGTCTG
TTCAGTGCCG CCAACGTTCG GGCTGGCATT TACCCCATTC AACTATTGGG CGTAGCTCGC
CACAAGGGCC AATTTGCTTG GCCAGCGCCC CAATTAACCA GTGGCGGCCA GATCGTTGAG
CTTAAACAAA CCGCGCCGCC AGTGCTAATT CGCTGA
 
Protein sequence
MRLLEKLARW WWRWLLTTIS LLGTALLGVI VASIAGWVPQ PSVLRLTPLA GSRELPPATA 
LEFNFNLPMD RSSVEDALVI TPPVRGRWHW EGRARARWQP EFGWTPGTTV TVQLRPTAQS
MLRQTLATTV TTQFTAAPAP LLVFRSPLPN AIIAPNTPIL LRFNRAMIDY ASQTERGLAE
LSIEPNAASQ VRWLDDRSVL VQAQWQIGQH YQLKLQDLND LLGIPVQTTE WQVQVSEPNL
IAPPTTQTLQ ADQALEWAFE GLLDQTTTQQ LIKRIQFTPA VSTTWQVQYQ PDPQPQTIIQ
ALPEAAWPAQ TLTTSLQLSQ PITQVWQIQP NLQLIGSVPG RGGSLAVDDP LRLLFNQTPE
RQQLVEQLFI EPEVANVAIN INNQQALISA AWQPSTVYTL TLAGSAPLTF RTQTQAQPLQ
IEGAAYSLLL PETTAELRLS GRENSRLTAN LYAVDRAVLA AALQQPQTPL NPQRYNLQPQ
QQWQIAAGPE RTISITPTQA ALLLQVQAPN AELVQHVLIW TPYRAQLLAT PEQAQAWIVD
LVNRQPVAAA NLSILAGANQ LATGQSDSQG LWQSSIQGSS GRLVLLGGEP QAPIVAETII
RPQRQAPSLS SQLLLDRQSY LANQQLTIIG SSEFAADGMT TPTLTLAVLD PSGQAIAPEQ
TLVLSQSLWI TKVQLAADVQ PGLYEVRLRY ANQVIASQNF VVNQPDLAYR VVLPELQATN
AIVSAEIISD LPSQHGLWRL LDQRNRQVNA GSWHSNAAGI AQFDYQQSQT LLGDYVLEIQ
LGEQVSYHPS QIHYPAQLAA KAEQQLLNPN QPATINFQLR DQQGMALSNR ALQIEVQGAI
SSTQSLRTNA AGEVVWRTNG LRAGLWLIRA SSPDAASVEV PLWVLGAKSD QLIGRSQSSL
VETTELALAP LTNLAATNVL VSWFEGSSIR QQIQPWVAGQ SIAITPSLSN TQQLEVALAW
LDAEQLISSS SQIPFQAVAP LDLTMVVSGS QLVLATQRAG RPVASMVGLS FQSIQTSASQ
QRLVRTASNG QAVLDLQGFG EGWRIQALAS DGLNSAEATT LYAANPLIDA QMLLPERLIV
GDQLSVTARL NLLATRGGIP SVSLSVDSAY LAALAEPTTG NSGRIYTTTQ ALVAANLGTT
TLTATIEVNQ QTQIITKALT IQPPSQVRLQ QSRLLTEATD ITWQVPTANQ AASINIVIAS
LAQASQALPS QLPGDDPLTL AGRIALSRYA AQPAQAEIEL LAKQQLTNGS WSWDAQAGDL
QLTALIVQLI ATEQRSPQLR QLEDQAARWL ALVSPLDPDL RSEMLLALAH SGKTVEAQIT
SLLKNRQLTT AQRLRLIYAL VLLDSPQADQ YLLEVSQMLA QPLTPSAWMS RTRQAALIGL
IFERSQPSSA LRAQVLKLIA EGWNGTTWED LPSSVAVFQL TQQDFPTRGD YRLGLNGAAL
SDYDQASSLM LPITNQLNVQ IEPNGPILLA SQASWQNPAT QAYLLHSSSE DQLAYGEQWL
WQGYLVLEQD LSLLQLRLAA PSGVEWQLGS ATGLNWHDGL FSAANVRAGI YPIQLLGVAR
HKGQFAWPAP QLTSGGQIVE LKQTAPPVLI R