Gene Haur_3236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3236 
Symbol 
ID5735104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4096464 
End bp4099337 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content51% 
IMG OID641280382 
Producthypothetical protein 
Protein accessionYP_001546001 
Protein GI159899754 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACGATG TTGAGCGAAT CCGTCAGAAA ATTATTAATT TACAAGCCCG CCGTGCCCAA 
GAAAGCGACC AAGATTTGAT CGAGGCGCTT GATCAGATGA TTGGGCAACA TCAACACCAA
CTAAGTTTGT TGGCTCAACA AGCTGCCGCT GTGCTCGAAC CACCAGCCCC CGCCGCTACC
CAAGGCCAAG TTTATGGTTC CAGTGTGATG ACCAACCTTG GTCATGTTTG GATCAACCAA
GTTTTTAATG CGCCCTTGAG CACTGGCGAA CGCACCGATC AATTGATTTT GCACTATCTA
GCTCATGTTC AGGGCAATGC TGATCGCTTA GTGCTGGGCA CTGATGTCCA AGATAGCGAT
CAAGAGCGTA TGTCGTTGCA GCGGGTGTTT ACCATGCTGC AAGCCGATTT TTATGGGCCG
ATTGCAGGCG ATGCCTTGTT GCAATCGCTG CTGCTGGGTA GCTCACAGCC ACGCCCGCAA
TTGCAAGATT CGATTCTCAA GGCGCTCAAC CAAATTCAAC ATCAGCGTTC GGTGATTTTG
GGCGTGCCTG GTGGCGGCAA AACGACGGTG GTGTCGTATC TGGCTTCGGC TCAAGCGGCG
GCATTGCTCG ACCCAAGCAA AGCTGAATTT TTGCATAGCC AAGGCTGGAT GCACACCAAA
CTTGTGCCAG TGCGGGTGCG ACTGAAACAT GTGCAACCAC CAGCCGACCC TGAGCAACAA
ACTGCCGACG CTTTTTGGGA AGTCGTGACC GAATTGCAAT TGGCTGGACG GTTTGGCCAA
CGTGCCATCA ATAGCGATGT TGGCTTTCAG CGGATCGAAG TTGAGCGCCA ACAACGGCTG
CAAGATGCAA TTGAAGAATT ATTGACCAAT GGCAATGGCT TGTTGCTGCT CGATGGACTT
GATGAAGTGC AGCCCGAACA TCTGGCTGCC GTCAAACGCT GTATCGAGCA TGCTCAACGC
ATGTTTCACA AAAGCCGAAT TATTGTCACA TGTCGGGTGT TTGATTATGA GCATCCGCTG
CCACCGCCGT TTCCCTCGCG CCAGCTCTAC GATTGGCCGA CAATTCAATT GATGCCGTTT
GACCTAACCG CGCAGCATGT GTATATCACC AATTACTATA GCGAATTGGG ACGCTTGCAT
GCCTCACACG CTGATATTGT CAGCATTCGT AAAAAGCATG CCAAACTGCA TGCCGAGTTG
ACGAAATCGG GGATGTTGCA TGAACTGACA CGCACGCCGT TACTCTTGGC GCTGACCGTG
CACGTGAATA TGGTGCACAC TGATTTGCCT GAAAGCGAAG GCAAATTGCT ACACATTTGT
ATCGACGAAT TGCTTAAACG CCGTGCGCCT GAGGCCATCG CCGTTTCACT TGAAGATTTG
TATGACTTGG TTGCATTGTT AGGCTATTAT GCCCATAGTC AAGAAGAAAT TCAAGGCAAA
GCGTTGCTTA GTTTGCAGCA AATCAATATG GTGGTGCAGC GCTATTATAG CGATCGCTAT
CCGCATCCCA ATCAAATTCA TATGCTTGCG CAGGCGGTTG GCAACGCAAC CTTGCGCTTG
ATCAATAGCA ATGGCTTATT GCAAGAGGCT AGCAATAATC AGCAAGATAA TCCACATTAT
GATTTTGCCC ATCGCTTGTT TCAGCAATTT TTGGCAGGCA TGTATTTGCT CAATCAAGAG
CGCCACGACG AATGTATTGA ACGGGCCGGC AGCGAGCATT GGCGGGTTTG CTTGCACTAC
ATGGCCAATT TTGCCCCTTA CATTCAAAAG CAAAGCTTCA TTTTTGGGGT GATTCAAGAT
TTATTGGGTG GTATGCCCGA TGAACAGGTG CAAGGCTCGC ACCTGCTTTT GGCGATTGGC
AAAGCCAAAG TTGCTAGCGC TGGGCGGCGC AATTTATGGC AGCAAGCAGT CAATCATCTC
AAAAATATTG GCGGCTTGAG TGGCAGTAGC CGAGGCGTGC GCCGTTGGGC ACCACCACGC
ATCGCCTTCC CCACTCGTTT GCGGGCGGCC TTAGTGCTGG GCGCAATTGG CGATACCCGC
TTTATTGGCG AAGATGGCGC ACTCATTGCG CTGACTGAGC GGGTTGTTTG CATTCATGCT
GGCAGCGTTG AGCTGGAAGA TCCTCATTCA CGCCCCCAAA CCTATAGCGT TGAGCGCTTT
TGGATGAGCC GCTATCTTCT GACCAACTTT GAGTATAGCC AATTTATTGC TGCCCGTGGG
TATTTGGATG ATCAGTGGTG GCAGAACGAC GATGCGCGGC ATTGGCGACG CGGTGATCCA
TCGTGGCTAC ATGGTTTGCC ACCGTGGGCG GCTCCGCGCC GCTTGCCCGA TTTATGGCAT
AACGAGCGTT TCAATCACCC AACTCAGCCA ATTGTTGGAG TGAATTGGTA CGAAGCGAAT
GCCTTTTGTG CTTGGCTCAC GGTGCAACTG CGGCCTCAAC TTGATGCGCT TGGGCCAAAT
TTGGTGGTAC GTTTGCCGAG TGAGGCTGAA TGGCAACTCG CCGCCAGCCA ACATAGTGAG
CGCGAATACC CGTGGGGCGA TGAATGGCGC AACGATCATG CCAACACCAG TGAAAGCGAG
CTTGAGCAAC CAACTCCGGT TGGTATGTTT CCCTATGGCA CCTGGAGCGA TGGCCCAATG
GATTTAGCCG GCAATGTTTG CGAATGGACT AATAGCATCA ATCGCGACCC TGAACTTGAA
CCACAAACCA CGCGTAATCG CCGTTTGCAA CATCGCGATT TGATGGTCGC TGTGCGTGGT
GGTTCGTGGT TTCATAATCG CTATTTTGCG CGTTGTCGTT CGCGCTTACA GCATCGCCCA
TTTAGCCATG GCCCAAATAT TGGTGTGCGC TTGATTATCG GCCACGCCGA ATAA
 
Protein sequence
MHDVERIRQK IINLQARRAQ ESDQDLIEAL DQMIGQHQHQ LSLLAQQAAA VLEPPAPAAT 
QGQVYGSSVM TNLGHVWINQ VFNAPLSTGE RTDQLILHYL AHVQGNADRL VLGTDVQDSD
QERMSLQRVF TMLQADFYGP IAGDALLQSL LLGSSQPRPQ LQDSILKALN QIQHQRSVIL
GVPGGGKTTV VSYLASAQAA ALLDPSKAEF LHSQGWMHTK LVPVRVRLKH VQPPADPEQQ
TADAFWEVVT ELQLAGRFGQ RAINSDVGFQ RIEVERQQRL QDAIEELLTN GNGLLLLDGL
DEVQPEHLAA VKRCIEHAQR MFHKSRIIVT CRVFDYEHPL PPPFPSRQLY DWPTIQLMPF
DLTAQHVYIT NYYSELGRLH ASHADIVSIR KKHAKLHAEL TKSGMLHELT RTPLLLALTV
HVNMVHTDLP ESEGKLLHIC IDELLKRRAP EAIAVSLEDL YDLVALLGYY AHSQEEIQGK
ALLSLQQINM VVQRYYSDRY PHPNQIHMLA QAVGNATLRL INSNGLLQEA SNNQQDNPHY
DFAHRLFQQF LAGMYLLNQE RHDECIERAG SEHWRVCLHY MANFAPYIQK QSFIFGVIQD
LLGGMPDEQV QGSHLLLAIG KAKVASAGRR NLWQQAVNHL KNIGGLSGSS RGVRRWAPPR
IAFPTRLRAA LVLGAIGDTR FIGEDGALIA LTERVVCIHA GSVELEDPHS RPQTYSVERF
WMSRYLLTNF EYSQFIAARG YLDDQWWQND DARHWRRGDP SWLHGLPPWA APRRLPDLWH
NERFNHPTQP IVGVNWYEAN AFCAWLTVQL RPQLDALGPN LVVRLPSEAE WQLAASQHSE
REYPWGDEWR NDHANTSESE LEQPTPVGMF PYGTWSDGPM DLAGNVCEWT NSINRDPELE
PQTTRNRRLQ HRDLMVAVRG GSWFHNRYFA RCRSRLQHRP FSHGPNIGVR LIIGHAE