Gene Haur_2599 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2599 
Symbol 
ID5734477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3334104 
End bp3337289 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table11 
GC content49% 
IMG OID641279739 
Producthypothetical protein 
Protein accessionYP_001545365 
Protein GI159899118 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0018117 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAC CATCGAAAAT AATTGTCCTG ATCACTCTAA TGGCATTGTT GGTTGGAGCG 
TTACAAAGGC CTGTTGCTCC AGTTATTGCC AGCGAGCCGC AAATTCCAAC TAGCCCAGGC
ACATGGAGCC AAGCCTTTGC CGACCCACTG AAACTTGACC CCGGCTATAT GCCGCTGCGC
CCAATTGAAT GGAATGGCAC GTTATATGCA GGAATCGTTG GCGTAGACGG CTTTGAGCCG
GGGGTGGGCT ATTGGAGCGG TCAGCAATGG CTGAAACTTG ATGGGTTATC AGGTGAGGTC
GATTCGGTGG TCGTGCATCA AAACCGTTTG TTTGCAGCCG GACGCTTGAC GCTTGGGGGC
AACCATATCA GTATTGCTTT TTGGGATGGC AATCTGTGGA CGGCGATGCC CACCCAATTT
AGCCCCAATA TTTTTATCTT GGCCAGCCAT AACGATCAAC TCTACGTTGG TGGATATTCA
GAGCAGATTG CTGACCAAGC CTCAGGTTTA CTCTTGCGTT GGGATGACAC GCAATGGCAT
CCCGTCGCTG AAGGTATTTT CGGCGCAGTT ATGAGTATCC TCTCGCGGCC CGATGGTCTG
TATCTCGGCG GGGTCTTCCA ACTCAATGGC CAAAATACCG GCTTGATCCA TTGGAATGGA
GCACAGTGGC AGAGCGTTGG CGGGGGCGTT CAGGGGATGG TGATGGATGT TGAATGGGCC
AATGATCAAC TCTACATTAG TGGCAAATTC ACCTCGACGC TTGAACCAAC TATGCAGAAT
ATCGCTGCCT GGAATGGCAC CAGTTGGAAT ACCTTTGGCA CCGGGATTGT TAGCCCAACC
CATAATTTGG CGCTGCTTGA TGGCGACCTG TATGCCCTCA GCCAAACCGA TAGACCTTAT
CCTTATCAAA CGATTTATCA GCTCCAACGC TGGGATGCGA CTCACTGGAC GACACTCTCC
AATTTAAGTG AAACAAGTAG CGTTCTTAAT TGGTCGCGCT ATCCTGATGT TGTGCTCGTC
AATTATCAGC AAGAATTATT GGCATTTGGG CCAATAGGCT TTGTGGATCG CAATGTGCAA
ACACTGAGAT GGGGTGATTC GGCCTTGCGC TGGAAGGGTA ATTCTTGGGA AGCCATGACT
CCGAATGGAA TTTCTGCAAT GAAACTCGCA TTAGCCGTTG ATGGTGAGGA TGTCTACGCA
GCCTCTGGGC GGATGACTTG GGGCAATGGA CAAGCGAGTT TAGCTCATTT GTCGCCAAAT
AACCAATGGC AATTGCTGAT AGCCTATGAC TCCCAGCAGC CACAATATGC ACAAGCGCTC
CAAAAGTATC AGCAGAACTT TTTTAGTATT TACAATAGCA CTCTATATCA AGCGGTTAAC
AATGCTTGGA ATCAAGCCAG TCCTGCGACA GTGGAAAGTT TGGCTCAAGC CAATGATTTG
TTGTATGTTG CTGGCGATTT TGAGCAATTC AATGGGGTTA CAGCGCGTAA TCTGGTGACC
TGGAATGGCA CACAGTGGCA AGCCTTGAAT ACGCCTGCCT CATTTGATCG GGTCGTTATT
GTTGAAGCCC ATGGCGATGA TGTCTATATT AGCGATGGCT TTCAATTGGC CCACTGGAAT
GGCAGCCAAT GGACAACCCT CGCCACTAAT GTGGTCAATA TTGGTTCCAT TGAGCCAACC
GCCAATGGGG TCTATATCGC TGGCACATTT AGCAGTGTTG GTGGCATAGC CACACCAAAA
ATTGCCTATT GGAATGGCAC GGCTTGGTCG GGTTTAACAG GCGAGATCGA TGGTTCAATC
TACGATCTCG AAATGGGAGC CGATGGCTTG TACGTGGCTG GATGGTTCCG AGGCATTATC
AATGGTATTT ATAGCCCAGG CATTCTACGC TGGGATGGCA CTACATGGCA TGGGCTTGGC
GGTGGGGTGA AGTCCAGTGC AACACCAAAT CAACCAGGTG CTGTGACGCT GCTTGCAGCA
ACCCCAACCC GCATGCTGCT GTATGGGTCT TTTGATCGGG TGGGAAATAC CTACGAATCC
AAACAAATTG CAGCGTGGGA GTATGGCAAC GAACCGTTGA TTAAGGCCAA ATCGGATTAT
GGCCTTACCT ATCGTCCGCA GTCAGTTACG GTGAATGTGC TGGCGAATGA TTGGAGCGAT
CAGCCAAATC AATTGCAATT GGTGAGTGTG AGCAGCCCAA GCCATGGCAC GGCTGTGATT
AACGGTAACT CGGTTGTGTA TAGGCCGGAA GCACAATTTG AAGGCGTTGA AACCTTGACC
TATGTTGTGC GCGACCCAAT CAATGCTGTC ACCAGCACAG CGCAACTCCA GGTGCATGTC
TGGAATCACT TCCCAAGCAT TGCTGATCAG GAACAAGCGG TCTATCCATT TACTGAAACG
CTGCTTGACC CATTGGATGG CCTGATTGAT TTGAATGGCG ATAGCTTGAC GATCACCCAA
GCCAGTGCGG TCAGCGGCAC GGTGACGATT GTCAATAATC AATTGCGCTA CATGCCGCCG
AATCAACACC ATTTTACCGA TGTGGTGACG TATAGGGTAA GTGATGGTCA TGGGGGGCAA
CAAAGCGCCC GGATCAACAT CCATAGCATT GATACAATCG TGACTGCAAC CGCTGATTAT
GCAACAACCT ATCGTCCGTA TTCAGTCAGG GTTGATGTGA TTGCCAATGA CTGGACGATT
AATGGAGAAC CCTTAGCGGT GGTGGCAGTT GACGCAGCCA TTCATGGCAC AGCAACGATT
AGTGGCAACC AAGTACATTA TATTCCTGCG GAAACCTTTC AAGGTACTGA AACCTTAACC
TATACCGTGC GCAATCAAAC CCGTGGCATA ACGGCAACCG CAACCTTGAC GATTGAGGTA
CAAAATCATG TGCCCACTGT TGCTCCCATA ACGATTACCG TTCAGCCTAA TAGCATCACA
ACGCTGAATG TAATGGCAAA TGCGGTCGAT CTAAACGGCG ACCAATTAAC CATTACGCAA
GCAAGCACCA CAGCTGGCAC GGTAGCGGTG GTTAATAATC GATTGCGCTA TACCGCGCCA
AATTCTTATC CATTTGTTGC GACAATCAGC TATACCATCA ACGATGGTCA TGGTGGTTCG
CAGGTTGGGA CAATTGTAGT CAATAGCGTA AAGTATCACT TATTCTTGCC CTATACCATC
AAATAA
 
Protein sequence
MNKPSKIIVL ITLMALLVGA LQRPVAPVIA SEPQIPTSPG TWSQAFADPL KLDPGYMPLR 
PIEWNGTLYA GIVGVDGFEP GVGYWSGQQW LKLDGLSGEV DSVVVHQNRL FAAGRLTLGG
NHISIAFWDG NLWTAMPTQF SPNIFILASH NDQLYVGGYS EQIADQASGL LLRWDDTQWH
PVAEGIFGAV MSILSRPDGL YLGGVFQLNG QNTGLIHWNG AQWQSVGGGV QGMVMDVEWA
NDQLYISGKF TSTLEPTMQN IAAWNGTSWN TFGTGIVSPT HNLALLDGDL YALSQTDRPY
PYQTIYQLQR WDATHWTTLS NLSETSSVLN WSRYPDVVLV NYQQELLAFG PIGFVDRNVQ
TLRWGDSALR WKGNSWEAMT PNGISAMKLA LAVDGEDVYA ASGRMTWGNG QASLAHLSPN
NQWQLLIAYD SQQPQYAQAL QKYQQNFFSI YNSTLYQAVN NAWNQASPAT VESLAQANDL
LYVAGDFEQF NGVTARNLVT WNGTQWQALN TPASFDRVVI VEAHGDDVYI SDGFQLAHWN
GSQWTTLATN VVNIGSIEPT ANGVYIAGTF SSVGGIATPK IAYWNGTAWS GLTGEIDGSI
YDLEMGADGL YVAGWFRGII NGIYSPGILR WDGTTWHGLG GGVKSSATPN QPGAVTLLAA
TPTRMLLYGS FDRVGNTYES KQIAAWEYGN EPLIKAKSDY GLTYRPQSVT VNVLANDWSD
QPNQLQLVSV SSPSHGTAVI NGNSVVYRPE AQFEGVETLT YVVRDPINAV TSTAQLQVHV
WNHFPSIADQ EQAVYPFTET LLDPLDGLID LNGDSLTITQ ASAVSGTVTI VNNQLRYMPP
NQHHFTDVVT YRVSDGHGGQ QSARINIHSI DTIVTATADY ATTYRPYSVR VDVIANDWTI
NGEPLAVVAV DAAIHGTATI SGNQVHYIPA ETFQGTETLT YTVRNQTRGI TATATLTIEV
QNHVPTVAPI TITVQPNSIT TLNVMANAVD LNGDQLTITQ ASTTAGTVAV VNNRLRYTAP
NSYPFVATIS YTINDGHGGS QVGTIVVNSV KYHLFLPYTI K