Gene Haur_0022 details

Gene Information

Locus tag: Haur_0022
Symbol:
ID: 5736856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 26163
End bp: 28562
Gene Length: 2400 bp
Protein Length: 799 aa
Translation table: 11
GC content: 51%
IMG OID: 641277143
Product: hypothetical protein
Protein accession: YP_001542802
Protein GI: 159896555
COG category:
COG ID:
TIGRFAM ID:
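
The coordinates and lengths in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 26163
end_bp = 28562
protein_length_aa = 799


def gene_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(length_bp):
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, expected_protein_length(length_bp))  # prints "2400 799"
```

Under those assumptions, both values agree with the reported Gene Length (2400 bp) and Protein Length (799 aa).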


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTCGC AGCAACAGTT CACCGCTGCC TCGCATTTGA CTGCCGATAA TTTGCGTAGT 
ATTGCGCATT CGATCTCGAT CAGTCAGGTG TCGCGGTTGT CGTTGCCGGA GATCGATGCC
GTTGTTGACC AAATCTCGCG GGTCGTGCCT GCTGGTAATG TCCCCGGGGT GATTTTAAGT
GGCTTGGCGA AGTTGACGGG CCGTCGTCCA GCAGGCAATG TCATTAAGCG CGATGTAAAT
TTGCTGTTTC GTGGGGTTGA GCAAGCCCTC GATAAAGCGG TGTTTAGCAC GTTCTTTGCT
GGGCCGGCTG CGGTTATTTG GGGTTACCAA AAACTGCTCG AATTGGCTGG CAAAGATCCG
CAAGATGCGT TTCCGGAAGG CACATGGCAG TTTTATGTGG GCTATGCCTT GCGCGAAGAT
ACTGCCCGCC ATGCCAACGA AACGATTGGC TTCGATGAAA CCCTGAACGA TCATAAAATT
AATTTGCCAC CAATCGACCG CATGACCGCT TGGGTTATGG CGGCGATTCA TATTCTGCAT
AGTTACCCCG ATTTGCTCGA AAACGAATGG CGTGAGCGCG TTTCGTTGGC CTTGCTGCGC
GATCTGACCA AGGTTAATCC TGAAACTCGT CAGTTTGCCG ATTTATATAA TCAATGGGAG
CGCCAACGCC CTTATGGCCG TGGCCCCGAT GTTCAATCGC AAGAAAATTA TGCGCTGTAT
CGCAAACGCA AATTCGATGA ATTTATGGCC GAATCTACCC GCGATTTGCC CAAAGAAATT
CGCGAACGTT GGGGCAAACA GTTTCAGCGT GCTCGCGAAA TCGCCTTGCC AGCCTATCAA
CGCCAAATGG CCCTGGCAGC CTACCTTGAT CCCACACCCT ACAACGAAAA TATGGTAGCT
TTGCCACGCC AAAGTTGGCA TATTGGCTTA ATTTGGCGCG GCCATTATTA TTTGATTCCT
GCTTGTGCGC CCAATAGCAC TCGACCCAAT GATGTAAGCA GTGTGCGCAG CCAAATTGCT
GCCTTGTTAG CCAGCCCAGC CAACCATGCC CCAACTTCAT TAATTCCTTT GGCAACCACC
AAACGCACAA TCTTGCCCAG CATTTTGGGT AAGTTGCGGC CTGAAACCAG CCAACAACTT
GAGGCCTTGC GTTGTGCACC AATTTGGTTT AATGCTGATG GCCGACCGCG CCATTTGCCC
TTAGCCGAGT TGCGCCTGAC CGAACGTGGA CTAGGCGACC ACGGCCTGAC CTTGATCGAT
ACTGGCTCAA GTATGGTCTT TGATCAATCG CATATTTTCT TCGATGGCGC TTGGGGTTCA
GCCGTCGCCG AAATTATGAC CCTCGAAGCC TTGGCTTGGG CGGTCTATTT GCGTGGCCAA
CCAGCGCCAG TCGCGGGTAC GGTACGCCCC TATGCACCCA ATATTGAACT CAACGACGAA
GAAAAACAGA TTCTCGCTGA TAGCCCCAAG ATTGTGGCTG AAGCCAGCGC CGAATCAATC
GGGGTTGATC TGAAGAAGAT TTTGGAATTG CGCAAGTTGT TCAAGCAGCG TAACGACCAA
ATTCGAATTA CAGTTAACGA TATTTTGGTG CTCTATCGGG CAATTCATGC GGTTTCCTAC
AAGCCTAATC CAGAGTTGCA AGCGTCATTG CAAGAAGCCT CCAACGATGC CAATCTCAAA
GCAGCGGTCG AAGCAACGAT CACGGCGTTC GAGGAATCGC TGGCTAATCC GGCAATTCTC
ATTCCGGTTG ATGGCAGCAT TCCCAATCCC AGCGATCGAC TGCACCCCAT GACCTTCGAG
GTTCCGCTGG AAGAGCTGGA AATTAGCAAA TTGCATGAAC GCGCATTAAG TTTGCTCGAT
CAAGCGCGGC AAGAGTGGCG AGCGGAAGTC TGGGATACCT TTGAGGCAAC CCAAAAGCAT
TATTTGGCGA CGATTGCTGG CTTTGGCGAG GTTTCAGCCC GCGCCAAAGA TATTGCCCAA
TCGGGCGAAA GCACTTCGTC AGGAGCCTTG CGCTTGTTGG CCCACGTACC GATGGCTTTG
CAACGCTTGC TCGATGCGAT TCCTGGTAAG TTCGATGTGC TCAACGATTT GATCAAAGGC
CGCGAGGTGC TTTCAAATGT GGGCGCGGTG GCCGATACCA GCTCATTAAC CCGTTTTATC
ACTGCCAAAG ACGATAACGA GAAAAAAACC TTGGCTTGGG GTGTTATCAC CGATGCCAAT
GGGGTAATGC ACGTTTCGCT ACGCGACTTT CGCCCACATG TTGGCTTGTT TGTGGCTGCT
GGACGACGCG ATTTGGCCCG CCGGATCGCC AACGATTATC TTGAAAGCTA TGTTAATGGT
CTCAATCGCT TTATCAGTGA ATTGACCAAA ATTACCCAAG GCCGCCATAG TCGCCAATAA
 
Protein sequence
MDSQQQFTAA SHLTADNLRS IAHSISISQV SRLSLPEIDA VVDQISRVVP AGNVPGVILS 
GLAKLTGRRP AGNVIKRDVN LLFRGVEQAL DKAVFSTFFA GPAAVIWGYQ KLLELAGKDP
QDAFPEGTWQ FYVGYALRED TARHANETIG FDETLNDHKI NLPPIDRMTA WVMAAIHILH
SYPDLLENEW RERVSLALLR DLTKVNPETR QFADLYNQWE RQRPYGRGPD VQSQENYALY
RKRKFDEFMA ESTRDLPKEI RERWGKQFQR AREIALPAYQ RQMALAAYLD PTPYNENMVA
LPRQSWHIGL IWRGHYYLIP ACAPNSTRPN DVSSVRSQIA ALLASPANHA PTSLIPLATT
KRTILPSILG KLRPETSQQL EALRCAPIWF NADGRPRHLP LAELRLTERG LGDHGLTLID
TGSSMVFDQS HIFFDGAWGS AVAEIMTLEA LAWAVYLRGQ PAPVAGTVRP YAPNIELNDE
EKQILADSPK IVAEASAESI GVDLKKILEL RKLFKQRNDQ IRITVNDILV LYRAIHAVSY
KPNPELQASL QEASNDANLK AAVEATITAF EESLANPAIL IPVDGSIPNP SDRLHPMTFE
VPLEELEISK LHERALSLLD QARQEWRAEV WDTFEATQKH YLATIAGFGE VSARAKDIAQ
SGESTSSGAL RLLAHVPMAL QRLLDAIPGK FDVLNDLIKG REVLSNVGAV ADTSSLTRFI
TAKDDNEKKT LAWGVITDAN GVMHVSLRDF RPHVGLFVAA GRRDLARRIA NDYLESYVNG
LNRFISELTK ITQGRHSRQ
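
Both sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any computation. The following is a minimal sketch, assuming the text is pasted in as a plain string. The helper names `clean_sequence` and `gc_fraction` are hypothetical. Only the first and last lines of the gene sequence are used here, so the GC value it computes is for that excerpt and is not expected to match the reported 51% for the full gene.

```python
def clean_sequence(text):
    """Remove the block spacing and line breaks, and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGATTCGC AGCAACAGTT CACCGCTGCC TCGCATTTGA CTGCCGATAA TTTGCGTAGT"
last_line = "CTCAATCGCT TTATCAGTGA ATTGACCAAA ATTACCCAAG GCCGCCATAG TCGCCAATAA"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

# Stop codons in translation table 11 (the bacterial code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

print(head[:3] == "ATG")          # the CDS opens with an ATG start codon
print(tail[-3:] in STOP_CODONS)   # and closes with a stop codon
print(round(gc_fraction(head + tail), 2))  # GC fraction of the excerpt only
```

Pasting the complete gene sequence into a single string and passing it through `clean_sequence` would allow the same functions to be applied to the full 2400 bp.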