Gene Haur_1855 details

Gene Information

Locus tag: Haur_1855
Symbol:
ID: 5733744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2156362
End bp: 2158857
Gene Length: 2496 bp
Protein Length: 831 aa
Translation table: 11
GC content: 50%
IMG OID: 641278999
Product: XRE family transcriptional regulator
Protein accession: YP_001544626
Protein GI: 159898379
COG category: [R] General function prediction only
COG ID: [COG3903] Predicted ATPase
TIGRFAM ID:
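
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a multiple of three and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(2156362, 2158857)
print(length)                     # 2496
print(protein_length_aa(length))  # 831
```

Under these assumptions the computed values agree with the listed gene length (2496 bp) and protein length (831 aa).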


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCAGC GCCAAGCGTT TGCTGCATGG CTCCGTCATG TACGCCATGA ATTACAATAT 
AGTCAAGACC AATTTGCCGA ACAACTTAAC TATGCCACGG TGACCTACCG CAAGGTTGAG
CGTGGTTTAG CACCATCGGC AGCATTTTTA GAGCGGTTGG CCATGGTGCT TGAGTTACCA
GCCAGCGATA TGCGGATCTT GCACGACTTC GCCAACTCCG ACACCGCGCT GCAAGCACTC
TCGTTGCCAG ATCATCTGGA GCATCTTCAG CCGCAGGTAG CCCAGGCCAA GCCAGCAATT
GCCGCTACGC CGCATACCTT GCCAATGTTG CCCTACCCGT TAATTGGCCG TGAACGCGAG
GTTGAAACGC TTACCAAGTT GTTACAGCAC CCGCAACATC GCTTGATTAC CGTGATCGGC
CCGCCTGGGG TTGGCAAAAC GCGGGTCGCT CAGGCCGTTG GTTGGGCCAG CCTCGGCCAT
TTTTGCGATG GAATTTGGTA TGTTGAAGGC ATTCAATGCA CCACAATTGC CGATTTTTGG
GTCGATATTG CCAATATGCT CGGGCGTTCG GCCAATAGCT CCATGACCTT AATCGAGCAA
ATTAGTGCAT TAATCGGCCA AAAAAATAGC TTGCTGATTT TAGATAATTG CGAGCATTTG
AGCGAAATTA ATCTTGGTTT AGCCCAGTTA TTGGCGCAAT GCAGCGGCTT GAAAATCTTG
GTAACCTCAC GGACAAGCCT AAAACTCCGG ATCGAACATC TTTTTTGGCT GCATCCTTTT
CCTACGCCCG ACCCGCAAAG CAGCAATCTG AGTGCGATTT GGCAGAACCC AGCGGTACAA
CTTTTTTGCC AGCGTGCCCA AGCTAGTAAC CATGAGTGGC AAATCAACGA TAGCCAAGCA
GCAACCATCG CCCAAATTTG CCAACATCTT GATGGCTTGC CCTTGGTGAT CGAATTGGCG
GCAGTACGCA CCCAATTTGT TACGCCAACC ACGCTACTGG CACGGTTAAG CAATCGATTA
GGCATCTTAA CCAACACCAT GCGCGATGCT CCGGCGCATC AAAGCACCCT GCGACGCACA
CTTGAATGGA GCTATCAACT GCTTGATAGC AACGAACAAC AGATCTTTGC GCGGCTCAGT
GTTTTTGCAA CCGATAGCGA TTTCGAGGCG ATTGTGGCGG TTTGCGCCGA TTTGGCACCA
TCGAACGATG ATATTTTTGA TTGTATGGCC AGCCTCGTTG CCAAAAGTCT GGTTATTCAT
CGACCCGACC CTCAAGGCAA TTCACGCTTT GGCATGTTGG CGACAATTCG TGAGTTTGCG
GCGAGTTTGT TAGCTGAGCA ACAACAAACG CATCATTATA CCCAACGCTA TATTAATTAT
TACATTGAAC TCGCCGAAAA AATTGATCGC GAACTGCGCG GCAAAGAGCA AATTCAGCTG
CTCGAGCAGC TTGAGTCGGA GTTTCATCAT TGGCAAGCGG TTTTACGTTT ATGCCTCAAT
CAACAACAGT ATCATGGCTT TTTACGGCTG TTTGCAGCAC TGAGCCAATT TTGGTATGGT
CATGGCCATT TTATGGAGGC TTGGCAATGG CTCAGCGCAG TCGATCAAGC GTTAAATCAC
GTCGATTCGC CCATTATTCA AGCGCGGGCG GCTCTAGGCG CAGGCATTGT AACCAATATT
CATCATTGCC TTGATCTCCC GTTGGGCTAT CTCGAACGCG CCCTCGATTT ATGTCAACAA
TTGAATGATC AACAGGGGAT TGCCACATGT TACTTGTTGC TTGGGTTGAT TATGATGCGC
AAACATCAAT ATGTGCAAGC AACCCGCTGG CTCAACCAAA GCCTTAATTA TTTTGAGTCG
AGCGTTGAGT ATTGGCTTTT GAGCATTAAT CATTTGCTGC TGGCTCAATT AAATATCTAT
CTCAACGATC TTGATCAAGC TAGCCGCTAC CTTGATTTGG TGGGGCATTC GCCACAACTC
CGGCTTGATC CCTTCCGATC ATCGTGGTAT CAATCATTAC AAGGCCATGT TGCCTTCTAC
AAACGCTGCT ACACCGAAGC ACTGACGTGG CATCAACAGA GTTTGGTCGA GCGCCAGCAG
CTCGGCATCA AAGGTGATAT TGCGGTTTCT TGGCTGCGGA TTGCTCAAAC CGAGCGAGCT
TTAGGCCACT ATCAACCAAC CCGCAACGCC CTCGAACAAA GCCTCAAGCT CTGGCAAATG
CACGACAATC AAGAAAACGT CTTGCATTGT TTAGAAGAAT TTGCCGCGCT ACTGGCCTAT
GATCAACAGC ATCAGACCGC CACCTATCTG CTGAGCTATG CATGGTTTCA ACGTGAACAA
CGTCAATTGC CCCATCCACC AATCGATCAG GCGCGATCGC AGCAATTTGG CATGTGGCTG
CAAAACCAAC AACCAAGCAA TGTCTGGCGC GAAGCTTGGA GTTACGGCCA AACACTCAAA
CTTGATCAAG TGATTGGCTT TGTGCTCGCG GGGTAG
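
The gene sequence above is laid out in blocks of ten bases separated by spaces and line breaks. A minimal sketch of how one might strip that layout and compute GC content follows; the helper names are hypothetical, and the example runs on a short fragment (the first two blocks of the gene) rather than the full sequence.

```python
def clean_sequence(raw):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(sequence):
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc / len(sequence)


# First two blocks of the gene sequence, in the page's block layout.
fragment = clean_sequence("ATGAAGCAGC GCCAAGCGTT \n")
print(fragment)       # ATGAAGCAGCGCCAAGCGTT
print(len(fragment))  # 20
```

Applying `gc_percent` to the full cleaned gene sequence should give a value comparable to the GC content reported in the record, which is listed there as a rounded percentage.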
 
Protein sequence
MKQRQAFAAW LRHVRHELQY SQDQFAEQLN YATVTYRKVE RGLAPSAAFL ERLAMVLELP 
ASDMRILHDF ANSDTALQAL SLPDHLEHLQ PQVAQAKPAI AATPHTLPML PYPLIGRERE
VETLTKLLQH PQHRLITVIG PPGVGKTRVA QAVGWASLGH FCDGIWYVEG IQCTTIADFW
VDIANMLGRS ANSSMTLIEQ ISALIGQKNS LLILDNCEHL SEINLGLAQL LAQCSGLKIL
VTSRTSLKLR IEHLFWLHPF PTPDPQSSNL SAIWQNPAVQ LFCQRAQASN HEWQINDSQA
ATIAQICQHL DGLPLVIELA AVRTQFVTPT TLLARLSNRL GILTNTMRDA PAHQSTLRRT
LEWSYQLLDS NEQQIFARLS VFATDSDFEA IVAVCADLAP SNDDIFDCMA SLVAKSLVIH
RPDPQGNSRF GMLATIREFA ASLLAEQQQT HHYTQRYINY YIELAEKIDR ELRGKEQIQL
LEQLESEFHH WQAVLRLCLN QQQYHGFLRL FAALSQFWYG HGHFMEAWQW LSAVDQALNH
VDSPIIQARA ALGAGIVTNI HHCLDLPLGY LERALDLCQQ LNDQQGIATC YLLLGLIMMR
KHQYVQATRW LNQSLNYFES SVEYWLLSIN HLLLAQLNIY LNDLDQASRY LDLVGHSPQL
RLDPFRSSWY QSLQGHVAFY KRCYTEALTW HQQSLVERQQ LGIKGDIAVS WLRIAQTERA
LGHYQPTRNA LEQSLKLWQM HDNQENVLHC LEEFAALLAY DQQHQTATYL LSYAWFQREQ
RQLPHPPIDQ ARSQQFGMWL QNQQPSNVWR EAWSYGQTLK LDQVIGFVLA G
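
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is a simplified, hypothetical translator, not the tool used to produce this record: it relies on the fact that table 11 assigns amino acids to codons the same way the standard genetic code does, and it ignores table 11's alternative start codons, which is harmless here because the gene begins with ATG.

```python
# Codon table built from the compact NCBI-style layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(raw_sequence):
    """Translate a block-formatted CDS, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    dna = "".join(raw_sequence.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
print(translate_cds("ATGAAGCAGC GCCAAGCGTT TGCTGCATGG"))  # MKQRQAFAAW
```

The output matches the first ten residues of the protein sequence listed above.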