Gene Haur_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1041 
Symbol 
ID5732945 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1187599 
End bp1189644 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content52% 
IMG OID641278176 
Producthypothetical protein 
Protein accessionYP_001543817 
Protein GI159897570 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATC GTTTAATTCA TGAAACTAGC CCTTACTTGT TGCAACACGC CGAAAACCCT 
GTCGATTGGT ATGCTTGGGG TGAAGAGGCT TTGCAACGCG CCAAACAAGA TGATAAACCA
ATTTTATTAA GTGTTGGCTA TAGCGCATGC CATTGGTGTC ACGTTATGGC TCATGAATCG
TTTGAAGATC CAGCGACTGC TGCGGTTATG AACGAATTAT TTGTCAATAT CAAGGTTGAT
CGTGAGGAAC GACCTGATAT TGATTCACTG TATATGGCCG CTGTGCAAGC CATGACTCGC
CATGGCGGCT GGCCGATGAC GGTCTTTTTA ACGCCTGATG GCGCACCGTT TTATGGTGGG
ACCTACTTCC CCCCCGAGCC GCGCCACAAT ATGCCTTCGT TCCAACAGGT GCTACATGGC
GTGGCCGAAG CTTACCGCGA CCGTCGCGAA GAAGTGTTTC AGAGCGCCGA GCAGATGCGC
GAGCATTTAG AAGATATTTT GAGCTTCGAT CTTGAGCAGG TGAAGCTGAG CAAAAGCCAA
TTGAATGTGG CTGCTCAACG CCAAATGAGC CAATTCGATT CGCGCTTTGG TGGCTATGGC
GGTGCGCCGA AATTTCCGCA AGCCTTGATT TTTGGCATGG TTTTGCGTAC ATGGCTGCGC
AGCGAGGATC AAGATGCGCT TAATCAAGTG ACCCAAACCT TGCAAGCCAT GGCCAACGGT
GGCATGTACG ATCAGCTTGG CGGTGGCTTT GCACGATATT CGGTCGATGC TCAGTGGCTC
GTGCCGCACT TCGAGAAAAT GCTCTACGAT AATGCTTTGC TCAGCCAGCT CTATCTCGAA
ACCTACCAAG CCACCCACGA TCCGTTTTAT CGCCGAATTG CTGAGGAAAG CATCAACTAC
ATTTTGCGCG ATATGACGAG TCCCGATGGC GGTTTTTATG CTGCCGAAGA TGCTGATAGC
GAAGGCGAAG AGGGCAAGTT TTATGTTTGG AGCTTAGCTG AAATTCAGCA ATTGCTCAGC
CCTGAGGATG CGGCCCTTGC CCAGTTGTAT TGGAATATTC AGCCCGAAGG CAATTTTGAG
GGCCATGCGA TTTTGTATGT GCCCCAAGAT CCCAGTGTGG TTGCCAAAGA GTTGAGCATT
AGCGAGGCAG ATTTGGCCCA GCGGATTGCC GTAATTCGTG CTACGCTCTT GGCCCAGCGT
AATACCCGCA TTCGCCCAGG CCGCGATGAA AAGATTTTGG CCTCGTGGAA TGGCATGATG
CTGCGCAGTT TGGCCTTTGC TGCCAATGTG CTCGATAACG CCGATTATCG CGCTGCGGCG
ATTCGCAACG CTGAATTTAT TACCAGCAAG CTGTATCAAA ACGGCCAACT GTATCGCTCC
TATAAAGATG GTCAAGCCAA ATTCAAGGGT TACCTCGAAG ATTATGCCTG TGTTGCCGAT
GGAATGCTGG CCTTGTACGA GGCAACGTTT GATCTGCGCT GGTTGCAAGT GGCGATTGAA
TTGGCCGAAA GCATGACTGA GCGCTTCTGG GATGCGCAAC AACGCAGCTT TTTCGATACG
GCCAGCGATC ATGAACAGTT GATCACACGG CCCCGCGACC TTTACGACAA TGCTACGCCT
GCCGGTAATT CGGTGGCGGT TGATGTGTTG CTGCGTTTGG CAACCCTGCT TGATCGCTAC
GAATATCGCC AATATGCTGA AACGGTGTTG GCGAATTTGA GCGGTGCGTT GCTCCAACTG
CCTGGGGCAT TTGGGCGCTT GCTGGCTGCC GCCGATTTTG CGCTTGCTGA GCCACGCGAA
GTTGCCTTAA TTGGCGATCC AGCTGATCCT GCGTTCAAAG CGTTGTTGCA AGCGACCTAT
CGCAACTACC AGCCCAACAA AGTCGTGGCT GCTTGCAAGC CCGATGATCA CGCGGCTCAG
CAGCTAATTC CATTGTTGGC TGAACGACCG TTGCTCAACC AACAAGCCAC GGCGTATGTG
TGTGTGCGGC GGGCGTGCAA GTTGCCAACC AACGATCCAA ATGAATTAAT CAAACAATTA
GGCTAA
 
Protein sequence
MANRLIHETS PYLLQHAENP VDWYAWGEEA LQRAKQDDKP ILLSVGYSAC HWCHVMAHES 
FEDPATAAVM NELFVNIKVD REERPDIDSL YMAAVQAMTR HGGWPMTVFL TPDGAPFYGG
TYFPPEPRHN MPSFQQVLHG VAEAYRDRRE EVFQSAEQMR EHLEDILSFD LEQVKLSKSQ
LNVAAQRQMS QFDSRFGGYG GAPKFPQALI FGMVLRTWLR SEDQDALNQV TQTLQAMANG
GMYDQLGGGF ARYSVDAQWL VPHFEKMLYD NALLSQLYLE TYQATHDPFY RRIAEESINY
ILRDMTSPDG GFYAAEDADS EGEEGKFYVW SLAEIQQLLS PEDAALAQLY WNIQPEGNFE
GHAILYVPQD PSVVAKELSI SEADLAQRIA VIRATLLAQR NTRIRPGRDE KILASWNGMM
LRSLAFAANV LDNADYRAAA IRNAEFITSK LYQNGQLYRS YKDGQAKFKG YLEDYACVAD
GMLALYEATF DLRWLQVAIE LAESMTERFW DAQQRSFFDT ASDHEQLITR PRDLYDNATP
AGNSVAVDVL LRLATLLDRY EYRQYAETVL ANLSGALLQL PGAFGRLLAA ADFALAEPRE
VALIGDPADP AFKALLQATY RNYQPNKVVA ACKPDDHAAQ QLIPLLAERP LLNQQATAYV
CVRRACKLPT NDPNELIKQL G