Gene Haur_4398 details


Gene Information

Locus tag: Haur_4398
Symbol:
ID: 5736248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5619026
End bp: 5622307
Gene Length: 3282 bp
Protein Length: 1093 aa
Translation table: 11
GC content: 44%
IMG OID: 641281560
Product: M protein-like MukB domain-containing protein
Protein accession: YP_001547158
Protein GI: 159900911
COG category: [S] Function unknown
COG ID: [COG4913] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
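
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5619026
end_bp = 5622307

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 3282
print(protein_length_aa)  # 1093
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.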


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAAGC TTAGTCGCAT TTTTCTGTAT CATTGGCATC GCTTCGATTG GCATGTGTTG 
GATGTGCAGG ATAGTTTGTA TTTGGCGGGC CACAACGGTT CAGGCAAATC GTCGATTCTT
GATGCCTTGC AATTGGTGCT GGTGGCCGAT TTGGGTCGCG TGCGTTTCAA CAGCGCGGCC
CAAGATCGCT CGCAACGCTC GTTGGATAGC TATGTGCGCG GCAAAATTGG CGAGCAACAT
TGGCTGCGGC CTGGCGACAC GATTGGTTAT GTGGTGCTGG AGTGGACTGA AGATCTGAAG
CATCAAGCCT TTGTTAGCGG GGTTTGCTTG GAAGCTCGTG CCGCGACCCA AAGCGTTGAA
AAAACCTTTT TTATCTTTGA TGGCCACTTA AATCCTGATG TATTTATTGA GCAAGGCAAA
CCACGCACCC GCCGCGAAAT CAAAGCGATG GCGCGTAATC GCTCAACTGC CGAAAGTTTC
GATCAAACTG GCGAATATCA AGCGGCGTTG CTCAATCGGT TGGGCGGCTT GAATCAACGC
TTTTTCGATT TATTTCTGCG TGCCTTGACC TTCCAGCCAA TTCGTAATAT TCGTGAGTTT
GTTGAGCAAT GGCTGCTGGA GCCAAATTCG TTGGATGTTC ACACCTTGCA AAATGTGGTT
GAACGCTTAC GCGATTTGGA GCGAAAAGCT AAAGATGTTG AAGAACAATT AAAAAGCCTC
GGTGCTATTA TCAAAACCCA AGAAGAAGCC AAGCGCTTGC GTGGCCTGCA TGGGAAATTT
GAGATTCTGG CGGCTCACTA TAGCGTGGCG GCAATTCAGC AACAAATTAA TCAACAGCAA
CAACGATTAA TCGATACTCG CCGCGATTAC GAATTGAATC AGACTGATTT GGAACAACAA
CTTGGCTTAC AGCAAAATGC CCAAGCTGCC TTAGTTGAAG CCCAAGTTCA ATTAAGCCAA
TCTGATGTTG TGCGGCGTAA AAACCAACTT GCCGCTGAAA TTCAGCAATT GCAGCAACAC
ATTCAAAAAA TTAACCAACG TTGGCGACAA CTACAAGCCA AATTAATCCA AATTGCGGCA
TTATTGAATA AATTACAGCC AATTGCTGAG CAAGATTTGG TTAATTTATT GCAGCAGCCT
GAGCACATTC CCCAATCACA GCAAATTGTG CCTTTATTAC AACAACAGAG CACGGCGGCT
GGCTCTGCTT TGAAATTGCA ACTCCAGGCG ATGACCAGAA CCGACGATAA AATCTCGCGC
TTATTTGAAC AAGAAAGCCG CCTAAAACAA GAAATTACCC AGCTTGAGCA ATCGAATCGC
CAAAATCATT ATCCCCAAAA TGTGGAAAAT GTGCGTAAGT ACTTAAAATC AGCATTAAAT
ATTGAACTCT TTTTGCTTTG TGAGCTGTTA GAAATTCCTG ATGAACATTG GCAAAATGCG
GTTGAGGCAA TGCTTGGTCA ACGCCGTTTC AATATTATTG TCGAACCTAA ATATTATAGC
CAAGCCCTTG ATTTGCTTGA TCAGTTACGC GAAAAAGAGC GGATCTATGA TGTTGGTTTG
GTTGATTTAC AACAAGCTAG TGCAGAAGCT CGCCCAGCAC GACCATTATC ATTGGCGACC
AAAGTCAAAG CCAAAACTGG CTTGATTGAT AAATATATTG CGGCAATTTT AGGTGATATT
ATTACTTGTC AGCATGTGCA AGATTTGCGT CAACATCGCC GTGCGATTAC CGCTGAAGTG
ATGGTTTATC AAGAATGGAC AGCGCGTTCG ATTGATCCAC GGAAATTTCA GCCTTGGTTT
ATCGGTGAAC GCGCCCAACG CTCGCAAATT GAAAGCCGCC GCCAGCAATT AACCGAAATC
CAAGCCGAAT TGGCGATATT ACGCCCAGAA CAACAACGCC AGCGCCAATA TTATGATCAA
TTGGAAAAAT TACAACAACT TTTATTAGGC TTAGATTCGT ATTTCGAGGA AGAACTTGAT
TCAAGCGAAT TACAAACGAC ACTTGCTGAA TTGCAACGCG AACACGATAG CATTGATACT
AGCGGCGTGG CGGCACTTGA GGCCGAAGTT AAACGTTTAC ACATAATTCA TGAAGAATAT
GCCTTGAATA TTCGCCGTTT AGAGCGGATT ATTGGCACAC TGAATAATCA AATTACTCAG
ATTGAACAAC AACTTCAAGA AAACCAACAT CAATTGTTGA ATGCTCAAAG CCTGTTTGAG
CATACGCAGC AACGCTATCC CGACCAAATT GACGATGCAC TCAACGATTT TCAGCGCGAG
AACAATGAGC AAGCGCAATT TAGCCGTTTG CAAGAAACCG CCGAGCAACA AGGTCGTGGC
TATAATACCC GTTTTAACAA TACCCAAGAA CAATTACGCG AACAAGTTAT ACTCTATAAT
CGCGATTTTC GAGCGCGTGA ACTAGCCAAT ATCGAGCAAA CGAGCTTCCA AGAGAAACGC
GCTGAGCTTG AGGCTACTGA TTTACCGCAT TTTGTACAGA AAATTGCTGA AGCCAAACAT
GAAACTGAAA CCGAGCTACG CGAACATGTT CTGCATAAAT TGCGTGAAAA TATTGGTCGC
GCCAAAGCAG AGCTTGATCG GATTAATGAT GCTTTGCAAG GCTTAGATTT CAATGGCCAG
CGTTATCATT TTCGCTATGA AATTGCCGAT TCATTGCGTG ATTTTTATCA ATTGATTGAG
CAATCAGCCA ATGTTACTAG CGAGATGATT CAGGATAGTG AATTTTATCA GCAGCACAAA
GCCACGTTTG ATCGTTTCTT CCAATTGTTA ATTCAGCCTG GCCTAACCGA CCAAGAAAAG
CTTGAGCAGG CTCAAATTAC CGATTATCGG CGCTACTTGA GCTATGATAT TGATGTGATT
GAGCGTGATG GTACTCGTTC ACGCTTGAGC AAAATTATGG GTCAAACATC TGGCGGCGAA
ACTCAAACTC CGTTTTATGT GACGATTGCG GCCTCGTTTG TGCAGCTTTA TAAAATTAAT
GAACGTAGCA AACGCTCAAC CATTCGGATT GTGGCCTTCG ATGAAGCCTT TTCCAAAATG
GATCAAGATC GGATTGGCTC GACGCTTGAT CTATTTCACC ATTTTGGTTT ACAAATTATC
ACGGCCACGC CGATTGAGCG TTGCGAATAT TTAGTGCCCA AAATGTGTAC AACCTTGGTT
TTGACTGGCT TGAAAGATCG CCAGCAGCGG GTTTTGGTCG AGCCATATCG TAATTATGCA
GCTCGTCTAG AGGCGCTGAA TGCTGAATCT GAACAAAACT AG
 
Protein sequence
MIKLSRIFLY HWHRFDWHVL DVQDSLYLAG HNGSGKSSIL DALQLVLVAD LGRVRFNSAA 
QDRSQRSLDS YVRGKIGEQH WLRPGDTIGY VVLEWTEDLK HQAFVSGVCL EARAATQSVE
KTFFIFDGHL NPDVFIEQGK PRTRREIKAM ARNRSTAESF DQTGEYQAAL LNRLGGLNQR
FFDLFLRALT FQPIRNIREF VEQWLLEPNS LDVHTLQNVV ERLRDLERKA KDVEEQLKSL
GAIIKTQEEA KRLRGLHGKF EILAAHYSVA AIQQQINQQQ QRLIDTRRDY ELNQTDLEQQ
LGLQQNAQAA LVEAQVQLSQ SDVVRRKNQL AAEIQQLQQH IQKINQRWRQ LQAKLIQIAA
LLNKLQPIAE QDLVNLLQQP EHIPQSQQIV PLLQQQSTAA GSALKLQLQA MTRTDDKISR
LFEQESRLKQ EITQLEQSNR QNHYPQNVEN VRKYLKSALN IELFLLCELL EIPDEHWQNA
VEAMLGQRRF NIIVEPKYYS QALDLLDQLR EKERIYDVGL VDLQQASAEA RPARPLSLAT
KVKAKTGLID KYIAAILGDI ITCQHVQDLR QHRRAITAEV MVYQEWTARS IDPRKFQPWF
IGERAQRSQI ESRRQQLTEI QAELAILRPE QQRQRQYYDQ LEKLQQLLLG LDSYFEEELD
SSELQTTLAE LQREHDSIDT SGVAALEAEV KRLHIIHEEY ALNIRRLERI IGTLNNQITQ
IEQQLQENQH QLLNAQSLFE HTQQRYPDQI DDALNDFQRE NNEQAQFSRL QETAEQQGRG
YNTRFNNTQE QLREQVILYN RDFRARELAN IEQTSFQEKR AELEATDLPH FVQKIAEAKH
ETETELREHV LHKLRENIGR AKAELDRIND ALQGLDFNGQ RYHFRYEIAD SLRDFYQLIE
QSANVTSEMI QDSEFYQQHK ATFDRFFQLL IQPGLTDQEK LEQAQITDYR RYLSYDIDVI
ERDGTRSRLS KIMGQTSGGE TQTPFYVTIA ASFVQLYKIN ERSKRSTIRI VAFDEAFSKM
DQDRIGSTLD LFHHFGLQII TATPIERCEY LVPKMCTTLV LTGLKDRQQR VLVEPYRNYA
ARLEALNAES EQN
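
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own method: the `translate` function and the table-building code are hypothetical helpers. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ only in permitted start codons, which does not matter here because the gene begins with ATG). Only the first and last lines of the gene sequence are used as input.

```python
# Codon table in NCBI base order T, C, A, G; amino acid assignments are the
# same for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence in this record.
first_line = "ATGATCAAGC TTAGTCGCAT TTTTCTGTAT CATTGGCATC GCTTCGATTG GCATGTGTTG"
last_line = "GCTCGTCTAG AGGCGCTGAA TGCTGAATCT GAACAAAACT AG"

print(translate(first_line))  # MIKLSRIFLYHWHRFDWHVL
print(translate(last_line))   # ARLEALNAESEQN
```

The two outputs match the first 20 and last 13 residues of the protein sequence. Translating the last line on its own works because the 54 preceding lines of 60 bases each keep it in reading frame.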