Gene Haur_3345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3345 
Symbol 
ID5735215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4217602 
End bp4219722 
Gene Length2121 bp 
Protein Length706 aa 
Translation table11 
GC content53% 
IMG OID641280492 
ProductRNA-binding S1 domain-containing protein 
Protein accessionYP_001546109 
Protein GI159899862 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTATG CCCATTTAAT TGCCAAAGGC TTGACGGTGC GTGCCGAGCA AGTTACCGCT 
GCGATTCAAT TATTCGATGC AGGCAATACT TTGCCGTTCG TCGCTCGTTA TCGTAAAGAG
CAAACTGGTG GCTTGGATGA AGAACAATTA CGCAGCATTC AAAGCCAAAT TGCCCGTTTA
CGTGAGCTTG ACGAGCGGCG CGAGGCGATT TTGAGTGCTC TGCGCGAGCA AGGCAATTTG
AGTGATGAAT TGGCCCAAGC CTTGGCCGCC GCTACTGATA AAACCACCCT CGAAGATTTG
TATGCGCCGT TCAAGCCCAA ACGCCGTACC CGCGCCAGCA TAGCCCGCGA ACGTGGCCTC
GAAGGCTTGG CTGAAATTAT TCAAATGCAG CCGAATGATC CGATTGATGC TACCGCTCGC
CAGTTTCTCA ACGAACAGGT TGCCAGCATT GAAGAGGCCT TGGCTGGAGC ACGCGATATT
GTAGCCGAGC AAATCAGCGA TCATCCTGAG GTGCGCCGTC AAACTCGCGA ACGCGCTTTG
CGTTGGGGCG TGGTTCGCAG CGAGTTAATC GCCGATGCCG AGGATAGCAA GGGCGTATAT
CAAACCTACT ATCAATTTGA GAGCACGGCC AGCCGCCTCA AGCCTTACCA AGTGTTGGCG
CTCAACCGTG GTGAAACCGA ACATATTTTG CGTTTTAAAA TTCAGATGGA TCAACGTGAT
TGGTTTGATG TGGTTGCCAA GTATTTTCCG CTTGATCAAC GTTCAGCTTG GGCCGAGCAA
CTGCGCTTGG CGATCCACGA TGGCGCTGAG CGATTGCTCT TGCCAGCAAT TGAACGCGAT
GTGCGCCGCG CCTTGACTGA GCAAGCCGAA AGCCATGCGA TCACCGTCTT TGCCAAGAAT
GTGCATTCGT TGTTGCTGCA AGCGCCAATC GCCAATAATG TGGTGCTCGG GCTTGATCCA
GGCTACCGCA CTGGTTGCAA AGTGGCGATT ATCGGCCAAA CTGGCAATGT GCTCACGACT
GCCACGATTT ATCCTCACAG CGGCGCGGCA GCGCGTGAAC GGGCGTTTCA AGAATTGCAA
AGCTTGATCA AACGCTATGC TGTGAGTTTG ATTGCCATTG GCAATGGCAC GGCCTCGCGC
GAAACTGAGC AATTAGTCGC CGATGTGATT CGCCACCAAA CTGGTTTGCA CTATTTGATT
GTCAGCGAAG CAGGAGCCAG TGTTTACAGT GCTAGTACGC TTGCCCGCAG CGAATTGCCT
GATCTCGATG TCAGCTTGCG CGGCGCGGTT TCGATTGCAC GGCGGGTGCA AGATCCCTTG
GCCGAGTTGG TCAAAATCGA GCCAAAAGCG ATTGGCGTAG GCATGTATCA ACACGATGTT
GATCAATCGG CCTTGGGCAA TGCGCTTGAT GGCGTGGTTG AGAGCGCAGT TAATAATGTT
GGGGTTGATG TCAACACCGC TTCGCCTGCG CTTTTACGCT ATGTTGCCGG GATTGGCCCC
AAACTTTCGG CTCAAATTGT CAGCCATCGC GAGGAACATG GCCCATTTCG TTCGCGGGTT
GCACTCAAAA AGGTCAAAGG GCTTGGGCCA AAAGCCTTTG AACAGGCCGC CGGATTTTTG
CGAATTCGCG ATGGCGATGA AGCCTTGGAT GCCAGCGCAA TTCACCCCGA AAGTTATACG
GTTACCCGTA ATTTGTTGGA CAAGCTGAAT ATTAACGCCA AAACAGGCCG CAACGAACGA
ATCAAACGCT TGGAAGATTT GAAAAATCAG CCATTGCATA GCCTTGCGGC GGAATTGGGC
ACAGGCGTAC CAACCCTGAG TGATATTATT GACCAACTGC TGCGGCCAGG TCGCGACCCC
CGCGAGGATG TGCCAGCACC AATTTTGCGC AGCGATGTGC TGGCTTTTGA AGATTTGCAG
CCGGGCATGC AGCTCAAAGG CACAGTGCGC AATGTCGTCG ATTGGGGCGC ATTTATCGAT
TTGGGGGTTA AGCACGATGG CTTATTGCAC CGTTCGCAAA TTCCCCGTGG CCTGAGTTTG
AGTGTTGGCG ATATTGTCGA TGTTAGCATT CAATCGATCG ACCCAGATCG CAAACGGATT
GCCTTAGTTT TAGCACAATA A
 
Protein sequence
MDYAHLIAKG LTVRAEQVTA AIQLFDAGNT LPFVARYRKE QTGGLDEEQL RSIQSQIARL 
RELDERREAI LSALREQGNL SDELAQALAA ATDKTTLEDL YAPFKPKRRT RASIARERGL
EGLAEIIQMQ PNDPIDATAR QFLNEQVASI EEALAGARDI VAEQISDHPE VRRQTRERAL
RWGVVRSELI ADAEDSKGVY QTYYQFESTA SRLKPYQVLA LNRGETEHIL RFKIQMDQRD
WFDVVAKYFP LDQRSAWAEQ LRLAIHDGAE RLLLPAIERD VRRALTEQAE SHAITVFAKN
VHSLLLQAPI ANNVVLGLDP GYRTGCKVAI IGQTGNVLTT ATIYPHSGAA ARERAFQELQ
SLIKRYAVSL IAIGNGTASR ETEQLVADVI RHQTGLHYLI VSEAGASVYS ASTLARSELP
DLDVSLRGAV SIARRVQDPL AELVKIEPKA IGVGMYQHDV DQSALGNALD GVVESAVNNV
GVDVNTASPA LLRYVAGIGP KLSAQIVSHR EEHGPFRSRV ALKKVKGLGP KAFEQAAGFL
RIRDGDEALD ASAIHPESYT VTRNLLDKLN INAKTGRNER IKRLEDLKNQ PLHSLAAELG
TGVPTLSDII DQLLRPGRDP REDVPAPILR SDVLAFEDLQ PGMQLKGTVR NVVDWGAFID
LGVKHDGLLH RSQIPRGLSL SVGDIVDVSI QSIDPDRKRI ALVLAQ