Gene Haur_3418 details


Gene Information

Locus tag: Haur_3418
Symbol:
ID: 5735279
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4305180
End bp: 4308101
Gene Length: 2922 bp
Protein Length: 973 aa
Translation table: 11
GC content: 50%
IMG OID: 641280565
Product: TPR repeat-containing protein
Protein accession: YP_001546182
Protein GI: 159899935
COG category: [K] Transcription
COG ID: [COG2909] ATP-dependent transcriptional regulator
TIGRFAM ID:
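
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 4305180
end_bp = 4308101
gene_length_bp = 2922
protein_length_aa = 973


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(start_bp, end_bp) == gene_length_bp)
    print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 4308101 - 4305180 + 1 gives 2922 bp, and 2922 / 3 - 1 gives 973 aa, which agrees with the listed Gene Length and Protein Length.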


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTTTTT ATCCGACCTA TGTTGAACGC CCACGTTTAC TCACTTTGAC CCAACATGCG 
CCACGGGTAG TTTTAATCAT TGGGTTGCCA AGCAGCGGCA AATCCACAGT GTTAGAGGCA
CTCAAGCATT CCATCAACGC CCCGTCTGCC TTGATTCGGC TTGAAGAGTC AGCGCATGAT
CCCCTAGCGC TGCTTCAGGC TATTTGCCAA CAACTTGGCT TACCTAGCGA TCATGCACAT
GAGCAGCTTT TAAGGTTACG CCAACCAAGC TGGATTTTGA TCGACGATTT GGATCGGATC
TTGGATGCGT GGGGGACGTT CTTGCCTGAG CATGCTTGGC TGAGCGAATT ATTGAAAAAC
CCAGCCCTGC ATATTGCTGC TACTTCGTTA CATCTGCCCA ATTTGCCCAT GATTCAACAA
TCGATTATTC GTCGGCAATT GCGCATAGTG GGTATGCAGG AACTGGCCTT CACCAGCGCA
GAATTAATCG AGCTGGCGCA ACAACGGCAA CTCCACCTAA CCCACGCTCA AGCTGAGCAA
TTAATCGCCT TGACCGATGG TTGGGTTGCT CCTTGTATTT TTGCGCTCAA TCGTTCTGGT
GTGCTCAATC AATCGGCGGT AGAGTGGGTG ATGCAAGATG TGATGGCGAG TTTGCCAGCG
CCGTTGCATC AGATGCTTGA AGCTTGGCTT TGGCTTGCCC CCGATTCGAC CGAATTGCTC
CAAGCGTTAC TGCCACACTA TGATCTGCAT CGTCTGATGA TGAGTTGGGA ATATGCGGGG
TTGTTGGCGC AGTCGAGTCT TCAGTTGCAG CGTTTGCCCG AGGTCGGTCT GCGCCAGCGT
TCTGATCCAC AGGCCAGCTA TCTGAGCCCA TGGTTTGAGC AGGCTGGCGA TTGGTATCTG
GCGCAACAAC AATTATTGCC ATTTGTACAA AAAATGCAGC GCTTGCAACG TTGGGATGTG
ATTAACCATA GCCTAACGCG CAATTTGGCA TTGATTGATC GTCAGCAGTT TTTTGCTGAG
ATTGTCAATG CCTTGAATGC GGTTCCAGTT CAGCAGCTTA GCTCAAAAGT AGGTTGGTTG
TTGGTGCGCA GCTACCATCT TGGGTTACGG AATGGTGAGA AAGCCTTGCT TTTGGTTAAT
CAGCTGCTAG GCATTCATCA GCAGCCCGAT GATCAGCGCA TGTTGCGCTT GCTCAAGGCC
GATATGTTGC GAGCACAGGG CGCAATTGTC GAGGCTGATG GTTTAATTCA GCCCTATCTG
GATGACCCTG ATTTATCGCC AGCCGATCAG GCGCGGGTGC TCCGAACGCA TGCAAGTTAT
CTTGTATCGT TGGGTCAAGA TGATAATGCG ATTACCCATT TTGAGCGGGC ATATGGCTAT
ATTCAACATT CTGGCATTAA TCGTTTGCTC GGGTTGATTC TCGGTGATTA TGCGAATGCG
GCGGCACGCG CCGGACGCTA CTCGTTGGCC GATCGACTGT TACGCCAAGC GACGTTGCTG
TGGAATGAAT TAAATTCGCC TGCGGGTTTA TCGCAGACGA TGAATCTGCG GGCGGTGGTG
GCACTGCATC GTGGCCAAAT TCGTGATGCA GCCCGCTACG CTCAAGAAGC CCTCGATAAT
GCCTTGATTT CCGATAATCG GGCTGCCAAT GGTGCCTTGA TCACGTTGGG CGATGTAGCG
TTAGCTGATC GTGATTGGAC AAGTGCGATT GCGCGTTATC GTTCGGCGCG TGAGCAATTA
CGCAATAGTA ATATGGTCGA TTATCATATT TTGACCTATG CCTTGGCCTT GGAATCGCAA
GCTACGCGCC ATAGTTCGAT CGAGCAGTTG CAACGCTTAT TGATTGAGAT CGATACTACG
ACGGCCCAAA ATCCGCTTGA TCAGGCTTGG TTGGCAGTAG CTCGTAGTGC TGCTAACTTG
ACATTGAAGC TTGATGGCAC GATTCCGTTA TTGCAGCGAG CGTTGGCTGG GCTTGATAGC
GAAGGTGCGT TGGCCAAAGG TTTATTGTAT CTCTTGCTCA GCGAAGCTTT TTGGCAGCGT
GATCAATTTG GTGAGGCTCG CCAAGCGTGG GAAGCCCTTG ATAAGTTGAT TATTGATGGG
CGCAGTGGCT TGCCCGTGCT ATTGTCGGCG CTGACCGTGC ATCTCCCTGA ATTGGTTCAT
GAGGCCTATA CCCAATGGAA TTCACCATTT GCGGCACGGG TGCTGCACCA TAGTTTGCCA
ATGCCGCAAG CACCAAAATT GATTATTCGC TTAATGGGGC AGGTGCGTGT CGAGATGCAC
GGCAAGCCTG TCAAAATTCC TAAACAAGGT ATTTTATTAA TTGCCCTGTT GCTTATGAAC
CCTAGCGGGA TGACTGCTGA TGAATTACGG GCAAAAATTT GGGGATTTGA TAGTAGCAAT
GATGGCTGGC GTAAAATGTT GCAACGTACT CGCAAAGAAT TGCCCGATTG CGTTATTTCT
GATGGTTCGA TTTATCGTTT GTGTTTCCCA TTGTCTGAGA TCGACGCTGA TATTTTGGTG
ATCAATCAGA CTCCATTACA AGGCTCTGAT GCGACGCTTG ATCGTCTGCA AATAGCCGCC
GATTATGCAA GCCAAGTGTT TTTATCGGGC CATGAGGCTC CTTGGATTGA ATGGGAACGT
AAGCAGTTAT CCAAACGTGG AGCCGAAATT TGGATTTCAA TCGGCATTCA ATGCTATGAT
GCCCTGCGCT TTGATCAGGC TCAAAAGGCC TTTGCTCAAG CATTGAGACT AGATATTTCG
AATGGGCGTG CGGTTTCGCA GGCAATGAAC TTGGAGATTA ACCAAGGTCG TCGGCTTGAG
GCCTTGGCAA TCTATGACCG TTACCGTGAG GCCTTGTTTG AAGAATATGG GCTAGACCCT
TCGGCAGAGC TGCAAGCTTT GCAAAAACGT GCTTTGGATT AA
 
Protein sequence
MSFYPTYVER PRLLTLTQHA PRVVLIIGLP SSGKSTVLEA LKHSINAPSA LIRLEESAHD 
PLALLQAICQ QLGLPSDHAH EQLLRLRQPS WILIDDLDRI LDAWGTFLPE HAWLSELLKN
PALHIAATSL HLPNLPMIQQ SIIRRQLRIV GMQELAFTSA ELIELAQQRQ LHLTHAQAEQ
LIALTDGWVA PCIFALNRSG VLNQSAVEWV MQDVMASLPA PLHQMLEAWL WLAPDSTELL
QALLPHYDLH RLMMSWEYAG LLAQSSLQLQ RLPEVGLRQR SDPQASYLSP WFEQAGDWYL
AQQQLLPFVQ KMQRLQRWDV INHSLTRNLA LIDRQQFFAE IVNALNAVPV QQLSSKVGWL
LVRSYHLGLR NGEKALLLVN QLLGIHQQPD DQRMLRLLKA DMLRAQGAIV EADGLIQPYL
DDPDLSPADQ ARVLRTHASY LVSLGQDDNA ITHFERAYGY IQHSGINRLL GLILGDYANA
AARAGRYSLA DRLLRQATLL WNELNSPAGL SQTMNLRAVV ALHRGQIRDA ARYAQEALDN
ALISDNRAAN GALITLGDVA LADRDWTSAI ARYRSAREQL RNSNMVDYHI LTYALALESQ
ATRHSSIEQL QRLLIEIDTT TAQNPLDQAW LAVARSAANL TLKLDGTIPL LQRALAGLDS
EGALAKGLLY LLLSEAFWQR DQFGEARQAW EALDKLIIDG RSGLPVLLSA LTVHLPELVH
EAYTQWNSPF AARVLHHSLP MPQAPKLIIR LMGQVRVEMH GKPVKIPKQG ILLIALLLMN
PSGMTADELR AKIWGFDSSN DGWRKMLQRT RKELPDCVIS DGSIYRLCFP LSEIDADILV
INQTPLQGSD ATLDRLQIAA DYASQVFLSG HEAPWIEWER KQLSKRGAEI WISIGIQCYD
ALRFDQAQKA FAQALRLDIS NGRAVSQAMN LEINQGRRLE ALAIYDRYRE ALFEEYGLDP
SAELQALQKR ALD
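
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequence is pasted as 10-base blocks separated by whitespace, as shown above, and uses the amino acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in which start codons it permits). The helper names `clean` and `translate` are hypothetical, and alternative start codons are not handled, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Codon table in the usual T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate(clean("ATGAGTTTTT ATCCGACCTA TGTTGAACGC")))
```

Applied to the first 30 bases, this yields MSFYPTYVER, which matches the first block of the protein sequence. The final codons of the gene (GCT TTG GAT TAA) translate to ALD followed by a stop, matching the end of the protein sequence.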