Gene Haur_4422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4422 
Symbol 
ID5736273 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5658405 
End bp5659883 
Gene Length1479 bp 
Protein Length492 aa 
Translation table11 
GC content49% 
IMG OID641281585 
Productputative PAS/PAC sensor protein 
Protein accessionYP_001547182 
Protein GI159900935 
COG category[T] Signal transduction mechanisms 
COG ID[COG1366] Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor) 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGCTT ATTGGATTAA TTTGGTATTT GTGATGCTTG AAACGGCTTT GGGCACGTTT 
ATTTTGGCCC GATCTAGCCA TTATCGCCCA GCCCGAATCT TTTTTGGGTT AACCCTATGC
TTGATAGTCA TTAATAGCAC GGCGCTTGCC CGTAATCTAA CAACTGATCA TGCTACCTCG
TATGGGTTGG TTGGGATTGC CCTGGTGAGC CTGGGGTTGC TTTGTTGGTT GATGTTGCTT
TTGTTTGCAG CGTTGTTTAT GCCACAATGG TGGGAAGGCT CACGCCCAAT TCGCTCGATT
TCATTGGTTT ATGGCCTATC GATTGGCTTA TTAGCACTTG ATTTGATTGG GCAATTTGGC
TGGTTTACCA CTGGGATCGA ATTAGCCAAT GGCAGCTATC GGCCTATCGC TGGGCCAGCG
GCGGGTTTGA TGTTGGCTTT ATTTAGCCTT GGGTGGTTGG TGCAGCTTGG ATTGTTAGGC
ATGGCATTTT GGAGACAACC AGCAACCCGC CGCTCCATTA GTTGGTTGGC TTTTGCAATT
CTCTTTTCAG CATTGACCAA CTCGGTATTG GGGATTGTCA AGCTTGAGCC AAGTGGCCAG
TTGGCAAGTG TGCTACAAAC GTTGCCCTTG GTTTTAAGTT TGACCTACAT TGTTTTGCGT
GGCAGTTTGT TTCAGACCAA ACAAGTAGCG GTGCAGCAGG CTTTGCAAAC CATGAGCGAA
GCGATGGTGG TCGTTGATCG CGAAGGAATG ATTGTTTATC TCAATAATGC GGCCCACCAG
CTTGGTTTGC AAACTCAGCA ACCACTTCAA CAGGCATTTC AAACGATTGG AGTTGCAGTT
GATGATGTTG CGGCTTTGGC TAAAGCATTA GCGCAGCCGC AAATCCAAGC TTTTACTCAA
ACGCTGGCCT TGGGCAACCC TTTGCGCTTG CTAGAAAATG CAGTATCACC AATTTTGGAT
AGCGCTGGAC AGAGCCAAGG TACGATGTTG TTCATTCGGG ATATTACCGA GTTGGAGCGG
CACACGGCTT TGTTGGAGCA GGAACGGCGG CGCTTGAGCG TAATGGTAGA ACAACTTGAG
CAGGAACAAA TCCAACGTAA TCAATTGACT CAAGCAGTTC AGGCGCTCTC ATTTCCAACG
ATTCCGGTTT TGCCGGGCGT GCTGGTTCTA CCCCTGATCG GGGTACTTGA TCAGCAGCGA
ATTATTGAAT GTCAACGGGT TTTGATGGAA TCATTGAATC AGCAGCCAGT TCAACGTTTG
CTGATAGATT TGACCGGAGT TCAATTGATC GATGCTGAAG GTGCAATTGG CATGCAGCGA
ATGTTGCGGG CCGCCTATTT GCTTGGTGCT CAAACAACAT TAATCGGGGT ACGACCTGAG
GTCGCTCAAG CCTTGGTTGG AATGGGCGCT GATTTGCAGC ATGTCGCGAC CGCCGCAACC
CTGCAAGCTG CGGTCGGCCA AATTATCGCC AAAGGCTAG
 
Protein sequence
MLAYWINLVF VMLETALGTF ILARSSHYRP ARIFFGLTLC LIVINSTALA RNLTTDHATS 
YGLVGIALVS LGLLCWLMLL LFAALFMPQW WEGSRPIRSI SLVYGLSIGL LALDLIGQFG
WFTTGIELAN GSYRPIAGPA AGLMLALFSL GWLVQLGLLG MAFWRQPATR RSISWLAFAI
LFSALTNSVL GIVKLEPSGQ LASVLQTLPL VLSLTYIVLR GSLFQTKQVA VQQALQTMSE
AMVVVDREGM IVYLNNAAHQ LGLQTQQPLQ QAFQTIGVAV DDVAALAKAL AQPQIQAFTQ
TLALGNPLRL LENAVSPILD SAGQSQGTML FIRDITELER HTALLEQERR RLSVMVEQLE
QEQIQRNQLT QAVQALSFPT IPVLPGVLVL PLIGVLDQQR IIECQRVLME SLNQQPVQRL
LIDLTGVQLI DAEGAIGMQR MLRAAYLLGA QTTLIGVRPE VAQALVGMGA DLQHVATAAT
LQAAVGQIIA KG