Gene Haur_1962 details


Gene Information

Locus tag: Haur_1962
Symbol:
ID: 5733851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2397262
End bp: 2398878
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 50%
IMG OID: 641279106
Product: putative PAS/PAC sensor protein
Protein accession: YP_001544733
Protein GI: 159898486
COG category: [T] Signal transduction mechanisms
COG ID: [COG1366] Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor)
TIGRFAM ID: [TIGR00229] PAS domain S-box
[TIGR00377] anti-anti-sigma factor 
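
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The following is a minimal sketch of that consistency check. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2397262
end_bp = 2398878

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which yields no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1617
print(protein_length)  # 538
```

Both results agree with the listed Gene Length (1617 bp) and Protein Length (538 aa).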


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.118262
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAGCA ATCTGGATCA GGATGGGGCG CAACCACCTC GCGACGTTCA GCAGACGATT 
GACAGCTTGC GGGCACAACT CTTAATCTAT GAACAAATTT TAGATACACT CCCCGATTTG
ATTTTATACA AAGGCCCAGG CTCGCATATT CGCTATGCCA ATCGCGCTTT TCGCGATTAC
TATGGCATGG ACAAGCAGCA GCTCCAAGAT GTGATCGATG CCGATTTCAA TCAGCCTGAT
TATACCCAGC AGTATATTCG TGATGATGCG CAGGTGTTTC AAACTGGTCA AACATTAGTG
ATCGAGTGCG AGCCAGTCAC CCATCATAGC GGTGATGTCC ATCTATTTGC GACCACCAAG
CATGCAATTC GCAGTCACGA GGGGGAAATC ATTGGCACGG TTGGCATCTC ACGCGATCTG
AGCACAACCA GCGAATCTGC GGCGGTGCGC ACTAATAACG AGATTCGTTT GCAGCGGATT
ATCGACAACG TGCCTGGCAT GGTCTATCAA TTATTGCTGA ATCCCGATAC AACCATCAGC
TTTCCATTTG TCAGCACAGG CAGTCGCGAT CTGTATGCCC ACGAGCCAGA AGCGATTATG
CATCAGGCCA ACATTGTGAC GGAGGCGATG CATCCTGCCG ACCGTAGCCG TTTTCAGGAA
GGAATGTTGG CTTCGGCCCA AAGCCTTACT GCTTGGCATT GGGAAGGCCG GATTGTGATC
AATGGCCACG AGCGTTGGTT GCAAAGCGCC TCGCGACCTA GCAAATTGGA TAATGGAGCC
ATTTTGTGGG ATGGTGTGTT GCTGGATATT ACCCAGCAAA AGCAAGCCGA AGCACTGTTA
ATCCGCTTTG ATAGTATTTT AAGCGCGACC CCTGATCTGG TAATGATTGC TGATGCTGCT
GGTCAGATCC AATATCTCAA TCCAGCCGCT CGCCACGTTA TCCAATCCGA ACCAACCACA
GCAGCTGAGC CATTAACGTG GCAACAGCTT TACCGCACCG CCCCCGACCA AGCTTGGATT
AGCCAAACCT TGCAGCATGC CCAAACCCAT GGCTCATGGA TGGGCGATTA TCATTTGCAA
ACGGCCCAAG GCAGCCAGCT GCCGCTGCAT TATCAAGTGT TGTGTCATTA TGATTATGAT
CAGCAGCCCA GCTTTTTTTC ATTGATCGCC CACGATATTC GCGATCAGCA ACAGGCTGAT
GCCGAACGTC AGCGCTTGCA CGAAGAAATT ATTCGGACTC AACAGCAAGC CTTGATTGAT
CTTTCAACTC CATTGATCCC AATTGCGGAT ACGGTTGTGT TGATGCCCTT GATTGGTAGC
GTTGATACGG CGCGGGCGCA GCGCTTGATG GAAACCTTGT TGGAAGGCGT GCATACTCAC
AAAGCCCAGA TGGCGATCGT TGATGTGACG GGTGTTCCGA TTATGGATAC CCAGGTGGCT
GGATTACTCA TTCGGGCCGC TACAAGTGTA CAATTGCTTG GTGCTCAGGT TATTATCACC
GGCATTCGCC CCGAGTTGGC CCAAACCCTC GTGACCTTGG GCATTGATTT CCGATCCATC
ATTACCCAAA GTAGTCTGCA ACAAGGCATT ACCTATGCAA TCCAATCACT AAAATAG
 
Protein sequence
MGSNLDQDGA QPPRDVQQTI DSLRAQLLIY EQILDTLPDL ILYKGPGSHI RYANRAFRDY 
YGMDKQQLQD VIDADFNQPD YTQQYIRDDA QVFQTGQTLV IECEPVTHHS GDVHLFATTK
HAIRSHEGEI IGTVGISRDL STTSESAAVR TNNEIRLQRI IDNVPGMVYQ LLLNPDTTIS
FPFVSTGSRD LYAHEPEAIM HQANIVTEAM HPADRSRFQE GMLASAQSLT AWHWEGRIVI
NGHERWLQSA SRPSKLDNGA ILWDGVLLDI TQQKQAEALL IRFDSILSAT PDLVMIADAA
GQIQYLNPAA RHVIQSEPTT AAEPLTWQQL YRTAPDQAWI SQTLQHAQTH GSWMGDYHLQ
TAQGSQLPLH YQVLCHYDYD QQPSFFSLIA HDIRDQQQAD AERQRLHEEI IRTQQQALID
LSTPLIPIAD TVVLMPLIGS VDTARAQRLM ETLLEGVHTH KAQMAIVDVT GVPIMDTQVA
GLLIRAATSV QLLGAQVIIT GIRPELAQTL VTLGIDFRSI ITQSSLQQGI TYAIQSLK
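
The protein sequence can be checked against the gene sequence by translating the latter codon by codon. The sketch below does this for the first and last rows of the gene sequence only. It uses the standard codon assignments, which translation table 11 shares for internal codons. Table 11 differs in the set of permitted start codons, which does not matter here because the gene begins with ATG. The helper names `clean` and `translate` are illustrative and not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence above. The last row is in frame
# because the 26 preceding rows hold 60 bases each.
first_row = "ATGGGAAGCA ATCTGGATCA GGATGGGGCG CAACCACCTC GCGACGTTCA GCAGACGATT"
last_row = "ATTACCCAAA GTAGTCTGCA ACAAGGCATT ACCTATGCAA TCCAATCACT AAAATAG"

print(translate(first_row))  # MGSNLDQDGAQPPRDVQQTI
print(translate(last_row))   # ITQSSLQQGITYAIQSLK
```

The two outputs match the beginning and end of the protein sequence listed above.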