Gene Haur_1198 details

Gene Information

Locus tag: Haur_1198
Symbol:
ID: 5733091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1379470
End bp: 1381227
Gene Length: 1758 bp
Protein Length: 585 aa
Translation table: 11
GC content: 50%
IMG OID: 641278338
Product: cell envelope-related transcriptional attenuator
Protein accession: YP_001543974
Protein GI: 159897727
COG category: [K] Transcription
COG ID: [COG1316] Transcriptional regulator
TIGRFAM ID: [TIGR00350] cell envelope-related function transcriptional attenuator common domain
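
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1379470
end_bp = 1381227

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1758, matching "Gene Length"
print(protein_length)  # 585, matching "Protein Length"
```

Under these assumptions both derived values agree with the reported gene length and protein length.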


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCTGCAA AAACCGATAC ACCTGAGGAA TCGACCAACG GGTCAGCCAA TGATCTATCG 
CTCGATGCTG AGCATGTTCG TAGTCAAGCT ACCACACCCT TGCCAACTCG TTCACGTGCC
GCAGCTAAAG CCGATGCTGC CAACGACGCG GCGTTGATGA ATGATCTGTT GTTGAACAAT
AATTCGACTG CTGCTGCACG CCCACGCCCA CGGGCCGTAC CACGTGCTGA CCACGAAGGA
TCATATGTTC GTGTGCCCAA GGCAGTCCAA TCGGCGGTTG GCAGCCGACC AACCTCGCGC
CCATTGCCAC CCAAAACAGA TCCAATCAAA CGTACCTCGC GTGGAGCTGG CTGTGGTATG
GTTATGCTTG GGGTCTTGCT AACGGTGGTT TTGGTCGGTG GTGGTGGGGC ACTTTGGGCC
TATTTAAAGG TTAAAAATGC CACAAGTGAT GCCTTAGTAA CCATCCCAAC CCAAGCGCCA
AACCTAGTCG AAAATCCTAA TAATCCTAAT CCTCAGCCCA ATCAACCTTT GGCAACCCCC
GATATTGTCA AAGATCCCTT TAATTTGTTG TTGATTGGGG TCGATTTACG TGAAAATGAT
ACCAAAGCTC GCACTGATAC CATCATTGTG TTGCATATTG ATCCAACCCA AAAATGGGCC
AGCATGGTTT CGATTCCGCG CGATAGTTGT GCCGAAATTC CAGGCTACGA TGCCCCAGGC
ACATGTTCGC AGCGAATTAA TGCAGCCTAC GAACTGGGTT ATAAAGAAGG TATCGCCCAA
AATATGACGA TTCCTTCAAC TCAGGCAATG GCATTAACCC GCGATACCGT CGCCAATATG
CTGAATATTA ATATCGATTA TGTTGCTCAA GTCGATTTTA AGGGCTTTCG CAAAATTGTT
GATGCCGTTG GGGGCATTAC AATCGATGTG CAACGCCCGC TATGGGATGC GACTTACCCA
ACCGATGATG ACGATTATGG AGTGATTCGG CTGTTTATTC CGGCTGGCTT GCAACACATG
GATGGCACAA CTGCGCTGCG TTATGCCCGT TCACGCCATC AAGATGCTGA TTATGGCCGC
TCACGCCGCC AACAAGATGT GATTCGAGCG CTGGTTCAAA CCCTCAAAGA TAAAGGCTTG
CTCGACCAAA TTGATGCCTT GGATAGCCTT GCTGCACAAC TCAAAGGCTC GTTCTATACC
GACCTGCCAA TCGATGACCT TGGCAATTTA CGGGCTTTGG CTGGGCTTGG CAGTGATATT
GCCAATGGCC GGATCAAGAG TGTCAAATGG GATACCAGCT CAGTAATCGG CTATATGGAT
GAGTCGCAGT ATGTGCCAAT TTGGGACCCT GCCAGCATCG CGGCCACGGT TGATCAATTA
TTGACCAGCC CAATTCCTGA TACCAATAGC CCAATTGATG GTAGCTCAAA CACACCTTCG
GACGACAATC TAAGCATCGA AGTGATCAAT GGAGCGCAAA TTAGTGGTTT GGCAGGCGAT
GTAGCAACTC ACCTCGAAAA CCGTGACTAT CAATTGCTCA ATCCTTCAAC CGCTTCAACC
GTGTATGACA CCACCAAAAT CATCGATTTT GGCAATCATA AAGAACTGCG TGAGCAACTT
GCTGCTGAAT TGGGGATTAG CAGCCGCAAC ATTATTGTGG TAAGCAAAAC CAGCCCAGCT
CCCGAACAAC CAAAGGGCGA CGCAGCTTTA GTGTTGTTGC TTGGCCGCGA CTATGATGAG
GCGTGGCGCA AACCCTAA
 
Protein sequence
MPAKTDTPEE STNGSANDLS LDAEHVRSQA TTPLPTRSRA AAKADAANDA ALMNDLLLNN 
NSTAAARPRP RAVPRADHEG SYVRVPKAVQ SAVGSRPTSR PLPPKTDPIK RTSRGAGCGM
VMLGVLLTVV LVGGGGALWA YLKVKNATSD ALVTIPTQAP NLVENPNNPN PQPNQPLATP
DIVKDPFNLL LIGVDLREND TKARTDTIIV LHIDPTQKWA SMVSIPRDSC AEIPGYDAPG
TCSQRINAAY ELGYKEGIAQ NMTIPSTQAM ALTRDTVANM LNINIDYVAQ VDFKGFRKIV
DAVGGITIDV QRPLWDATYP TDDDDYGVIR LFIPAGLQHM DGTTALRYAR SRHQDADYGR
SRRQQDVIRA LVQTLKDKGL LDQIDALDSL AAQLKGSFYT DLPIDDLGNL RALAGLGSDI
ANGRIKSVKW DTSSVIGYMD ESQYVPIWDP ASIAATVDQL LTSPIPDTNS PIDGSSNTPS
DDNLSIEVIN GAQISGLAGD VATHLENRDY QLLNPSTAST VYDTTKIIDF GNHKELREQL
AAELGISSRN IIVVSKTSPA PEQPKGDAAL VLLLGRDYDE AWRKP
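
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a hedged sketch rather than the method used to produce the record: `translate` and `CODON_TABLE` are hypothetical names, the table uses the standard codon-to-amino-acid assignments (which translation table 11 shares, differing only in its permitted start codons), and alternative start codons are not handled.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence shown above.
print(translate("ATGCCTGCAA AAACCGATAC ACCTGAGGAA"))  # MPAKTDTPEE
print(translate("GCGTGGCGCA AACCCTAA"))               # AWRKP
```

The two printed fragments correspond to the first ten and last five residues of the protein sequence listed above.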