Gene Haur_1013 details


Gene Information

Locus tag: Haur_1013
Symbol:
ID: 5732917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1157123
End bp: 1158598
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 48%
IMG OID: 641278148
Product: XRE family transcriptional regulator
Protein accession: YP_001543789
Protein GI: 159897542
COG category: [R] General function prediction only
COG ID: [COG3800] Predicted transcriptional regulator
TIGRFAM ID:
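
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 1157123
END_BP = 1158598
GENE_LENGTH_BP = 1476
PROTEIN_LENGTH_AA = 491


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1158598 - 1157123 + 1 gives 1476 bp, and 1476 / 3 - 1 gives 491 aa, matching the listed values.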


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACTG ATGTAATTAA TTTAATTTTT GGCATGAAAC TGCGTCAAGC CCGCCTCGAA 
GCCAATTTAA GCCTGACTGA ATTTGCTGCA CGCGCTGAAT TATCGCCGTC GTATGTGACT
GAGATGGAGA AGGGGCGCAA GTATCCTAAG CCCGATAAAA TTGTCAAAAT GGCCCAAGTG
TTGGGCAAAG GCTACGATGA ATTGGTCTCG ATCAAGCTGA ATCCGGCTCT GGCCTACCTT
GAAAATATGC TTTCTTCGCC GTTGTTACGC CATTTTCCCT TCGAGGAATT TGAGCTTGAT
CCGAATGAGT TGGTTGGTTT ATTTACCAAA GCGCCTGATA AAGCTAGCGC CTTGTTACAC
GCAATTTGGG AAATTGCCCG CCAATATGAT ATGAAGGATG AGCATTTTTT TCGGGCGGCG
TTGCGTTCGT ATCAAGAAAT TCATGAAAAT TATTTTCCCG AAATTGAGGA AGAGGCCGAA
GCTTTTGCCA CCAAATATCA GCTTAATCTC TTTCCGGTTG GCATCGATCA GCTCAAGCAG
ATTTTGCGGC AGCAATTTGG CTATCACATT GATGAAACCA GCTTGGCCGA GCACGAAATT
CTCTCGCATT ATCGTGCGGT CTATATCGAC ACGCCGCGCA AAACCTTGTT GATCAATCCG
AATTTGTATG AATTGCAGCG TAAATTTATT TTGGCTCGCG AGCTTGGTTA TGCAGCGCTG
GGCTTGAACG AGCGTTCAGT CACCTCAACG CCTGATCAAG TGCTGTCGTT TCGCCAAGTT
TCCAACGATT TCAAGGCTTC ATATTTTGGT GGTGCGCTGT TGATGCCACG GCAGCCGATG
ATCGCTGATA TTGCAACCTT GTTCAATCAA ACGACGTGGA GTGCCGAGCC ACTATTGGCG
ATGCTTAATC ACTATGATGT TACGCCCGAA ATGTTGCTCT ATCGTTTCAG CGAGGTCATT
CCAGCGGCCT TTGGCATTTC GCTACACTTT TTGCGCTTCC ACAATGTTGA AGGCTCCTAC
AAGCTGATCA AACGAATTAA CATGAATCGC TTGTTGCTAC CAAGTGGAAT CGGTTTACAC
GAACATTACT GTCGGCGTTG GCTGGCCTCG CGACTGTTAA TCAATTTGGC TGATCATAAT
GGGCATCAGC AACCCTTGGT CGATGTGCAA ATCTCAGAGT TTTTAGAAAG CCAAGATCGC
TTTTTAGATT TGGGTTTTGC GCGGCCATTG GTGCTGACTC CAACTGTTGG CAGTAGTGTG
GTGGTTGGTT TTCGCTTTAC GCCTGAGCTA GCCAAAGTGA TCAAGTTCGT CGATGATCCG
GCGATTCATC GCGCGGTGAT TCACGAAACC TGTCAGCGCT GCCCGATTGA CGATTGTGCG
GTGCGTAAAT CTGTGCCAAC GGTGCTATGG GGCGAGCAAC TGCGCGATCG CCGCAACGCC
GCAATTGCCG AAATCCGCCG AACGTTGTTG CCATAA
 
Protein sequence
MSTDVINLIF GMKLRQARLE ANLSLTEFAA RAELSPSYVT EMEKGRKYPK PDKIVKMAQV 
LGKGYDELVS IKLNPALAYL ENMLSSPLLR HFPFEEFELD PNELVGLFTK APDKASALLH
AIWEIARQYD MKDEHFFRAA LRSYQEIHEN YFPEIEEEAE AFATKYQLNL FPVGIDQLKQ
ILRQQFGYHI DETSLAEHEI LSHYRAVYID TPRKTLLINP NLYELQRKFI LARELGYAAL
GLNERSVTST PDQVLSFRQV SNDFKASYFG GALLMPRQPM IADIATLFNQ TTWSAEPLLA
MLNHYDVTPE MLLYRFSEVI PAAFGISLHF LRFHNVEGSY KLIKRINMNR LLLPSGIGLH
EHYCRRWLAS RLLINLADHN GHQQPLVDVQ ISEFLESQDR FLDLGFARPL VLTPTVGSSV
VVGFRFTPEL AKVIKFVDDP AIHRAVIHET CQRCPIDDCA VRKSVPTVLW GEQLRDRRNA
AIAEIRRTLL P
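
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch, assuming the text has been copied into a Python string, the hypothetical helpers below strip that grouping and compute the G+C fraction of a DNA string. They are illustrative only and are not the method used to derive the GC content listed in the record.

```python
def ungroup(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (grouping is ignored)."""
    seq = ungroup(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence and last line of the protein sequence.
    first_gene_line = (
        "ATGAGCACTG ATGTAATTAA TTTAATTTTT GGCATGAAAC TGCGTCAAGC CCGCCTCGAA"
    )
    last_protein_line = "AIAEIRRTLL P"
    print(len(ungroup(first_gene_line)))
    print(len(ungroup(last_protein_line)))
    print(gc_fraction(first_gene_line))
```

Applying `ungroup` to the full gene and protein blocks and taking `len` of each result gives a quick check against the Gene Length and Protein Length fields.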