Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3755 |
Symbol | |
ID | 5735619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4721122 |
End bp | 4722591 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641280907 |
Product | AraC family transcriptional regulator |
Protein accession | YP_001546519 |
Protein GI | 159900272 |
COG category | [F] Nucleotide transport and metabolism [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [COG2169] Adenosine deaminase |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00051726 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGATGT TAACTGAAAT CGAGCCAAGT CTGCCAAGCC ACACCGAAAT GGTTGCCCAC ATGCTGCAAT CGGATGCCAG CTATAACGGC AAATTTATTA CGGCGGTCAA AACTACGGGC ATCTATTGTT TGCCAAGTTG CCGCGCCCGT AAGCCCAAGC CTGAGAATGT TGAATTTTTT ACCAATCCCA ACGCAGCTCA AGGTGCGGGC TATCGCGCTT GCAAATTGTG CCGCCCTGAT GATTTTTATC GTGGCTTTGA CCCTGAGGAA CATTTGACTG AGCAATTGAT TGAAGCGGTG CTGGCCCAAC CAGCCGCATT TGCCGATGTC AAGGCCATGG CGCAAACAGC GGGGGTTGGC CAAACCAAAT TATTTGAATT AATGCGCATT TATTACCACA CCACACCCGC CGATCTACTA CTACGAGCGC GAATTGAGGC CGCTTGTGGT TTATTGCTCA ACACCGATCA AACGATTATT GCGATCGCTA ATGAAGTGGG CTTCGATAGC TTATCGAGCT TCAATGAAAA CTTTCGCAAA CACACTATGC TCACGCCTAG CGAATATCGG CGTATATCTG AAACTGGGCG ATTTAGCCTT GCCTTGCCCA ACGATTATCC TAGCCGCCAA ATTTTAGGCC AGCTTGGGCG CGATCCAGTT AGCCTGACCG ATCAGGTGGT CGAGCAAACA TGGTACAGCA CCTGTCGCTT AAATGGGCAA ACTGGGGTGT TACTCGCAGT CACTATCACT CCAACCACGG CTGAATGCAG CATCGTGGAG CAATCAGCCG TAACGCCCAG CGACGTTGCC ACGATTCATC GCCATGTTAT TGCAGGCTTG GGTTTGAGTA ACGATCCCAG TCGTTTCGAG GCCCATGTTG CCAAATCGCC CGCCTTATTG CCATTAATTG AGCACCAACG TGGTTTGCGC ATGCCCTTGG TGCATAATCC ATTCGATGCC TTGGTTTGGG CAATTTTGGG TCAGCAAATT TCGCTGGCGG TGGCTTATCG TTTGCGCCAA CGCCTAACCG AGCTAGTTGG GCAACGATTA AATCAAGATT TTTATCTTGC GCCAACGCCC AATACAATTG CCCAACTAAC CGTTGAGCAA CTGCTACCCT TGGGCTTTTC CAACGCCAAA GCCCGCTATT TAATTGATAC CGCCCAGGCG ATTATCGCTG AAAGCTTGCC ATTGGCGAGC TATCACCGCA AATCGGCCAC ACGGATCGAG CGCGAACTAC TAGCGTTGCG GGGCATCGGC CCATGGACAG CCCAATATGT ACTAATGCGT TCGTTTGGCT TTAGCGATTG TGTACCAGTG GGCGATAGTG GCCTGACCAG CAGCTTACAG GCATTTTTTC AGCTTGAGCA ACGCCCCGAT CGCTCGACAA CCCTTGCTTT GATGGCAGCA TTTAGCCCTT ATCGCAGCCT AGCAACCTTT CATTTATGGC AACGTTTGAA GCCAATGTGA
|
Protein sequence | MLMLTEIEPS LPSHTEMVAH MLQSDASYNG KFITAVKTTG IYCLPSCRAR KPKPENVEFF TNPNAAQGAG YRACKLCRPD DFYRGFDPEE HLTEQLIEAV LAQPAAFADV KAMAQTAGVG QTKLFELMRI YYHTTPADLL LRARIEAACG LLLNTDQTII AIANEVGFDS LSSFNENFRK HTMLTPSEYR RISETGRFSL ALPNDYPSRQ ILGQLGRDPV SLTDQVVEQT WYSTCRLNGQ TGVLLAVTIT PTTAECSIVE QSAVTPSDVA TIHRHVIAGL GLSNDPSRFE AHVAKSPALL PLIEHQRGLR MPLVHNPFDA LVWAILGQQI SLAVAYRLRQ RLTELVGQRL NQDFYLAPTP NTIAQLTVEQ LLPLGFSNAK ARYLIDTAQA IIAESLPLAS YHRKSATRIE RELLALRGIG PWTAQYVLMR SFGFSDCVPV GDSGLTSSLQ AFFQLEQRPD RSTTLALMAA FSPYRSLATF HLWQRLKPM
|
| |