Gene Haur_3755 details


Gene Information

Locus tag: Haur_3755
Symbol:
ID: 5735619
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4721122
End bp: 4722591
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 51%
IMG OID: 641280907
Product: AraC family transcriptional regulator
Protein accession: YP_001546519
Protein GI: 159900272
COG category: [F] Nucleotide transport and metabolism
              [L] Replication, recombination and repair
COG ID: [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
        [COG2169] Adenosine deaminase
TIGRFAM ID:
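
The coordinates and lengths listed above are mutually consistent. A minimal sketch of the check follows, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4721122
end_bp = 4722591

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1470
print(protein_length)  # 489
```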


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00051726
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGATGT TAACTGAAAT CGAGCCAAGT CTGCCAAGCC ACACCGAAAT GGTTGCCCAC 
ATGCTGCAAT CGGATGCCAG CTATAACGGC AAATTTATTA CGGCGGTCAA AACTACGGGC
ATCTATTGTT TGCCAAGTTG CCGCGCCCGT AAGCCCAAGC CTGAGAATGT TGAATTTTTT
ACCAATCCCA ACGCAGCTCA AGGTGCGGGC TATCGCGCTT GCAAATTGTG CCGCCCTGAT
GATTTTTATC GTGGCTTTGA CCCTGAGGAA CATTTGACTG AGCAATTGAT TGAAGCGGTG
CTGGCCCAAC CAGCCGCATT TGCCGATGTC AAGGCCATGG CGCAAACAGC GGGGGTTGGC
CAAACCAAAT TATTTGAATT AATGCGCATT TATTACCACA CCACACCCGC CGATCTACTA
CTACGAGCGC GAATTGAGGC CGCTTGTGGT TTATTGCTCA ACACCGATCA AACGATTATT
GCGATCGCTA ATGAAGTGGG CTTCGATAGC TTATCGAGCT TCAATGAAAA CTTTCGCAAA
CACACTATGC TCACGCCTAG CGAATATCGG CGTATATCTG AAACTGGGCG ATTTAGCCTT
GCCTTGCCCA ACGATTATCC TAGCCGCCAA ATTTTAGGCC AGCTTGGGCG CGATCCAGTT
AGCCTGACCG ATCAGGTGGT CGAGCAAACA TGGTACAGCA CCTGTCGCTT AAATGGGCAA
ACTGGGGTGT TACTCGCAGT CACTATCACT CCAACCACGG CTGAATGCAG CATCGTGGAG
CAATCAGCCG TAACGCCCAG CGACGTTGCC ACGATTCATC GCCATGTTAT TGCAGGCTTG
GGTTTGAGTA ACGATCCCAG TCGTTTCGAG GCCCATGTTG CCAAATCGCC CGCCTTATTG
CCATTAATTG AGCACCAACG TGGTTTGCGC ATGCCCTTGG TGCATAATCC ATTCGATGCC
TTGGTTTGGG CAATTTTGGG TCAGCAAATT TCGCTGGCGG TGGCTTATCG TTTGCGCCAA
CGCCTAACCG AGCTAGTTGG GCAACGATTA AATCAAGATT TTTATCTTGC GCCAACGCCC
AATACAATTG CCCAACTAAC CGTTGAGCAA CTGCTACCCT TGGGCTTTTC CAACGCCAAA
GCCCGCTATT TAATTGATAC CGCCCAGGCG ATTATCGCTG AAAGCTTGCC ATTGGCGAGC
TATCACCGCA AATCGGCCAC ACGGATCGAG CGCGAACTAC TAGCGTTGCG GGGCATCGGC
CCATGGACAG CCCAATATGT ACTAATGCGT TCGTTTGGCT TTAGCGATTG TGTACCAGTG
GGCGATAGTG GCCTGACCAG CAGCTTACAG GCATTTTTTC AGCTTGAGCA ACGCCCCGAT
CGCTCGACAA CCCTTGCTTT GATGGCAGCA TTTAGCCCTT ATCGCAGCCT AGCAACCTTT
CATTTATGGC AACGTTTGAA GCCAATGTGA
 
Protein sequence
MLMLTEIEPS LPSHTEMVAH MLQSDASYNG KFITAVKTTG IYCLPSCRAR KPKPENVEFF 
TNPNAAQGAG YRACKLCRPD DFYRGFDPEE HLTEQLIEAV LAQPAAFADV KAMAQTAGVG
QTKLFELMRI YYHTTPADLL LRARIEAACG LLLNTDQTII AIANEVGFDS LSSFNENFRK
HTMLTPSEYR RISETGRFSL ALPNDYPSRQ ILGQLGRDPV SLTDQVVEQT WYSTCRLNGQ
TGVLLAVTIT PTTAECSIVE QSAVTPSDVA TIHRHVIAGL GLSNDPSRFE AHVAKSPALL
PLIEHQRGLR MPLVHNPFDA LVWAILGQQI SLAVAYRLRQ RLTELVGQRL NQDFYLAPTP
NTIAQLTVEQ LLPLGFSNAK ARYLIDTAQA IIAESLPLAS YHRKSATRIE RELLALRGIG
PWTAQYVLMR SFGFSDCVPV GDSGLTSSLQ AFFQLEQRPD RSTTLALMAA FSPYRSLATF
HLWQRLKPM
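
The protein sequence can be reproduced from the gene sequence with translation table 11, whose codon assignments are the same as the standard genetic code (only the set of permitted start codons differs). The following is a minimal sketch, not the tool used to build this record. The helper names `clean`, `translate` and `gc_fraction` are hypothetical, and the code assumes the input contains only the unambiguous bases A, C, G and T.

```python
from itertools import product

# Codon assignments for NCBI translation table 11, listed in the
# conventional T, C, A, G enumeration order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate reading frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence shown above.
first_row = "ATGCTGATGT TAACTGAAAT CGAGCCAAGT CTGCCAAGCC ACACCGAAAT GGTTGCCCAC"
last_row = "CATTTATGGC AACGTTTGAA GCCAATGTGA"

print(translate(first_row))  # MLMLTEIEPSLPSHTEMVAH
print(translate(last_row))   # HLWQRLKPM
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the Protein Length and GC content values in the Gene Information table.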