Gene Haur_1918 details


Gene Information

Locus tag: Haur_1918
Symbol:
ID: 5733807
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2312314
End bp: 2313855
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 46%
IMG OID: 641279062
Product: hypothetical protein
Protein accession: YP_001544689
Protein GI: 159898442
COG category: [L] Replication, recombination and repair
COG ID: [COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily)
TIGRFAM ID:
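
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in exactly one stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2312314
end_bp = 2313855
gene_length_bp = 1542
protein_length_aa = 513


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_codons(gene_length_bp) == protein_length_aa
```

Under these assumptions the 1542 bp span corresponds to 514 codons, one of which is the stop codon, which matches the listed protein length of 513 aa.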


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATTA ATGCAACTCG TATCATTCGT AAACGGATTA TTTTCAAGGC CGAGCTAGTG 
CTTACAAGTG CGGCGGTCTT CAGTAATGGT GATAGCGACC CAATTATTGA TATGATGATT
CTGCGCGATA GTTGTGAGCC GCAAAAGGCG CTTTTGCCAG GTAGTAGTTT GGCTGGGGCT
TTACGCAGTT ATTTAAAAGA TATAACCAAA GATGACACAG CGATTGATAA ATTATTTGGC
TGTATCGGTG ATGAAGAGGT TGGTGAATAT TTTGGGCAAA GTCGGCTCTT GATTAGTGAT
GCAGTCAGTC GTGAGCCGAT TCAGGCTGAA TTACGCGATG GGGTGCGGAT TGACCATGCC
ACCAGAACTG CTGCTGATCA GGCAAAATAT GATTTAGAGG TGTTGCCAGC CGGAACGTGC
TTTAGCTTAG AGCTTGAATT TATTGTACTG GAAGAAGTAC CAACTATCAA TCTGTTGCCA
TTAGTAGTTC AAGCTTTGCA TGGCCTTCAG ACGGGCCAAA TTAGCCTTGG CATGAAGAAA
AATCGTGGAT TGGGGCAATG TAGCGTTAAA GGTTGGGATA TTTATGAAAT AGATATGACC
AAGCCTACTG AAATTTTTGG CTGGCTTGAG CGTGATGATA AGAACCCTAT TCCTACCGCT
GCGCAACATT CTAATCTCTA TGACTATTTT GGGTTCCAAC CTGCCGATGC CAGCCGCTAT
CCGGTAACGC TTACAGCAAA TTTTACCTTT GCTGATGATG CGATGTTAAT TCGTTCGGCT
CTCCAAACCA ATAATCTAGA AGATCTGATC AATAATCCGG ATGATCAGAC TGTATCCAAG
AAGATTCCCG ATCCTGTGCA TTTGCAAACA CGGGTTGATG GCACGTTGCA GCCCGTTATT
CCTGGTACAA GTTGGGCTGG GGTTTTGCGC CATCGGGCTT TACGCATTTT GAATACTTTG
AAAGTGGCAA CTGCCGAGCA GCAACTTGAT GAGTTATTTG GCTTTGTTAT AGAGCAGCAA
GCTAAAGCAC AGGCCAGCCG AATCATGATC AAAGATAGCA TTATTCAGCA CCCTGCGACT
GAACCATTAG TGCAAAACCG CATTGCAATT GATCGCTTTA CGGGTGGTGC GTTTGATGGG
GCGCTGTTCA GCGAAATGCC GGTGTGGAAA ACCGACCAAA CTTGCGTCAC GCTTGAAATT
TCGATCAAAC CACCCCGACC AAAAAAAGAA GCCCAACAAG ATCAGCAAAA ACCAGAAGAT
ACGCCACAGC CTGATCCTAA GCCAACGCCT AAATTTAATC AGGCTGAAGT TGGCTTGTTG
CTGTTATTGC TCAAGGATTT GTGGACTGGC GATTTGGCGA TTGGCGGAAC CAGTAGCATC
GGGCGCGGGC GACTTCAAGG GCTTGAGGCC ACGTTAACCG TCGATGGTGC TGAATTTTGC
TTCAAGCAAG CCACTGATGG TATCGGTTCA TTGCATATAA CAGGAACTGG CAAGCGAGAT
CAATTACAAA TGTATGTTGA AGCGATCGGA GCGTCCTCAT GA
 
Protein sequence
MSINATRIIR KRIIFKAELV LTSAAVFSNG DSDPIIDMMI LRDSCEPQKA LLPGSSLAGA 
LRSYLKDITK DDTAIDKLFG CIGDEEVGEY FGQSRLLISD AVSREPIQAE LRDGVRIDHA
TRTAADQAKY DLEVLPAGTC FSLELEFIVL EEVPTINLLP LVVQALHGLQ TGQISLGMKK
NRGLGQCSVK GWDIYEIDMT KPTEIFGWLE RDDKNPIPTA AQHSNLYDYF GFQPADASRY
PVTLTANFTF ADDAMLIRSA LQTNNLEDLI NNPDDQTVSK KIPDPVHLQT RVDGTLQPVI
PGTSWAGVLR HRALRILNTL KVATAEQQLD ELFGFVIEQQ AKAQASRIMI KDSIIQHPAT
EPLVQNRIAI DRFTGGAFDG ALFSEMPVWK TDQTCVTLEI SIKPPRPKKE AQQDQQKPED
TPQPDPKPTP KFNQAEVGLL LLLLKDLWTG DLAIGGTSSI GRGRLQGLEA TLTVDGAEFC
FKQATDGIGS LHITGTGKRD QLQMYVEAIG ASS
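
The sequences above are displayed in blocks of ten residues, so the spaces and line breaks need to be stripped before any analysis. The helpers below are a rough sketch of that cleanup plus a simple GC-fraction calculation; the function names are hypothetical, and the record does not say how its own GC content figure was computed.

```python
def clean_sequence(text):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence as displayed in this record.
last_line = "CAATTACAAA TGTATGTTGA AGCGATCGGA GCGTCCTCAT GA"
tail = clean_sequence(last_line)

# The gene ends in TGA, a stop codon under translation table 11.
assert tail.endswith("TGA")
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.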