Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3722 |
Symbol | |
ID | 5735586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4682500 |
End bp | 4683657 |
Gene Length | 1158 bp |
Protein Length | 385 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280874 |
Product | DNA replication and repair protein RecF |
Protein accession | YP_001546486 |
Protein GI | 159900239 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1195] Recombinational DNA repair ATPase (RecF pathway) |
TIGRFAM ID | [TIGR00611] recF protein |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.000891221 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATGTTT CTCGCCTGCA ACTCCAAGAC TTTCGAATTT ATCGCAGCCT CAATTTAGCC TTGCCTCCAG GCGTGTGTTT GTTCTATGGG GCCAATGCAG CGGGCAAAAC AACGATTCTG GAAGCATTAT ACTATTTGGC TACAACTCGC TCTTTACGGG CCTCGGTTGA ACGTGAATTA ATTGCCCTTG AAGCAGCAGG CGATCTTGGC TTACCGCCAT TTGCTCGCTT GGCCGCCAGC TTGCAACCGC AACCAGAGGC CGAAATGCAA ACGATTGAAA TTGTGCTGCA ACGCAAATTT GGCGCTGATG GCGATTTAGC CCCTACCACC AGCAAAACCA TTCGGATCAA TAAAATAGCT CGGCGAGCGC TCGATCTGAT TGGTCAGTTA CGGGTGGTGA TGTTCGCGCC GCAAGATTTA GAGTTAGTCA CGGGTGCGCC TGCTGAGCGG CGACGCTATC TCGATGTCAC GCTTTCGCAG ATCGATGGCC GTTATGTTCG CGCCCTTTCG CGCTACAACC AAGTGCTAAC CCAACGCAAC GGTCTATTGC GAACCAGTCG TGAGCGTGGT CGCGCTGCCA GCGAACAAGA TCTAGCATTT TGGGATGAGG AGCTAGCCAA AGCTGGGGTG TATGTGCTGC GCGAACGTCG CCGCGCCGTC ACCACGCTTG ACCAGCTTGC GCAACGCTTG TATGCCGAGA TTAGCGGCAG CGATTTAGAT TTACGTTTGA ACTACTTAGA TACAACGCCT GCTCACGATG TGCCAAGTTT TCAAGCAGCC TTGAAGCAGC TACGTCGTGA AGAGCGCGAA CGTGGCGTGA CGTTAATCGG CCCACATCGC GATGATCTTT CGATTCAATT GGCGGAGCGT GAAGTCGGCA GCTTTGGCTC GCGTGGCCAG CAACGAGCCT CGACCTTGGC TTTACGGTTG GCCGAAGCCG AATTGATGCA TAGTCGCACG GGCGATCGGC CTGTTCTGCT GCTCGATGAT TTGCTTTCAG AGCTTGATCA AAAACGCCGC GAACATCTAT TGACCACAAT TGTGCGTCCC CAGCAACAAA CCCTGATCAC TGCTACTGAT CTTGATGATT TTTCGCCTAA TTTTCTGAGC CAAATCACCC GAATGCATGT TGATCATGGC CTGATCTTCC CGGCGTGA
|
Protein sequence | MYVSRLQLQD FRIYRSLNLA LPPGVCLFYG ANAAGKTTIL EALYYLATTR SLRASVEREL IALEAAGDLG LPPFARLAAS LQPQPEAEMQ TIEIVLQRKF GADGDLAPTT SKTIRINKIA RRALDLIGQL RVVMFAPQDL ELVTGAPAER RRYLDVTLSQ IDGRYVRALS RYNQVLTQRN GLLRTSRERG RAASEQDLAF WDEELAKAGV YVLRERRRAV TTLDQLAQRL YAEISGSDLD LRLNYLDTTP AHDVPSFQAA LKQLRREERE RGVTLIGPHR DDLSIQLAER EVGSFGSRGQ QRASTLALRL AEAELMHSRT GDRPVLLLDD LLSELDQKRR EHLLTTIVRP QQQTLITATD LDDFSPNFLS QITRMHVDHG LIFPA
|
| |