Gene Haur_3722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3722 
Symbol 
ID5735586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4682500 
End bp4683657 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content53% 
IMG OID641280874 
ProductDNA replication and repair protein RecF 
Protein accessionYP_001546486 
Protein GI159900239 
COG category[L] Replication, recombination and repair 
COG ID[COG1195] Recombinational DNA repair ATPase (RecF pathway) 
TIGRFAM ID[TIGR00611] recF protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000891221 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATGTTT CTCGCCTGCA ACTCCAAGAC TTTCGAATTT ATCGCAGCCT CAATTTAGCC 
TTGCCTCCAG GCGTGTGTTT GTTCTATGGG GCCAATGCAG CGGGCAAAAC AACGATTCTG
GAAGCATTAT ACTATTTGGC TACAACTCGC TCTTTACGGG CCTCGGTTGA ACGTGAATTA
ATTGCCCTTG AAGCAGCAGG CGATCTTGGC TTACCGCCAT TTGCTCGCTT GGCCGCCAGC
TTGCAACCGC AACCAGAGGC CGAAATGCAA ACGATTGAAA TTGTGCTGCA ACGCAAATTT
GGCGCTGATG GCGATTTAGC CCCTACCACC AGCAAAACCA TTCGGATCAA TAAAATAGCT
CGGCGAGCGC TCGATCTGAT TGGTCAGTTA CGGGTGGTGA TGTTCGCGCC GCAAGATTTA
GAGTTAGTCA CGGGTGCGCC TGCTGAGCGG CGACGCTATC TCGATGTCAC GCTTTCGCAG
ATCGATGGCC GTTATGTTCG CGCCCTTTCG CGCTACAACC AAGTGCTAAC CCAACGCAAC
GGTCTATTGC GAACCAGTCG TGAGCGTGGT CGCGCTGCCA GCGAACAAGA TCTAGCATTT
TGGGATGAGG AGCTAGCCAA AGCTGGGGTG TATGTGCTGC GCGAACGTCG CCGCGCCGTC
ACCACGCTTG ACCAGCTTGC GCAACGCTTG TATGCCGAGA TTAGCGGCAG CGATTTAGAT
TTACGTTTGA ACTACTTAGA TACAACGCCT GCTCACGATG TGCCAAGTTT TCAAGCAGCC
TTGAAGCAGC TACGTCGTGA AGAGCGCGAA CGTGGCGTGA CGTTAATCGG CCCACATCGC
GATGATCTTT CGATTCAATT GGCGGAGCGT GAAGTCGGCA GCTTTGGCTC GCGTGGCCAG
CAACGAGCCT CGACCTTGGC TTTACGGTTG GCCGAAGCCG AATTGATGCA TAGTCGCACG
GGCGATCGGC CTGTTCTGCT GCTCGATGAT TTGCTTTCAG AGCTTGATCA AAAACGCCGC
GAACATCTAT TGACCACAAT TGTGCGTCCC CAGCAACAAA CCCTGATCAC TGCTACTGAT
CTTGATGATT TTTCGCCTAA TTTTCTGAGC CAAATCACCC GAATGCATGT TGATCATGGC
CTGATCTTCC CGGCGTGA
 
Protein sequence
MYVSRLQLQD FRIYRSLNLA LPPGVCLFYG ANAAGKTTIL EALYYLATTR SLRASVEREL 
IALEAAGDLG LPPFARLAAS LQPQPEAEMQ TIEIVLQRKF GADGDLAPTT SKTIRINKIA
RRALDLIGQL RVVMFAPQDL ELVTGAPAER RRYLDVTLSQ IDGRYVRALS RYNQVLTQRN
GLLRTSRERG RAASEQDLAF WDEELAKAGV YVLRERRRAV TTLDQLAQRL YAEISGSDLD
LRLNYLDTTP AHDVPSFQAA LKQLRREERE RGVTLIGPHR DDLSIQLAER EVGSFGSRGQ
QRASTLALRL AEAELMHSRT GDRPVLLLDD LLSELDQKRR EHLLTTIVRP QQQTLITATD
LDDFSPNFLS QITRMHVDHG LIFPA