Gene Haur_4600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4600 
Symbol 
ID5736445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5881776 
End bp5884052 
Gene Length2277 bp 
Protein Length758 aa 
Translation table11 
GC content52% 
IMG OID641281762 
ProductUvrD/REP helicase 
Protein accessionYP_001547359 
Protein GI159901112 
COG category[L] Replication, recombination and repair 
COG ID[COG0210] Superfamily I DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000134986 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCACC CGCTGCTGAT TGGGTTGAAT GCCCAACAAC AACGCGCAGT CCAAGCGATT 
CATGGGCCAG TGTTGGTGCT GGCTGGCCCA GGCTCGGGCA AGACCCGCGT GCTGACTCAT
CGGATTGCCT ATTTAATTAA TGAAGTTGGG GTTCGCCCCT ATAACATTTT AGCCGTCACC
TTTACCAACA AGGCTGCCCG CGAAATGCGC GAACGGATTG GAAACTTGAT TGGCGAAAGT
CGTGCTCACG ATGTAATGAT GGGTACATTC CACTCAATTT GCGCCCGTTG GTTGCGCCGC
GATATTCAGC ATCTGCAACG CGCCAACGAT TTTGTGGTTT ACGATGCCGA TGATCAGCAA
CGGGTAATGA AGCAGATTTT GCGCGAATTG GATTTAAGCG AGAAGCAATA TAATCCGCGT
TCAATTCATG CCCGCATCTC CGCTGCTAAA AACGAAATGA TTGGGGTTGC TGAGTTTGCC
CGCAGCGTTA GCAGCTATTT TGATGAAATT GTCTTGCGCT GTTATGAGCG CTACGAAAAA
CAATTGTTAG CCAATAATGC CCTCGATTTT GATGATTTGC TGCTCAAAAC CGTTAATTTA
TTCGAATATC ATCCCGATGT GCTAGCGCGG TATGCCGAAC GTTATGTGCA TGTCATGGTC
GATGAAGTGC AAGATACCAA TCGGGTGCAA TTTTCGTTGA TCAATCAAGT TGGTGCAGGC
CATAACAACT ACTTTTTGAT CGGCGATATT CAGCAATCAA TTTATGCTTG GCGTGGGGCG
AGGTTGGCGA ATGTGCGCGA ATTTGAAGAA GCCCACCCTG ATGTGCAAAT TATTCCACTG
GAGCAAAATT ACCGTTCCAC CCAACCAATT CTTGATGTCG CCCAATCGAT CATCGATGCG
GCCTATGATC GTCGCCATAC CACCAAAATC TGGACTGATC AGCAGGATGG CGAGTTGGTT
TCGTTGGTCG AGGCCTATGA TCACAATGAA GAAGCGCGTT GGGTGGCCGA TGAAATTATG
CGTATTCGCG GGCGTGAAGG CCGTTCGCTC GACGATTTTG CTGTGATGTA TCGCACCAAC
GCCCAATCCC GTGCCTTTGA AGAGGCCATG ATCAGCCGCA ACCTGCGCTA TCGATTGGTC
GGTGGCACGC GCTTCTACGA ACGCAAAGAA ATCAAAGATG TTGTGGCCTA CCTACGAATT
ATCCACAATC CCCACGATGA AGTTAGTTTG CTACGGGTGA TTAACGTGCC AGGCCGAAGC
ATCGGCGATC GCACTCAGCA AGAGCTATTG CAATGGGCAC GCAATCTCGA TGTCTCAATT
TGGGATGCCC TCGAACTGCT AGCTACTAAC GAAGCCCAAA GTCCGCCGAT CAGCGGGCGA
GCACGCAACG CCGTTGAGCA ATTTCAAAAA CTGGTAGCAA GCCTACGCGA TCTGCGCCAC
GATTTGATGC TCGGCGAGTT GATTCAGCGC TTGCTTGAGC GGGTGCCATT GCAAGAGTTG
CTGGTGGCAG AATATGGCGA GGAAGAAGGC GCTGAGCGTT GGGAAAATAT TCTCGAATTG
CAAAACGTCA GTATGGAATA TCTGGCCCTG CCGACCGAGG ATCAATTGCC ACGCTTTTTA
GAAGAAGTGG CCTTGGTTAG CGATGTTGAT AGCCTTGATT CCAACAAAGA GCGCGAGCCT
GGTGTGACCT TGATTACCTT GCACCAAGCC AAGGGTTTAG AGTATCCGGT AGTATTTTTG
GCTGGCTTAG AAGAAGGCTT GTTGCCGCAC GGACGCTCAG TTGACGACCC CGAAAGTATC
GAGGAAGAGC GTCGTTTGCT CTATGTTGGT ACAACCCGCG CCAAACAACG ACTGTACATG
CTCTATGCTT TCAAACGAGC AACTTGGGGC CGCACCGATA TCACGATTCC TTCACGCTTT
TTGGGCGATA TTCCCAAAGA TTTGCTGCAG CGCACGCCCA CCCGTGAAGT CAAACAAATG
CCAGTTCATG CCGCAAGCCA ATGGCAAAGC AGCACGCCGC AACGCACGCG AGGCACGCAA
CCAAGCACCA GCAGCATGTG GAGTGGCGCG AGTGGACCAG TCCGGCCAAA ACGCCCCGAA
CGTGAGCCAA GCGCCGCCAG TTATAGCGCT GGCGACAAAG TGCGCCATGC TAATTTCGGC
GAGGGCGTGG TGGTCAGTAG CAAAATGGTC GGCGACGACG AAGAAGTTAC CGTGGCGTTT
CCAGGCAAAG GCGTGAAAAA GCTCTTGGCC GCCTTCGCCA AGCTCGAACG GGTTTAG
 
Protein sequence
MTHPLLIGLN AQQQRAVQAI HGPVLVLAGP GSGKTRVLTH RIAYLINEVG VRPYNILAVT 
FTNKAAREMR ERIGNLIGES RAHDVMMGTF HSICARWLRR DIQHLQRAND FVVYDADDQQ
RVMKQILREL DLSEKQYNPR SIHARISAAK NEMIGVAEFA RSVSSYFDEI VLRCYERYEK
QLLANNALDF DDLLLKTVNL FEYHPDVLAR YAERYVHVMV DEVQDTNRVQ FSLINQVGAG
HNNYFLIGDI QQSIYAWRGA RLANVREFEE AHPDVQIIPL EQNYRSTQPI LDVAQSIIDA
AYDRRHTTKI WTDQQDGELV SLVEAYDHNE EARWVADEIM RIRGREGRSL DDFAVMYRTN
AQSRAFEEAM ISRNLRYRLV GGTRFYERKE IKDVVAYLRI IHNPHDEVSL LRVINVPGRS
IGDRTQQELL QWARNLDVSI WDALELLATN EAQSPPISGR ARNAVEQFQK LVASLRDLRH
DLMLGELIQR LLERVPLQEL LVAEYGEEEG AERWENILEL QNVSMEYLAL PTEDQLPRFL
EEVALVSDVD SLDSNKEREP GVTLITLHQA KGLEYPVVFL AGLEEGLLPH GRSVDDPESI
EEERRLLYVG TTRAKQRLYM LYAFKRATWG RTDITIPSRF LGDIPKDLLQ RTPTREVKQM
PVHAASQWQS STPQRTRGTQ PSTSSMWSGA SGPVRPKRPE REPSAASYSA GDKVRHANFG
EGVVVSSKMV GDDEEVTVAF PGKGVKKLLA AFAKLERV