Gene Haur_3179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3179 
Symbol 
ID5735054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4017007 
End bp4020078 
Gene Length3072 bp 
Protein Length1023 aa 
Translation table11 
GC content51% 
IMG OID641280325 
ProductSMC domain-containing protein 
Protein accessionYP_001545944 
Protein GI159899697 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID[TIGR00618] exonuclease SbcC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0019664 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGCCAG AGAAGCTGCG TGTTCGTAAT TTTATGTGTT ATCGCGATGA TGTTCCAACC 
CTCGATTTCG AAGGAATTCG GGTTGCTTGT TTGTCGGGCG AAAATGGCGC TGGCAAATCG
GCCTTGCTCG ATGCAATCAC TTGGGCGCTT TGGGGCAAAG CTCGCGTTTC CTCCGATGAT
GAATTGATTG CGTTGGGTGC ACAAGAGATG GAAGTTGATT TGCAATTTAG CGTTGCCAAA
ACTTCCTATC GTGTGTTGCG CCGCCGTTCA TCAGCCAAAC GCGGCCAAAC CATCCTCGAA
ATTCAAGTTA ACGATGGTGA TAATTGGCGA GCAATTTCTG GCAATAGCAT CCGAGAAACC
CAAGAGATCA TTCATTCTGT GCTGCGCATG GAATACGATA CCTTCATTAA TAGTGCTTTT
TTGGTGCAAG GCAAAGCCGA TGAATTTACC CGCAAAGCGC CTGCTGAACG CAAACGAGTG
CTGGCCGAGA TTTTAGGTCT CGATGCCTAC GAACAACTTG AAGCCCGTGC CAAAGAGCAG
GTGCGCTATT TCAGCGACCG CGCTCAAGGC CTCGAAGGTA CAATCAGCAG CTACCGCGAA
TGGGTCAATA AGCGTGATTT TTACCTGAGC CAAGAGGCCG AAGCGCAACA ACGGGTGCAG
CAGCTTAGCC ATGAAATCGA ACGAGCAACA GTAGTGTTTG AGCAAGCCGA CCAACAACGC
CGCGCACTCG AACATCGTAA GGCCGAACGC GACCGCGAGC TAAGCCGCAG CCGCGACCTC
GAACGTCAAA TTGCCGAAAG CGATGGCGAT TATCGCCAAC TGGCTCAAGA TATTGGTGTG
GCACAAGCGA TGGTCGCTCG TCAAAGCGAA ATTGAAACCA ATTTTCAAGT CTTGCAAGCT
GCGCGTGAAG AGTTAGTGGT GCTTGATCGC TTGCGTGAGC AAATTGTCCA GCTCAACGAT
CAATACAAAG AGCAACGCGC TTTGGTCGAG CGCGAAGAAC TTTCGTTGCA ACATCAACTT
CAGCAATATC AGCGCGATCA GCAAACCAAC GCCGAATTGA TCGCCAATAA AGCCAGTAAG
CAAGCAGATT TGGCCGCCTT GGCCGAGCAA TTAGCCGGTT TTGCCGACGA TCAACGCCAT
TTAGAGCAAG CCCGCAGCCA GCGCAACGAG CTAGAGCAAC AAAGCACCAA CTTAACTCAA
TTGCAGGTTG AGGTGCAACG ATTGCAAGGC TTAATCAACG TGCGCGGCGA TTCGTTAATT
GCCGCACGCG AGGAGCAAAA TCGCCGAATT ATCGAGGCCG ACAATGTTTT GGCCAACGAA
TCGCGCTGGC GCAGTGAATT TGAAACCTCG GTTGCCCAAC AAAAAGCCCT GATTCGCGAT
GAACGCAAGC TCCAAGAAAT GCGCAGCCGC GACCAACAAG ATGCCAAGCA ATTGGGCGAA
CTATCGGCAC AAGAGAGCAA TTTAAAGCAA CTTGGCAAGC AAATTAACGA TAAAATCGAC
CAACTACGGG CAACCCACGA TACGCATTGC CCACTCTGCC AGAGCGATAT CGGCCAGCAT
GGCATTCATC AAGTGATCGA GCAATATATC GTTGAGCGCG ACGATCTGCG TGATCAATAT
CGTGAGGCCA GCCAAGCTCG CAAACAACTA AGCAATGAAT ATGATGCTCG CCAGCGCGAA
ATTCAAGGTT TAGAGCGCAA AGTTGCCCAG CTGAGCACTC TGGCAGCAGC AGTTGGTCGC
TTGGAAGGCC AAATCAGCGA AGCCCAAGAA CATCGTAGCA AACGCAGCGA AGCCGAAAGC
ACCTTGCGCG ATTTGAATCA GCGGCTTGAA CATGGCGATT TTGCCCATGA AGAACGCGCC
GCCTTAGCCC AAGCCCAAAC TGAAATCGCC GAATTAGGCC TTGATCAAGC GGCGCTTGAT
GCCCAACGCC AAGCCAATGG CCGATTAATC GCCCAGCTCG AGCAACGGCT AGCCCAACGT
GGCACAATCG AGGCCAAAGC CGCCGTACTC CAAGAACAAC TTGAGCGAAT TCAGGCGGCT
GAAACCCACA ATGCCACATT GCACGAAACA ATTTTGAGCT TGCAAGTCCA GATCGATAGT
CAGCAATTTG CCCAAACTGC ACGGCAACAG GCCGAGGCAA TTTACCAGCA AATGGCTGAA
TTGGGCTATG CCTCACAACG CCATCAAGAA GTGCGCGACG CGGTTGCCAG CCTTGGCCAT
TGGGAAGGCG AATATCATCA ATTGCGTTCG GCGCAAACCA ACTTGGCAAC CAACCAACGC
CAAGCCCAGC GTTTGGCCGA ATTGATCGAG CGCCAACGCC AAGATTTAGC CCAAATTCAA
GTAATGCTCA ATCAACTCAA CCAAGAATTA GCCCAATTGC CAGCCGCTAT TCAAGCTGCC
GAAACCGCTC AACGCACAAT CAACGAGTTT CGCGGGCGCT TGGCCGTTGC TCAAAAAGAT
TTGGGTGCAG CTCAGCAAAA CGTCCAGCAT GTGGCTCAAG TCGCTGAGCA GTTGGCCGCA
GCCGAAAAAG AGCTGCTCTC AGTGCAAGAT CAACGCGATG TTCACAGCGA ATTGGTACGA
GCTTTTGGCA AAAAAGGCAT TCAAGCAATG CTGATCGAAA CGGCGATTCC TGAGCTTGAG
CGCGAAGCTA ACGAATTGCT CAGCCGCATG ACCGACAACC AAATGCACTT GCGCTTTGAA
ACCCAACGCG AAACAAAAAA AGGCGATACC AGCGAAACTC TCGATATTCA AATTGCCGAT
GAACAAGGCA CGCGCCGCTA CGATTTGTAT AGCGGTGGCG AGGCCTTCCG CATCAACTTT
GCAATTCGGA TTGCCATGAG CAAAATGCTG GCTCGTCGCG CCGGAGCCAA CTTGCAAACA
TTAATTATTG ACGAAGGCTT TGGTTCGCAA GATGGTCGTG GGCGCGAACG CTTGGTCGAG
GCTATCACCC AAGTTCAGCC AGATTTCAGT CGCATCTTGG TAATCACCCA CATCCAAGAA
TTGAAGGATC AATTTCCGGT GCAGATCGAA ATTACCAAAC ACGACAACGG CTCACGTTGG
GCGGTGAATT AG
 
Protein sequence
MLPEKLRVRN FMCYRDDVPT LDFEGIRVAC LSGENGAGKS ALLDAITWAL WGKARVSSDD 
ELIALGAQEM EVDLQFSVAK TSYRVLRRRS SAKRGQTILE IQVNDGDNWR AISGNSIRET
QEIIHSVLRM EYDTFINSAF LVQGKADEFT RKAPAERKRV LAEILGLDAY EQLEARAKEQ
VRYFSDRAQG LEGTISSYRE WVNKRDFYLS QEAEAQQRVQ QLSHEIERAT VVFEQADQQR
RALEHRKAER DRELSRSRDL ERQIAESDGD YRQLAQDIGV AQAMVARQSE IETNFQVLQA
AREELVVLDR LREQIVQLND QYKEQRALVE REELSLQHQL QQYQRDQQTN AELIANKASK
QADLAALAEQ LAGFADDQRH LEQARSQRNE LEQQSTNLTQ LQVEVQRLQG LINVRGDSLI
AAREEQNRRI IEADNVLANE SRWRSEFETS VAQQKALIRD ERKLQEMRSR DQQDAKQLGE
LSAQESNLKQ LGKQINDKID QLRATHDTHC PLCQSDIGQH GIHQVIEQYI VERDDLRDQY
REASQARKQL SNEYDARQRE IQGLERKVAQ LSTLAAAVGR LEGQISEAQE HRSKRSEAES
TLRDLNQRLE HGDFAHEERA ALAQAQTEIA ELGLDQAALD AQRQANGRLI AQLEQRLAQR
GTIEAKAAVL QEQLERIQAA ETHNATLHET ILSLQVQIDS QQFAQTARQQ AEAIYQQMAE
LGYASQRHQE VRDAVASLGH WEGEYHQLRS AQTNLATNQR QAQRLAELIE RQRQDLAQIQ
VMLNQLNQEL AQLPAAIQAA ETAQRTINEF RGRLAVAQKD LGAAQQNVQH VAQVAEQLAA
AEKELLSVQD QRDVHSELVR AFGKKGIQAM LIETAIPELE REANELLSRM TDNQMHLRFE
TQRETKKGDT SETLDIQIAD EQGTRRYDLY SGGEAFRINF AIRIAMSKML ARRAGANLQT
LIIDEGFGSQ DGRGRERLVE AITQVQPDFS RILVITHIQE LKDQFPVQIE ITKHDNGSRW
AVN