Gene Haur_4583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4583 
Symbol 
ID5736428 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5862018 
End bp5865059 
Gene Length3042 bp 
Protein Length1013 aa 
Translation table11 
GC content51% 
IMG OID641281745 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001547342 
Protein GI159901095 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGCAGG ATAAAATTGT CATCAAGGGC GCACGTGAGC ACAATCTTAA AAATATCGAC 
ATCGAACTAC CACGCGATCA ATTAGTTGTG TTGACGGGAG TTTCTGGCTC AGGCAAGTCA
TCGTTAGCCT TTGATACCTT ATATGCCGAA GGCCAGCGCC GCTATGTTGA ATCGCTTTCC
TCGTATGCCC GCCAGTTTTT AGGCCAACTC GAAAAGCCCA AGGTCGATTT TATTGGTGGG
CTTTCGCCAG CAATCGCGAT CGAACAAAAA TCGGCCTCGA AAAATCCGCG CTCAACTGTG
GGCACGGTCA CCGAAATTTA TGATTATTTG CGGGTCTTGT TTGCGCGGGT TGGGGTGCCA
CACTGCCATA AATGCGGTAA AGCGATCGGC TCACAAACTG CTGAGCAGAT GGTCAATCGG
GTGCTGGAGT TACCCAAAGG CACGCGCTTT ATGCTTTTAG CGCCGATGGT AGCCGCTCGC
AAAGGCGAAT ATAAAGATAT TTTCGATGAT GCCCGCTCGC AGGGGTTTTC GCGGGCACGG
GTCGATGGCG AAATTCGTGA TCTGCAAGAC GAAATCAAAC TCAATAAAAA AGTTAAACAT
ACAATTGATA TTGTGGTTGA TCGCTTGGTC GTGCCAACCG AGGATGACGA AACCTTTCGT
TCACGACTTA ACGATAGTGT TGAAACCGCG CTACGCACCG CCAATGGCAC AATTATTATC
GCTATTCCTG AGTTTGCCCA ACAAACCGAA AAATCAAAAT CCAAAAAAGC CAAGGCCAAA
GCGGTTGCTA CCGAAGTTAA TGAAGAAGAA TTGTTGCCCG AAGAAGAATC GCGTATTAGC
GGCATGAACG CTGCTGGCGA TATTGTGATG AGCGAGGATT ATGCCTGTGT TGATTGTGGC
ATCTCGTTCC TTGAACTCAA CCCACAAATG TTTTCGTTCA ATGCTCCCCA AGGGGCTTGC
CCAGAATGTG CTGGCTTGGG TACGCGTTTA GAAGTTGATG CAGAACTGCT CGTGCCAAAC
CCGAATTTAT CGTTGCACGA TGGTGCGGTA ACCTATTGGG GCGAGTTGCG CAAAAAAGTT
GGTAGCTGGG GCTATCGCGC TCTGCAAGCC ATCTCCGCCC ACTATCAATT TGATCTTGAT
ACACCGTGGA AAGACCTCAG CCCGCGTGTG CGTGAGATTT TGATGCAAGG CTCAGGCAGT
GAAAAAGTCA AGCATGTTTG GAGCGAAGGT CAGTCCAAGG GCGAGTATTA TCGTCGTTGG
GAGGGCTTGG GCGCAGAAAT TATGCGCCGC TTCCAGCAAA CTGGTGTTGA AAATATGCGC
GAGCACTACC AACAATGGAT GAGCGATCAG CCTTGTCATG CCTGCCATGG CGCAAAATTG
CGGCCTGAGA GTTTGGCAGT GACGGTTGGT GGTGAAAATG TGCAACAAAT TTGTGCCAAA
ACTGTGGCTC AAGGCTATGC CTGGGCCTGT GGTTTAACTG GCAGCGATGC CCACTGGGTG
AGCCGTGATG GTTTGGATAC CACGGTATTG CCGTTGTTGG CTGCCGCGCC AACCCCGACC
AAACTCGATG GTCGGCAATT GGAAATTGCT GGCGAAGTGC TCAAAGAAAT TCGTGAGCGC
TTGGGCTTTT TGTTGAATGT TGGTTTGCAC TATCTCACGC TCGATCGCTC AGCTCCCTCG
CTGTCTGGTG GCGAGGCTCA GCGCATCCGC TTGGCTTCAC AAATTGGGGC GGGCTTGATG
GGCGTGATGT ATATTCTCGA CGAGCCATCA ATTGGCTTGC ATCAACGTGA TAATCGCAAA
TTGATCGATA CTTTGACCAA ACTGCGCGAT CTTGGCAACA CCGTAATTGT GGTTGAACAC
GACGAAGATA CCATGAAAGC CGCTGATTGG CTGGTAGACT TCGGGCCTGG CGCGGGAGTC
AACGGCGGCA AGGTGGTCGC TGAAGGTCGG CCAGAATATA TCAGCAGCAA TGGTTCGTTG
ACCGGTAGTT ATTTGTCGGG GCGCTTGAAA ATTGAAGTGC CAGCAACCCG CCGCCCAGCC
CATGGTCATA TTACTTTGCG CGGTGCAACC CACAACAACC TCAACAACCT TGATATTACG
ATTCCTTTGG GCACGCTGGT CGCGGTAACT GGGGTTTCTG GCTCAGGCAA ATCCTCATTG
ATCACCGAAA CGCTCTATCC AGCCTTGGCA AATTTGCTCA ATCGCGCCCA GCTGCGGGTT
GGCAAATACG ATACGCTCGA AGGTTTAGAG CAACTTGATA AAGTGATCGA TATTGATCAA
CAGCCGATTG GCCGCACGCC GCGCTCGAAT CCTGCAACCT ACGTCAAGTT ATTTGATCAG
ATTCGTGAGG TGTTTGCCAA TACGCCTGAT GCTAAACTGC GCGGCTATGA GCCAGGCCGC
TTTTCGTTCA ACGTTAAGGG TGGGCGCTGT GAAGCCTGCC AAGGCAATGG CGAGCAAAAG
ATCGAAATGC ACTTCTTGGC TGATGTGTGG GTGCGCTGCG ATGAGTGCAA AGGCAAGCGT
TACAATCGCG AAACCTTGCA AGTGCGCTAC AAAGGCAAAA CGATCTCCGA TGTGCTGGAT
ATGGATGTGC ACACTGCGCT CGAATTCTTC GAGAACCACC CCAAACTCAA GCGCGTGCTG
CAAACCTTGC ACGATGTTGG TTTGGATTAC ATCAAACTTG GCCAATCGGC GACGACGCTC
TCTGGTGGCG AGGCTCAGCG GGTCAAATTG GCCAAAGAAT TAGCGCGGGT GGCCACTGGT
CGCACGATCT ACATTCTCGA TGAGCCAACT ACAGGCTTGC ACTTCGCCGA CGTACAAAAT
CTGTTGCGGG TCATTCAGCG CTTGGTCAAG GCTGGCAACA CGGTTTTGGT GATTGAACAC
AGCCTCGATG TGATCAAAAC CGCCGACTGG ATTATTGATC TTGGGCCTGA GGGTGGTACT
GGCGGCGGCT ATATTATTGC CCAAGGCACG CCTGAAGAAG TGGCCTTGCA TCCAACTTCG
CACACAGCAG TCTTTTTGCG CGATTTGCTG AACCTTGACT GA
 
Protein sequence
MAQDKIVIKG AREHNLKNID IELPRDQLVV LTGVSGSGKS SLAFDTLYAE GQRRYVESLS 
SYARQFLGQL EKPKVDFIGG LSPAIAIEQK SASKNPRSTV GTVTEIYDYL RVLFARVGVP
HCHKCGKAIG SQTAEQMVNR VLELPKGTRF MLLAPMVAAR KGEYKDIFDD ARSQGFSRAR
VDGEIRDLQD EIKLNKKVKH TIDIVVDRLV VPTEDDETFR SRLNDSVETA LRTANGTIII
AIPEFAQQTE KSKSKKAKAK AVATEVNEEE LLPEEESRIS GMNAAGDIVM SEDYACVDCG
ISFLELNPQM FSFNAPQGAC PECAGLGTRL EVDAELLVPN PNLSLHDGAV TYWGELRKKV
GSWGYRALQA ISAHYQFDLD TPWKDLSPRV REILMQGSGS EKVKHVWSEG QSKGEYYRRW
EGLGAEIMRR FQQTGVENMR EHYQQWMSDQ PCHACHGAKL RPESLAVTVG GENVQQICAK
TVAQGYAWAC GLTGSDAHWV SRDGLDTTVL PLLAAAPTPT KLDGRQLEIA GEVLKEIRER
LGFLLNVGLH YLTLDRSAPS LSGGEAQRIR LASQIGAGLM GVMYILDEPS IGLHQRDNRK
LIDTLTKLRD LGNTVIVVEH DEDTMKAADW LVDFGPGAGV NGGKVVAEGR PEYISSNGSL
TGSYLSGRLK IEVPATRRPA HGHITLRGAT HNNLNNLDIT IPLGTLVAVT GVSGSGKSSL
ITETLYPALA NLLNRAQLRV GKYDTLEGLE QLDKVIDIDQ QPIGRTPRSN PATYVKLFDQ
IREVFANTPD AKLRGYEPGR FSFNVKGGRC EACQGNGEQK IEMHFLADVW VRCDECKGKR
YNRETLQVRY KGKTISDVLD MDVHTALEFF ENHPKLKRVL QTLHDVGLDY IKLGQSATTL
SGGEAQRVKL AKELARVATG RTIYILDEPT TGLHFADVQN LLRVIQRLVK AGNTVLVIEH
SLDVIKTADW IIDLGPEGGT GGGYIIAQGT PEEVALHPTS HTAVFLRDLL NLD