Gene Haur_1716 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1716 
Symbol 
ID5733603 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1996244 
End bp1998481 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content49% 
IMG OID641278858 
ProductABC transporter related 
Protein accessionYP_001544487 
Protein GI159898240 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCAGG ATCGAGCGTT TATTGAGGTG TATGGCGCAC GCGAAAACAA CCTCAAAAAT 
ATTTCCTTGA ATATTCCAAA ACAGCGGGTG ACAGTCTTTA CCGGAGTCTC TGGTTCGGGC
AAATCGTCGT TGGTGTTTGA TACGATTGCC GCTGAGGCTC AACGCCAACT CAATGAAACC
TTTACCTTTT TTGTGCAAGG CTTTTTGCCG CACTATGGCC AGCCTGATGT CGAGCGGATC
GAGCATCTCA ACTCGCCGAT TATCATCGAT CAAAAGCGGG TTGGCGGTGG TTCGCGCTCA
ACCGTCGGAA CCTACACAGA TATTGCGGTG TTGCTGCGCT TGTTGTTCTC GCGGATTGGT
CAGCCCTACG TTGGGCCTGG CTATGCTTTT TCGTTCAACA CGCCGCAGGG TATGTGCCCC
GAATGTGAAG GCATTGGCAA AACGGTTCAG CTTGATATGG ATAAATTGCT TGACCGCAGC
AAATCGCTGA ATGAAGGTGC GATTTTGCAC CCCGAATTCA AGGTTGGCAA ATGGATGTGG
AAGATGTATC CGCTGTCTGG TTTGTTCGAT AACGATAAGC CGATTCAGGA TTATAACGAA
CAGGAATTGC AAGCATTTTT GTATGGCGCT GATCTTAAGG TTTCGTGGGG CGAATTTAGC
TCGAAATATG AAGGTTTGCT GGAACGTTTC GAGCGTATGT ACCTCAAAAA AGATGCAGCA
GCCATGTCGG ATCGCAATCG GGCCGTGTTT GAGCAATTTA CCTCTTCACA AATCTGCCCA
GTTTGCCATG GTGGGCGTTT GACTCAAGCC GCGCTCAATT GTCGGATTGC TGGGCGTAAC
ATTGCCGAAT TAGCCGATTT TGAAGCGACT GATTTGATTA GCTTTTTGGC TGATGTTACT
GATCCGATTG GTGATCGGGT GGCGGCCAAG TTGTTGGAGC GCATGCAACA GTTGGTCGAT
ATTGGCTTGG GCTATTTGAG CTTGAGCCGC GAAACCTCGA CGCTCTCTGG GGGCGAATCT
CAGCGCGTCA AAATGATTCG GCATCTTGGC AATAGCTTAA CTGAAATGCT CTATATTCTC
GATGAGCCAA GTGTGGGCTT GCATGCGCGT GATGTGGCGC GGCTGAATAA TTTATTGCAA
CAGTTGCGCG ATAAGGGCAA TACGGTGCTA GTGGTTGAGC ATGATCCCGA TGTGATTGCA
ATTGCCGACC ATATTGTCGA TATTGGGCCA CGCGCGGGCG TGCACGGCGG CGAGGTTGTG
TTTGAAGGCA GCTATGCCGA TCTCAAACGT TCAAATACCT TAACTGGCAG CTATTTGCAA
CAAGTAGTGC CAACCAAACA GCATTCGCGT CAGCCAACTG GCTACTTGCC GATTGTTAAT
GCCAACCTGC ATAACCTTAA AAACGTCAAT GTCAATATTC CAACGGGCGT GCTGACCGTG
GTGACAGGTG TGGCCGGATC AGGCAAAAGC ACCTTAATTA ATGAAGTCTT TTTGAGCCAA
CACCCCAATG CGATTGTGAT CGATCAATCA CGGGTAACGG CTAATAGCCG TTCGGCTCCG
GCCACCTATA CCGGAATTAT GGATGATATT CGCCAGACCT TTGCTAAGGC CAATGGCGTG
AGCGCTTCGT TATTCAGCTT CAACTCAACT GGCAGTTGCG ATAATTGTAA TGGTTTGGGC
TTAGTCTACA CCGATTTAGC CTTTATGGAA GGAATTTCCT CGACCTGTGA AATTTGCGAA
GGCAAGCGTT TCAAAGCCGA AGTGCTGGAA TATCGGCTGC GCGGCAAATC GATCAGCGAT
GTGTTGGATA TGACTGCCGA GGAAGCACTC GATTTCTTTA ACGAAAAGAA GATCAAGCCT
GTGCTCCAGG CTATGAATGA TGTTGGCTTG AGCTATTTGA AACTTGGTCA ACCACTCAGC
ACGATATCGG GTGGTGAGGG TCAACGGCTC AAATTGGCGA CCGAACTCCA CAAAAAGGCC
AGCGTTTATG TGATGGACGA GCCAACCACG GGTTTGCATC GCTCGGATAT TGGGCTGCTG
ATGGGGATCA TTGATCGTTT GGTTGATCTC AAGAATACCG TGATTCTGAT CGAACATCAC
TTGGATATTA TTCGTCAGGC CGATTGGATT ATCGACATTG GGCCTGAGGG CGGTAGCGCT
GGTGGCGAGA TTATTTTTGA AGGGCCACCC ATGGCCTTGA AAACTTGCCA ACGCAGCATT
ACCGCCAAGT TTCTCTAA
 
Protein sequence
MMQDRAFIEV YGARENNLKN ISLNIPKQRV TVFTGVSGSG KSSLVFDTIA AEAQRQLNET 
FTFFVQGFLP HYGQPDVERI EHLNSPIIID QKRVGGGSRS TVGTYTDIAV LLRLLFSRIG
QPYVGPGYAF SFNTPQGMCP ECEGIGKTVQ LDMDKLLDRS KSLNEGAILH PEFKVGKWMW
KMYPLSGLFD NDKPIQDYNE QELQAFLYGA DLKVSWGEFS SKYEGLLERF ERMYLKKDAA
AMSDRNRAVF EQFTSSQICP VCHGGRLTQA ALNCRIAGRN IAELADFEAT DLISFLADVT
DPIGDRVAAK LLERMQQLVD IGLGYLSLSR ETSTLSGGES QRVKMIRHLG NSLTEMLYIL
DEPSVGLHAR DVARLNNLLQ QLRDKGNTVL VVEHDPDVIA IADHIVDIGP RAGVHGGEVV
FEGSYADLKR SNTLTGSYLQ QVVPTKQHSR QPTGYLPIVN ANLHNLKNVN VNIPTGVLTV
VTGVAGSGKS TLINEVFLSQ HPNAIVIDQS RVTANSRSAP ATYTGIMDDI RQTFAKANGV
SASLFSFNST GSCDNCNGLG LVYTDLAFME GISSTCEICE GKRFKAEVLE YRLRGKSISD
VLDMTAEEAL DFFNEKKIKP VLQAMNDVGL SYLKLGQPLS TISGGEGQRL KLATELHKKA
SVYVMDEPTT GLHRSDIGLL MGIIDRLVDL KNTVILIEHH LDIIRQADWI IDIGPEGGSA
GGEIIFEGPP MALKTCQRSI TAKFL