Gene Haur_4118 details

Gene Information

Locus tag: Haur_4118
Symbol:
ID: 5735979
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5265134
End bp: 5267992
Gene Length: 2859 bp
Protein Length: 952 aa
Translation table: 11
GC content: 52%
IMG OID: 641281272
Product: excinuclease ABC, A subunit
Protein accession: YP_001546878
Protein GI: 159900631
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
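
The start, end, gene length, and protein length fields above can be cross-checked arithmetically. The sketch below assumes that the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. Neither convention is stated in the record, although both are consistent with the listed values.

```python
# Values copied from the gene record.
start_bp = 5265134
end_bp = 5267992
gene_length_bp = 2859
protein_length_aa = 952

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts the terminal stop codon,
# which does not contribute an amino acid to the protein.
coding_codons = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)            # coordinates match the gene length
print(gene_length_bp % 3 == 0)              # length is a whole number of codons
print(coding_codons == protein_length_aa)   # codons match the protein length
```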


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCGAAA TTGATTCGAT TCAGATTGTT GGCGCACGCC AGCATAATCT CAAAAATATT 
GATTTAGCCA TTCCCAAAGG CAAATTGGTG GTATTCACCG GCCCATCGGG GGCTGGCAAA
TCAACACTCG CCTTTGATAC GATTTATGCT GAAGGCCAGC GTCGCTATGT TGAATCGCTT
TCGAGCTATG CCCGCCAATT TTTGGGTCAA TTGCCCCGAC CTGAAGTGGA TAGCATTCGT
GGTCTAGCGC CAGCAATTGC CGTCGCCCAA CAAACGATCA ACCGTTCGCC GCGCTCAACC
GTTGGCACAA TCACCGAAAT TTATGATCAT TTGCGCTTGT TGTATGCGCG GATTGGCAAA
CCGCATTGCC CAGTCTGTGG CCGTGCGATT GAGCAACAAA CTGCTAGCCA AATCGTCGAT
CAGATTTTGG GTTATCCTGA CGGCACACGC CTGATGATTT TAGCGCCCTT GGTGAATGAA
GAACTTGGTT CGCATGCCAC GGTGCTTGAG CAAACGCGAC GGGCTGGCTT TGTGCGGGTA
CGGGTTGATG GCGCAATTGT TGATCTTGAT GAACCAATTG AGTTGGATCA TCGTCAAGCC
CATAGCATAG ATGTGGTGGT CGATCGGCTG ATTATTCGCC ACAGCGAGGC CGTGAGCTTG
AATGACCATC CGGATCGGGT ACGGGTGAGC GATTCGGTTG AAACAGCCTT GAAAACTGGC
GCGGGAATGG TCTGGGTTCA GCCGCTTGAT GGTCAACAGC TGCGTTACAG CGAACATGCT
GCATGCCCTG AACATGGGCC ATTGGCCAGC GGCGCGATCG AACCACGCAG CTTTTCATTT
AATAGCCCAC ATGGTGCTTG CCCGATTTGC GATGGATTAG GCACAGTTGA TGATTTCGAC
CGGAACCTGT TACAATCTCA GGCAGCGCAA ACCCTTGGCG AACTGCTTTC GAATCCACTT
CGCGCCAGCA CCACGACCTA CCAACAATAT TGGGAAGAAA CCATCCAAGG TTTAGCTGAA
GCTTTGGGCA GCGATCTCGA ACAGCCTGTT GACACAATGG CTAGCTGGGC GTTAGATTTA
TGGCTAGAAG GCACAATTCC AAGCGATGAT CAACTACATT TAAGCAAAAA GCTACGCCAA
CAGCTCGCCA ACTGGTCAGG TTTGCTCGGT TGGTTGCGTC AACATTGGCA ACAAGCAAGC
GAACAACAGC GCGAAAGCCT CAGCATCTAT CGCCAAGGCA CCATTTGTTC GGCCTGTGAA
GGTTCGCGCT TGCGACCAGA GGCGCGGGCA GTTACACTAC AGGGGCTAGC AATCGATCAA
GTTACAGCAA TGTCGATTGA GGCTAGTTTT GCTTGGGTTA GTGAACTACC CAACAAATTG
CGGCGCGAGC GTGAGCAACA GATTGCAGCG CCGATTGTGC GTGAAATGTG CTTGCGTTTA
CAGTTCTTGC GCGAAGTTGG CCTAGACTAT TTGAGTTTGG CGCGAACTGC TGAGAGCCTA
TCGGGCGGCG AGGCTCAACG GATTCAATTG GCCACCCATA TTGGAGCTGG ATTATCAGGG
GTCTTGTATG TCTTGGATGA GCCATCGATT GGCTTGCATC CACGCGATAC TGAGCGTTTA
TTACAGACAT TATTGCAACT ACGCGACCTT CGCAATTCAG TTTTAGTAGT CGAGCATGAT
CCAGCGATTA TTGCTGCTGC TGATTGGGTG GTTGAAGTTG GGCCGACAGC GGGCGTGCAA
GGTGGCTATA TTATGGCCAG CGGCACGCTT GAGCAACTGA TAGCTCAGCC CAATTCCCAA
ACAGGCCAGT ATGTAGCTGG GCAACGGCAG CTAAGCCTGC CTCAAACACG GCGAAAGCCA
ACCCATGCTA CACTAATGCT ACGTGGAGCC AAACAGCATA ATCTCAAAGA TCTGGATGTA
GCGATTCCAT TGGGATGTTT GGTTGCGATT ACGGGAGTAT CAGGTTCGGG GAAATCGACA
CTTATTCATG AGATTCTCTA CCCACGGCTG GCCAACGAAC TGCATGGAAG CCGCCTGCCA
GTGGGCCGAC ATCGAAGCCT TGAAGGCTAT GATCAACTTG AAAAAGTGAT TGCAGTTGAT
CAAACGTCAC TGGGGCGCTC AGGTCGTTCC AACGCTGCCA CCTACACCGG TATTTTCGAC
GCATTGCGTC AATTGTTTGC TGGGACACCT GAAGCCAAAG CGCGGGGCTA TGGAGCCAGT
CGATTTTCCT TTAATCTTAA GGGAGGGCGG TGCGAACAAT GTCGCGGCGA AGGTGTGGTA
TCGATTGCCA TGCAATTTCT GCCAGATCTA GCGGTGACCT GCGACGCATG CGGCGGGTTG
CGTTATAATC GAGAAACCCT TGATATTCGC TATCGTGGGT ATACCATTGC TGATGTGCTC
GCCATGACCG TAGGCCAAGC CTTAAGCGTT TTTGAACGGC TACCGGCTTT GGCGCGAAAA
CTAGAGAGTT TGGTTGAGGT CGGGTTAAGC TATTTGACGT TAGGGCAGCC AGCGGCGACT
CTTTCGGGAG GCGAAGCGCA ACGGGTAAAA CTGGCGGCAG AGCTAGCGCG TCGAGGAACA
GGCCGAACCC TGTACATCCT TGATGAACCA ACCACCGGAT TATATTGGAC GGATGTCGAA
CGGTTAATTG CGATATTGCA ACGATTGGTT GATACAGGAA ACAGCGTTGT GGTGATCGAA
CATCATCTCG ACCTGATCAA GACCGCCGAT TGGGTGATCG ATTTAGGGCC GGAAGGTGGG
GACACTGGTG GCCGGCTGGT AGTGGCTGGT ACGCCGGAAG TAGTCGCTAT GAATCAAGCT
TCATGGACTG GCCGCTTCCT TCAAACCGTG TTAGCTTGA
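
The GC content field in the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is supplied as text in the space-separated 10-base groups shown above. The helper name `gc_percent` is hypothetical, and only the first sequence line is used for illustration, so the result is not expected to equal the whole-gene figure.

```python
def gc_percent(seq_text: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)

# First line of the gene sequence only (illustrative input).
first_line = "ATGGCCGAAA TTGATTCGAT TCAGATTGTT GGCGCACGCC AGCATAATCT CAAAAATATT"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence (all lines joined) to the same function would give the value to compare against the record.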
 
Protein sequence
MAEIDSIQIV GARQHNLKNI DLAIPKGKLV VFTGPSGAGK STLAFDTIYA EGQRRYVESL 
SSYARQFLGQ LPRPEVDSIR GLAPAIAVAQ QTINRSPRST VGTITEIYDH LRLLYARIGK
PHCPVCGRAI EQQTASQIVD QILGYPDGTR LMILAPLVNE ELGSHATVLE QTRRAGFVRV
RVDGAIVDLD EPIELDHRQA HSIDVVVDRL IIRHSEAVSL NDHPDRVRVS DSVETALKTG
AGMVWVQPLD GQQLRYSEHA ACPEHGPLAS GAIEPRSFSF NSPHGACPIC DGLGTVDDFD
RNLLQSQAAQ TLGELLSNPL RASTTTYQQY WEETIQGLAE ALGSDLEQPV DTMASWALDL
WLEGTIPSDD QLHLSKKLRQ QLANWSGLLG WLRQHWQQAS EQQRESLSIY RQGTICSACE
GSRLRPEARA VTLQGLAIDQ VTAMSIEASF AWVSELPNKL RREREQQIAA PIVREMCLRL
QFLREVGLDY LSLARTAESL SGGEAQRIQL ATHIGAGLSG VLYVLDEPSI GLHPRDTERL
LQTLLQLRDL RNSVLVVEHD PAIIAAADWV VEVGPTAGVQ GGYIMASGTL EQLIAQPNSQ
TGQYVAGQRQ LSLPQTRRKP THATLMLRGA KQHNLKDLDV AIPLGCLVAI TGVSGSGKST
LIHEILYPRL ANELHGSRLP VGRHRSLEGY DQLEKVIAVD QTSLGRSGRS NAATYTGIFD
ALRQLFAGTP EAKARGYGAS RFSFNLKGGR CEQCRGEGVV SIAMQFLPDL AVTCDACGGL
RYNRETLDIR YRGYTIADVL AMTVGQALSV FERLPALARK LESLVEVGLS YLTLGQPAAT
LSGGEAQRVK LAAELARRGT GRTLYILDEP TTGLYWTDVE RLIAILQRLV DTGNSVVVIE
HHLDLIKTAD WVIDLGPEGG DTGGRLVVAG TPEVVAMNQA SWTGRFLQTV LA
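
The protein sequence corresponds to the gene sequence read codon by codon. The following is a minimal sketch, assuming the standard codon assignments, which translation table 11 shares for elongation (the two tables differ in permitted start codons, which does not matter here because the gene begins with ATG). The function name `translate` is hypothetical, and only the first 60 bases of the gene are used.

```python
from itertools import product

# Amino acids for the 64 codons in T, C, A, G order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}

def translate(seq_text: str) -> str:
    """Translate DNA in reading frame 1, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence from the record.
gene_start = "ATGGCCGAAA TTGATTCGAT TCAGATTGTT GGCGCACGCC AGCATAATCT CAAAAATATT"
print(translate(gene_start))  # MAEIDSIQIVGARQHNLKNI
```

The printed residues match the first 20 residues of the protein sequence listed above.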