Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2183 |
Symbol | |
ID | 4711124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2393901 |
End bp | 2396738 |
Gene Length | 2838 bp |
Protein Length | 945 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639856658 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001003749 |
Protein GI | 121998962 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.666287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACCATA TCCGCATCCG CGGCGCCCGC ACCCACAACC TCGACAACCT GGATATCGAC ATCCCGCGCA ACTGCCTGGT GGTGATCACC GGGCCGTCGG GGTCGGGGAA GTCGTCGCTG GCCTTCGACA CCCTCTATGC GGAGGGGCAG CGGCGCTACG TAGAGTCGTT GTCCGCCTAC GCGCGGCAGT TCCTGTCGAT GATGGACAAG CCCGACGTGG ATCACATCGA GGGGCTGTCG CCGGCGATCT CGGTCGAGCA GAAATCGGCG TCCCACAATC CGCGCTCCAC GGTGGGCACC GTGACCGAGA TCCACGACTA CCTGCGCCTG CTCTTCGCCC GGGCCGGCAT CCCGTACTGC CCCGAGCACC AGGTCGCCCT GGAGGCGAGC ACGGTCTCGG AGATGGTCGA TCGCATCCTG GCGGAGCCGG AGGGGAGCAA GATGATGCTG CTCGCCCCGC TGGTCGACGG GAGCCCCGGC GAGCACCGGC GCACCCTGGA GCAGCTGCGG AGCCAGGGGT ATCTCCGGGT GCGGATCGAC GGCCAGGTGG TGGAACTCGA TCCGCTGCCC CAGCTCGACG GCGACAGCGC CCACGATATC GAGGCGGTCA TCGACCGCTT TCGGGTCCGC GACGATCTCG CCGGGCGGCT GGCCGACTCC ATCGAGACCG CCCTGCGCAT CGGCGAGGGG GTGGTGCGGG TCGCCTGGAT GGACGAGCCG GAACGCGAGG CGCTGGTCTT CTCCGCGCAG CACGCCTGCC CGGAGTGCGG CCACGCGGTG GAGCCCCTGG AGCCGCGGAT GTTCTCGTTC AACAACCCTC AGGGCGCCTG CCCGACCTGC GACGGGCTGG GGACCCAGCA CTTCTTCGAC CCCGAGCGCG TAGTCAGCCG GCCGCAGCTG ACGCCGGCCG AGGGGGCGAT TCGTGGCTGG GACCGGCGCA ACCTCTACTA CTTCGCCATT CTGCAGGGGC TGGCGGCGCA CTACGGGTTC AGTCTGGAGA CCCCGTGGGC CGATCTGCCG GAGTCCACGC GACACTGCAT CCTCTACGGC TCCGGCGATG AGGAGATCGT CTTCCACTAC CCCGGCCGCA ATGGCCACAC GGAGCGCGTT CATCCCTTCG AGGGCGTGAT CCCCAACCTG GAGCGGCGTT TCCGGGAGGC CGAGTCCGCC ACCGTCCGCG ACGAGCTGGG TCGCTTCATG GCCCAGCGCA CCTGCCCGGA GTGCCAGGGC GGTCGACTGA ATCAGCGCGC CCGCAACGTG CGGGTGGAGG GCGTCGCCCT GCCGGATATC GCCGCGCTAC CCATCTACGT CGCCCGGGAG CGGGTGCGGG CCCTGGAGCC GGATGGGGCC CGCGGGGAGA TCGCCCGGCC GATCCTCGAG GAGATCCGGC AGCGCCTGGG CTTCCTGGAG GATGTGGGGC TGGGTTACCT GACCCTGGAC CGGGCGGCGG AGACCCTCTC AGGCGGCGAG GCGCAACGCA TCCGCCTGGC CAGTCAGATC GGCGCTGGCT TGGTGGGGGT GCTCTACGTC CTCGACGAAC CCTCCATCGG GCTGCACCCC CGGGACCACG ATCGGTTGCT CGACACCCTG CGTCGGCTGC GGGATCTGGG CAACAGCGTC ATCGTCGTCG AGCACGAGCC GGACGCCATG CGCGCCGCCG ACCACATCAT CGACATGGGG CCGGGCGCCG GGATCCACGG TGGCACGGTG GTGGCCGCCG GTACCCCGCA GGCGGTGGCC GAGCACCCGG ATTCGGTCAC CGGGGCGTTC CTCAGCGGCC GGCGCACCAT TGCGCTGCCG CAGCGCCGGC GTCCTCCGGA GGACGAGCGC TGGGTGCGGA TGACCGGCGC CCGCGGCCAC AACCTGCAGG ACGTGACCGC CGAGATCCCC GTGGGCTTAA TGACCTGTGT CACCGGGGTC TCTGGCTCCG GTAAGTCCAC GCTGATCAAC GACACCCTCT ACCGCAGCGC CGCCCGCGAC CTCAATGGCG CCCAGACCAG CCCCGCGGAG CACGATCGCG TCCACGGTCT CGAGCACTTC GAGAAGGTGG TGGACATCGA TCAGAGCCCC ATCGGCCGCA CGCCGCGCTC CAATCCGGCG ACCTACACCG GGGTGTTCGG CCCGGTGCGC GAACTCTTCG CCGCCACGCC CGAGGCGCGC GCCCGGGGCT ACAAGCCGGG GCGTTTCTCC TTCAACGTGC AGGGTGGGCG CTGCGAGGCG TGCCAGGGCG AGGGCGTGGT CCGCGTGGAG ATGCACCTGC TGCCGGATCT CTACGTCGCC TGCGACAGCT GCCACGGCAC TCGCTTCAAC CGCGAGACCC TGGAGATCCG CTACCGCGGC TACACCATCC ACGAGGTCCT GGAGATGACC GTGGACCAGG CCTACGAATT CTTCGAAGCG GTCCCCGCCA TCCGCCGCAA GCTCGAGACC CTGCGCGAGG TCGGCCTGGG CTATCTGCGC CTGGGGCAGA GTGCGACGAC CCTCTCCGGT GGTGAGGCGC AGCGCGTCAA GCTGGCCCGG GAGCTCTCGC GGCGCGAGCA CGGGCGCAAC CTCTATATCC TGGATGAGCC CACCACCGGG CTGCACTTTG CCGACGTGGA GCAGCTGCTC GCGGTGCTGC AGCGCCTGTG CGACCACGGC AACACCGTGG TGGTGGTGGA ACACGACCTG GACATCATGC GCTGCGCGGA CTGGATCATC GATCTGGGGC CGGAGGGCGG CGACGGCGGC GGGCAGATCC TCGCCGCTGG CCCGCCGGAG CACGTGGCCG AGAGTGCGGC CTCTTACACC GCCGCGTATC TAAGCCAGGC CCTCGGCACG GGGCCGGTCG CCGGATAG
|
Protein sequence | MDHIRIRGAR THNLDNLDID IPRNCLVVIT GPSGSGKSSL AFDTLYAEGQ RRYVESLSAY ARQFLSMMDK PDVDHIEGLS PAISVEQKSA SHNPRSTVGT VTEIHDYLRL LFARAGIPYC PEHQVALEAS TVSEMVDRIL AEPEGSKMML LAPLVDGSPG EHRRTLEQLR SQGYLRVRID GQVVELDPLP QLDGDSAHDI EAVIDRFRVR DDLAGRLADS IETALRIGEG VVRVAWMDEP EREALVFSAQ HACPECGHAV EPLEPRMFSF NNPQGACPTC DGLGTQHFFD PERVVSRPQL TPAEGAIRGW DRRNLYYFAI LQGLAAHYGF SLETPWADLP ESTRHCILYG SGDEEIVFHY PGRNGHTERV HPFEGVIPNL ERRFREAESA TVRDELGRFM AQRTCPECQG GRLNQRARNV RVEGVALPDI AALPIYVARE RVRALEPDGA RGEIARPILE EIRQRLGFLE DVGLGYLTLD RAAETLSGGE AQRIRLASQI GAGLVGVLYV LDEPSIGLHP RDHDRLLDTL RRLRDLGNSV IVVEHEPDAM RAADHIIDMG PGAGIHGGTV VAAGTPQAVA EHPDSVTGAF LSGRRTIALP QRRRPPEDER WVRMTGARGH NLQDVTAEIP VGLMTCVTGV SGSGKSTLIN DTLYRSAARD LNGAQTSPAE HDRVHGLEHF EKVVDIDQSP IGRTPRSNPA TYTGVFGPVR ELFAATPEAR ARGYKPGRFS FNVQGGRCEA CQGEGVVRVE MHLLPDLYVA CDSCHGTRFN RETLEIRYRG YTIHEVLEMT VDQAYEFFEA VPAIRRKLET LREVGLGYLR LGQSATTLSG GEAQRVKLAR ELSRREHGRN LYILDEPTTG LHFADVEQLL AVLQRLCDHG NTVVVVEHDL DIMRCADWII DLGPEGGDGG GQILAAGPPE HVAESAASYT AAYLSQALGT GPVAG
|
| |