Gene Hhal_2183 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2183 
Symbol 
ID4711124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2393901 
End bp2396738 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content70% 
IMG OID639856658 
Productexcinuclease ABC, A subunit 
Protein accessionYP_001003749 
Protein GI121998962 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.666287 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACCATA TCCGCATCCG CGGCGCCCGC ACCCACAACC TCGACAACCT GGATATCGAC 
ATCCCGCGCA ACTGCCTGGT GGTGATCACC GGGCCGTCGG GGTCGGGGAA GTCGTCGCTG
GCCTTCGACA CCCTCTATGC GGAGGGGCAG CGGCGCTACG TAGAGTCGTT GTCCGCCTAC
GCGCGGCAGT TCCTGTCGAT GATGGACAAG CCCGACGTGG ATCACATCGA GGGGCTGTCG
CCGGCGATCT CGGTCGAGCA GAAATCGGCG TCCCACAATC CGCGCTCCAC GGTGGGCACC
GTGACCGAGA TCCACGACTA CCTGCGCCTG CTCTTCGCCC GGGCCGGCAT CCCGTACTGC
CCCGAGCACC AGGTCGCCCT GGAGGCGAGC ACGGTCTCGG AGATGGTCGA TCGCATCCTG
GCGGAGCCGG AGGGGAGCAA GATGATGCTG CTCGCCCCGC TGGTCGACGG GAGCCCCGGC
GAGCACCGGC GCACCCTGGA GCAGCTGCGG AGCCAGGGGT ATCTCCGGGT GCGGATCGAC
GGCCAGGTGG TGGAACTCGA TCCGCTGCCC CAGCTCGACG GCGACAGCGC CCACGATATC
GAGGCGGTCA TCGACCGCTT TCGGGTCCGC GACGATCTCG CCGGGCGGCT GGCCGACTCC
ATCGAGACCG CCCTGCGCAT CGGCGAGGGG GTGGTGCGGG TCGCCTGGAT GGACGAGCCG
GAACGCGAGG CGCTGGTCTT CTCCGCGCAG CACGCCTGCC CGGAGTGCGG CCACGCGGTG
GAGCCCCTGG AGCCGCGGAT GTTCTCGTTC AACAACCCTC AGGGCGCCTG CCCGACCTGC
GACGGGCTGG GGACCCAGCA CTTCTTCGAC CCCGAGCGCG TAGTCAGCCG GCCGCAGCTG
ACGCCGGCCG AGGGGGCGAT TCGTGGCTGG GACCGGCGCA ACCTCTACTA CTTCGCCATT
CTGCAGGGGC TGGCGGCGCA CTACGGGTTC AGTCTGGAGA CCCCGTGGGC CGATCTGCCG
GAGTCCACGC GACACTGCAT CCTCTACGGC TCCGGCGATG AGGAGATCGT CTTCCACTAC
CCCGGCCGCA ATGGCCACAC GGAGCGCGTT CATCCCTTCG AGGGCGTGAT CCCCAACCTG
GAGCGGCGTT TCCGGGAGGC CGAGTCCGCC ACCGTCCGCG ACGAGCTGGG TCGCTTCATG
GCCCAGCGCA CCTGCCCGGA GTGCCAGGGC GGTCGACTGA ATCAGCGCGC CCGCAACGTG
CGGGTGGAGG GCGTCGCCCT GCCGGATATC GCCGCGCTAC CCATCTACGT CGCCCGGGAG
CGGGTGCGGG CCCTGGAGCC GGATGGGGCC CGCGGGGAGA TCGCCCGGCC GATCCTCGAG
GAGATCCGGC AGCGCCTGGG CTTCCTGGAG GATGTGGGGC TGGGTTACCT GACCCTGGAC
CGGGCGGCGG AGACCCTCTC AGGCGGCGAG GCGCAACGCA TCCGCCTGGC CAGTCAGATC
GGCGCTGGCT TGGTGGGGGT GCTCTACGTC CTCGACGAAC CCTCCATCGG GCTGCACCCC
CGGGACCACG ATCGGTTGCT CGACACCCTG CGTCGGCTGC GGGATCTGGG CAACAGCGTC
ATCGTCGTCG AGCACGAGCC GGACGCCATG CGCGCCGCCG ACCACATCAT CGACATGGGG
CCGGGCGCCG GGATCCACGG TGGCACGGTG GTGGCCGCCG GTACCCCGCA GGCGGTGGCC
GAGCACCCGG ATTCGGTCAC CGGGGCGTTC CTCAGCGGCC GGCGCACCAT TGCGCTGCCG
CAGCGCCGGC GTCCTCCGGA GGACGAGCGC TGGGTGCGGA TGACCGGCGC CCGCGGCCAC
AACCTGCAGG ACGTGACCGC CGAGATCCCC GTGGGCTTAA TGACCTGTGT CACCGGGGTC
TCTGGCTCCG GTAAGTCCAC GCTGATCAAC GACACCCTCT ACCGCAGCGC CGCCCGCGAC
CTCAATGGCG CCCAGACCAG CCCCGCGGAG CACGATCGCG TCCACGGTCT CGAGCACTTC
GAGAAGGTGG TGGACATCGA TCAGAGCCCC ATCGGCCGCA CGCCGCGCTC CAATCCGGCG
ACCTACACCG GGGTGTTCGG CCCGGTGCGC GAACTCTTCG CCGCCACGCC CGAGGCGCGC
GCCCGGGGCT ACAAGCCGGG GCGTTTCTCC TTCAACGTGC AGGGTGGGCG CTGCGAGGCG
TGCCAGGGCG AGGGCGTGGT CCGCGTGGAG ATGCACCTGC TGCCGGATCT CTACGTCGCC
TGCGACAGCT GCCACGGCAC TCGCTTCAAC CGCGAGACCC TGGAGATCCG CTACCGCGGC
TACACCATCC ACGAGGTCCT GGAGATGACC GTGGACCAGG CCTACGAATT CTTCGAAGCG
GTCCCCGCCA TCCGCCGCAA GCTCGAGACC CTGCGCGAGG TCGGCCTGGG CTATCTGCGC
CTGGGGCAGA GTGCGACGAC CCTCTCCGGT GGTGAGGCGC AGCGCGTCAA GCTGGCCCGG
GAGCTCTCGC GGCGCGAGCA CGGGCGCAAC CTCTATATCC TGGATGAGCC CACCACCGGG
CTGCACTTTG CCGACGTGGA GCAGCTGCTC GCGGTGCTGC AGCGCCTGTG CGACCACGGC
AACACCGTGG TGGTGGTGGA ACACGACCTG GACATCATGC GCTGCGCGGA CTGGATCATC
GATCTGGGGC CGGAGGGCGG CGACGGCGGC GGGCAGATCC TCGCCGCTGG CCCGCCGGAG
CACGTGGCCG AGAGTGCGGC CTCTTACACC GCCGCGTATC TAAGCCAGGC CCTCGGCACG
GGGCCGGTCG CCGGATAG
 
Protein sequence
MDHIRIRGAR THNLDNLDID IPRNCLVVIT GPSGSGKSSL AFDTLYAEGQ RRYVESLSAY 
ARQFLSMMDK PDVDHIEGLS PAISVEQKSA SHNPRSTVGT VTEIHDYLRL LFARAGIPYC
PEHQVALEAS TVSEMVDRIL AEPEGSKMML LAPLVDGSPG EHRRTLEQLR SQGYLRVRID
GQVVELDPLP QLDGDSAHDI EAVIDRFRVR DDLAGRLADS IETALRIGEG VVRVAWMDEP
EREALVFSAQ HACPECGHAV EPLEPRMFSF NNPQGACPTC DGLGTQHFFD PERVVSRPQL
TPAEGAIRGW DRRNLYYFAI LQGLAAHYGF SLETPWADLP ESTRHCILYG SGDEEIVFHY
PGRNGHTERV HPFEGVIPNL ERRFREAESA TVRDELGRFM AQRTCPECQG GRLNQRARNV
RVEGVALPDI AALPIYVARE RVRALEPDGA RGEIARPILE EIRQRLGFLE DVGLGYLTLD
RAAETLSGGE AQRIRLASQI GAGLVGVLYV LDEPSIGLHP RDHDRLLDTL RRLRDLGNSV
IVVEHEPDAM RAADHIIDMG PGAGIHGGTV VAAGTPQAVA EHPDSVTGAF LSGRRTIALP
QRRRPPEDER WVRMTGARGH NLQDVTAEIP VGLMTCVTGV SGSGKSTLIN DTLYRSAARD
LNGAQTSPAE HDRVHGLEHF EKVVDIDQSP IGRTPRSNPA TYTGVFGPVR ELFAATPEAR
ARGYKPGRFS FNVQGGRCEA CQGEGVVRVE MHLLPDLYVA CDSCHGTRFN RETLEIRYRG
YTIHEVLEMT VDQAYEFFEA VPAIRRKLET LREVGLGYLR LGQSATTLSG GEAQRVKLAR
ELSRREHGRN LYILDEPTTG LHFADVEQLL AVLQRLCDHG NTVVVVEHDL DIMRCADWII
DLGPEGGDGG GQILAAGPPE HVAESAASYT AAYLSQALGT GPVAG