Gene Hhal_0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0022 
SymboluvrC 
ID4710239 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp23291 
End bp25132 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content64% 
IMG OID639854480 
Productexcinuclease ABC subunit C 
Protein accessionYP_001001619 
Protein GI121996832 
COG category[L] Replication, recombination and repair 
COG ID[COG0322] Nuclease subunit of the excinuclease complex 
TIGRFAM ID[TIGR00194] excinuclease ABC, C subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAAGG TTTCGGCAGA GCCAACCCCT GGAGTTGAGG CGCTGCGCGA GCGGGTGCGC 
GGCCTCCCGG AGCGACCGGG TGTCTACCGC ATGCTCAGTG CCGAAGGCAC CATCATCTAC
GTGGGCAAGG CGCGCAACCT CCGCCGCCGA GTCTCCAGTT ATTTCACCCC GTCACGAAAG
ACTCCCAAGA CCGAGCGGCT CGTCCAGCTC ATCGCCGATG TCCAGATCAC GGTGACCCAC
ACCGAGGCCG AGGCGTTGAT CCTGGAGAAC AACCTGATCA AGGAACATCG GCCTCGATAC
AACGTCCTGC TAAGGGACGA CAAGTCGTAC CCGTATATCT ACCTCTCGAG TCATCAGCAG
TTTCCGCGTC TGGGATACCA TCGCGGGGCA CGACAGGGAG CGGGACGATT CTTTGGCCCT
TACCCGAACT CCAATGCGGT GCGCGAGACG CTCGGCTACC TGCAGAAGGT CTTCCCCATC
CGACAGTGCC GCGATACCTT CTTCCGCAAT CGCTCGCGTC CCTGCCTGCA GTATCAGATT
CGCCGCTGCA CGGCGCCCTG TGTCGGCTAC ATCAGCGAGG AGGACTACCG CCGCGACGTG
CGCGACGTTG AGTTCTTCCT GGAGGGGCGC TCCGGCGAGG TCATCGCCGA GCTCGTCCGG
CGCATGGAAG AGGCTGCGGA GAACCTCGAG TTCGAGCAGG CGGCGCGCCT GCGCGACCGC
ATCGCCAATC TCCGGCATAT CCAGCAGCGT CAGTACGTTG CCCAGGATCG GGACGACAAC
ATGGATATCG TCGCCTGCGT GGCGGAAGGC GATACGGCGT GTGTCCAGGT CTTCTTCATC
CGGGGAGGGA GCAGTCTGGG CAACCAGTCC TACTTCCCCA ACACCCCGTC AGGTAGCCGC
GAGTCGGATA TCCTGGCCAG CTTCCTGGCC CAACACTATC TCGGTCGTGT AGCGCCGCCC
GAAGTGGTGA TCAATCGCCC GGTCCGGGAG CAGCGGCTCC TTGAGCAGGC CCTGCGCACC
GCGTCCGGTG GAACGGTGGC GATCCGTCAC CGTGTGCGCG GCGATCGCCG GCGCTGGGTG
GAGATGGCGG AAGAAAATGC TCGTTACGCG CTCTCCGCGC GCAGCGCCTC GACGGCCAGT
CAGCAGCGGC GGTACAGCGC CCTTGCCGAG GTCATCGATA GCGATGCCCC GCCAGAGCGG
ATCGAGTGTT TCGACATCTC GCACACCGCT GGTGAGGCCA CCGTAGCCTC CTGCGTCGTC
TTCAATCGGG AGGGACCGGT CAAGAGCGAC TACCGCCGCT TTAATATCCG TAATGTGACC
GCCGGTGATG ACTACGCAGC CATGCATCAG GCACTGACCC GCCGCTACCG GCGCGTTAAG
AGCGGCGAGG CGCCGCTGCC CGACCTCTTG CTAATCGATG GAGGAAAGGG GCAAGTTGCA
CAGGCACGGG ATGTCCTCGA CGAACTGGGC ATCGACGGCG TGGCACTAAT GGGCATCGCC
AAGGGGCCGG AGCGCCGCCC CGGAGAGGAG ACCCTATTGC TCGACGACGG GGAGCGGGAG
ATCGAGTTAC CGGCCGACTC TCCCGCACTC CACCTGTTGC AGCAGGTTCG CGACGAGGCC
CACCGCTTTG CGGTCAGTGG CCACCGCCAG CGCCGCGGCA AGGCGCGGCG GGAGTCGATC
CTCGAGGAGA TCCCCGGCCT GGGGCCAAAG CGCCGCCAGA GCCTTTTGAA ACACTTCGGC
GGGATCCAAG GCATTCGCCA GGCCGGCATC GAGGATCTGG CTCGCGTACC GGGCATCCAT
CGATCGCTCG CTCAACGGAT CTACGACACG TTCCACGGTT AG
 
Protein sequence
MNKVSAEPTP GVEALRERVR GLPERPGVYR MLSAEGTIIY VGKARNLRRR VSSYFTPSRK 
TPKTERLVQL IADVQITVTH TEAEALILEN NLIKEHRPRY NVLLRDDKSY PYIYLSSHQQ
FPRLGYHRGA RQGAGRFFGP YPNSNAVRET LGYLQKVFPI RQCRDTFFRN RSRPCLQYQI
RRCTAPCVGY ISEEDYRRDV RDVEFFLEGR SGEVIAELVR RMEEAAENLE FEQAARLRDR
IANLRHIQQR QYVAQDRDDN MDIVACVAEG DTACVQVFFI RGGSSLGNQS YFPNTPSGSR
ESDILASFLA QHYLGRVAPP EVVINRPVRE QRLLEQALRT ASGGTVAIRH RVRGDRRRWV
EMAEENARYA LSARSASTAS QQRRYSALAE VIDSDAPPER IECFDISHTA GEATVASCVV
FNREGPVKSD YRRFNIRNVT AGDDYAAMHQ ALTRRYRRVK SGEAPLPDLL LIDGGKGQVA
QARDVLDELG IDGVALMGIA KGPERRPGEE TLLLDDGERE IELPADSPAL HLLQQVRDEA
HRFAVSGHRQ RRGKARRESI LEEIPGLGPK RRQSLLKHFG GIQGIRQAGI EDLARVPGIH
RSLAQRIYDT FHG