Gene Hhal_1683 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1683 
Symbol 
ID4709221 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1837493 
End bp1838869 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content72% 
IMG OID639856150 
Productexodeoxyribonuclease VII, large subunit 
Protein accessionYP_001003249 
Protein GI121998462 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.763853 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCGCA GTAGAGACGA TACCGCCGTC TACACCCCCT CGCAGCTCAA TCAGGAGGTG 
CGGGGCATGC TCGAGACGGT GCTCCCGTCC GTCTGGGTTG AGGGCGAGAT CTCGAACCTC
GCCCGCCCCT CGTCAGGGCA CATGTATTTC ACCCTCAAGG ACCCCGGCGC CCAGGTCCGC
TGCGCCCTGT TCCGCGGCCG GGCCTCGGCC CTGCGCCACC GCCCCGCCGA CGGCGATCAG
GTCCGCATCC GCGCCAAGGC CAGCCTCTAC CCGGCCCGCG GCGAGTTTCA GCTCATCGTC
GAACACCTGG AACCCTCCGG AGAAGGTGCC CTGCAGCGCG CCTTCGAGGC GCTCAAGCAG
CGCCTGCAGG CCGAGGGCCT GTTCGATGCT GCGAGCAAGC GCCCGGTGCC GAAGATGCCC
CGCCGGCTCG GGGTGATCAC CTCGCCGACG GGGGCCGCTA TCCGCGATGT CCTGCAGGTC
TTGGAGCGGC GCTTCGCGGC GCTGCCGGTG CTGATCTACC CGGTACCGGT CCAGGGCGAA
GCCGCCGCCC CGGCGATCGT CCGAGCCCTG GAACTCGCCG GGCATCGGGC CGAGGTCGAC
GCCTTGCTGC TCACCCGCGG TGGCGGTTCG CTGGAGGACC TCTGGCCCTT CAATGAGGAA
GCGGTCGCGC GGGCCATCCG CGCCTGCCCG ATCCCGGTCG TCAGCGCCGT CGGCCATGAG
GTGGACCTCA CCATCGCCGA TCTGGCTGCG GATCTGCGGG CGCCCACGCC CTCTGCGGCG
GCCGAGACCC TGTCGCCCGA CGGCCAGGCC TGGCAGGAGC AGCTCGAGCG CCTCGGCCAC
CGCCTGGAGG TGGCCGCCGG CAGGCGCCTG GGCCGGGCGG GTGACCAGCT ATCCGGCCTG
CAGCGCCGGC TGGCCGCCCA GCATCCCGGG CGGCGCCTGC GTGATCGCGC ACAGCGACTT
GACGAGCTCG AGGGGCGCTT GCACCGGCTC GGCCACCAGG CCGTCGAATC CCGCCGTAGA
CGCCTTCACA CGGCCGAACA GCGCCTGCAG GTCCAAGACC CTCGCCGACG CACGACCAAC
GAGCGGCAGC GTGTGGCGGA GCTGGCGCAG CGCCTGCACC ACACCGTCCG CGGCCGGCTG
GAGACATCGC AACAACGACT GGGCAATGCC TCGCGTGCCC TGCACGCCGT GAGCCCGCTG
GCCACACTGG AACGCGGCTA CGCCGTGGTA CAGCGGGAGG AGGACAGCGC GATCCTGCGC
CGGGCCGACG CCGTCCGGGT GGGGGAGCGC ATCCGTGCCC GCCTGGCCCA CGGCGCGCTA
GACTGTCGGG TTGAGGCACT GCGTAACGCG GAGGAATCGC TGCCCGATGC CGACTGA
 
Protein sequence
MERSRDDTAV YTPSQLNQEV RGMLETVLPS VWVEGEISNL ARPSSGHMYF TLKDPGAQVR 
CALFRGRASA LRHRPADGDQ VRIRAKASLY PARGEFQLIV EHLEPSGEGA LQRAFEALKQ
RLQAEGLFDA ASKRPVPKMP RRLGVITSPT GAAIRDVLQV LERRFAALPV LIYPVPVQGE
AAAPAIVRAL ELAGHRAEVD ALLLTRGGGS LEDLWPFNEE AVARAIRACP IPVVSAVGHE
VDLTIADLAA DLRAPTPSAA AETLSPDGQA WQEQLERLGH RLEVAAGRRL GRAGDQLSGL
QRRLAAQHPG RRLRDRAQRL DELEGRLHRL GHQAVESRRR RLHTAEQRLQ VQDPRRRTTN
ERQRVAELAQ RLHHTVRGRL ETSQQRLGNA SRALHAVSPL ATLERGYAVV QREEDSAILR
RADAVRVGER IRARLAHGAL DCRVEALRNA EESLPDAD