Gene Hhal_1024 details


Gene Information

Locus tag: Hhal_1024
Symbol:
ID: 4709218
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 1097640
End bp: 1099037
Gene Length: 1398 bp
Protein Length: 465 aa
Translation table: 11
GC content: 70%
IMG OID: 639855495
Product: argininosuccinate lyase
Protein accession: YP_001002602
Protein GI: 121997815
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
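
The coordinate and length fields above are related by simple arithmetic. The sketch below is one way to check them, assuming the start and end positions are 1-based and inclusive (which agrees with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1097640, 1099037)
    print(length)                  # 1398
    print(protein_length(length))  # 465
```

Both values match the Gene Length and Protein Length fields of this record.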


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.350248
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAGA ATGACGCAGC CCCCGGCCAG CAGCTGTGGA CCGGCCGCTT CACCGAGGCG 
ACCGACGCCT TCGTCGAGCG CTTCTCGGCC TCCGTGCAGT TCGACGCGCG CCTCGCCCTG
CAGGATATCC AGGGCTCCGA GGCCCACGCC CGCATGCTCG CCGCGCGTGG CGTGCTGACC
GAGGCGGAGC GCGACGCCAT CCTCCAGGGG CTGGCCGAGA TCCGCCAAGA GGTGGTCGAG
GAGCGCTTCC CGTGGTCGCC GCAGCTCGAG GATGTGCACA TGAACATCGA GCATCGGCTG
ACCCAGCGCA TCGGCGAGGC CGGCAAGAAG CTGCACACCG GCCGCTCGCG CAACGATCAG
ATCGCCACCG ACGTGCGCCT CTTCCTGCGC GAGGCCATCG ATACCATCCG TGCCGAGTTG
GCCCGCTTCC AGCACGGGCT GGTGGAACTG GCCGAGCGCG AGGCCGACAC CATCATGCCC
GGCTTCACCC ACCTGCAGGT GGCGCAGCCG GTGACCTTCG GGCACCACAT GCTGGCCTGG
TACGAGATGC TCGAGCGCGA CCGCGACCGC CTGGCGGACT GCCGGCGGCG CCTGAACCAG
TGTCCGCTCG GCGCGGCGGC GCTGGCGGGG ACGTCGTTCC CCATCGACCG CGAGGCGACC
GCCCGGGAGC TGGGCTTCGA CGCGCCGACG CGCAACTCCC TGGACTCGGT CAGCGATCGC
GATTTCGCCA TCGAATTCTG TGCCGACGCC AGCCTGATCC TCGTCCACCT CTCGCGCATG
GCCGAGGAGC TGGTGCTGTG GACCTCTCAG CAGTTCGGCT TCATCGAGCT GCCGGATCGC
TTCTGCACCG GTTCTTCGAT CATGCCGCAG AAGAAGAACC CGGACGTGGC GGAGCTGGTG
CGGGGCAAGG CGGCCCGTGC CCAGGGCAGC CTGGTCCAGC TGCTGACCCT GATGAAAGGC
CAGCCGTTGG CCTATAACCG GGACAACCAG GAGGACAAGG AGCCGCTTTT CGACGCCGTG
GACACGGCGC GGGATGCGCT GACCGCCTTC GCCGACATGG TCCCGGCGCT GAGCGTCAAC
CGCGAGCGTT GCCGCGCGGC GGCCCGGGCC GGCTTCGCCA CGGCCACCGA CCTGGCCGAC
TACCTGGTCC GCCAGGGGCT GGCCTTCCGC GACGCCCACG AGGTGGTCGG CCGTGCCGTG
CGTTACGCCA CCGAAGCCGA TCGGGATCTC GCCGAGCTCA GCCTTGAGGA GCTCCAGCAG
TTCTCCACCG CCATCGGGGA CGACGTCTTC GCCGTGCTCA CCCTCGACGG CTCGGTCGCG
GCGCGCTCCC ACGTTGGCGG CACGGCGCCG GAGCAGGTCC GTGCCCAGGC CCAGGCGGCC
CGGGAGCGCC TGGCTTGA
 
Protein sequence
MSQNDAAPGQ QLWTGRFTEA TDAFVERFSA SVQFDARLAL QDIQGSEAHA RMLAARGVLT 
EAERDAILQG LAEIRQEVVE ERFPWSPQLE DVHMNIEHRL TQRIGEAGKK LHTGRSRNDQ
IATDVRLFLR EAIDTIRAEL ARFQHGLVEL AEREADTIMP GFTHLQVAQP VTFGHHMLAW
YEMLERDRDR LADCRRRLNQ CPLGAAALAG TSFPIDREAT ARELGFDAPT RNSLDSVSDR
DFAIEFCADA SLILVHLSRM AEELVLWTSQ QFGFIELPDR FCTGSSIMPQ KKNPDVAELV
RGKAARAQGS LVQLLTLMKG QPLAYNRDNQ EDKEPLFDAV DTARDALTAF ADMVPALSVN
RERCRAAARA GFATATDLAD YLVRQGLAFR DAHEVVGRAV RYATEADRDL AELSLEELQQ
FSTAIGDDVF AVLTLDGSVA ARSHVGGTAP EQVRAQAQAA RERLA
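
The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it strips the block spacing, computes GC content, and translates codons with the standard amino acid assignments, which translation table 11 shares. Alternative start codons permitted by table 11 are not handled here, which is acceptable for this gene because it begins with ATG. All names are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the page layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First row of the gene sequence listed above.
    first_row = ("ATGAGCCAGA ATGACGCAGC CCCCGGCCAG "
                 "CAGCTGTGGA CCGGCCGCTT CACCGAGGCG")
    print(translate(first_row))  # MSQNDAAPGQQLWTGRFTEA
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.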