Gene Hore_02920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHore_02920 
Symbol 
ID7314678 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalothermothrix orenii H 168 
KingdomBacteria 
Replicon accessionNC_011899 
Strand
Start bp299412 
End bp300806 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content40% 
IMG OID643610715 
Productargininosuccinate lyase 
Protein accessionYP_002508048 
Protein GI220931140 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones43 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAT TATGGGGTGG CAGGTTCAGT AAACAGACCC ATAAACTTAT GGAAGAATTT 
AATTCATCTC TTTCTTTTGA TAAAAGGCTT TACAGCTATG ATATTAAAGG GAGTATCGCC
CATGTTAAAA TGCTGTCCAG GACTGGTGTT TTATCTAAAT CTGAAGCAGA AACAATTATT
AAGGGTTTAC AGGAGATAAA AGAAGAGATA GATGAAGGTA TAATTTCTCT GGAGGGTCGA
TATGAAGATA TTCACAGTCT GGTAGAAAAA AATTTGATAG ATAAGGTTGG GGCAGTAGGG
GGAAAACTCC ATACTGCCCG TAGTAGAAAT GACCAGGTGG CCCTGGATAC CAGGCTTTAT
TTGCGTGATG AAATCTTTAA TATTCAGGAA CTGTTAATAA TCTTTTTAAA AACCCTCCTT
GAGCTGGGTG AAAAATATAA AAAGGTAGTT ATGCCCGGAT ATACCCATCT TCAAAGAGCC
CAGCCGGTAT CAATGGGCCA TCATTTACTG GCTTATTATT TTAAGTTAAA AAGGGATTAT
GACAGGTTAA ATGATAATAT GAAGCGGGTT AATGTTTTAC CCCTTGGATG TGGAGCCCTG
GCAGGGACTA CTTTCCCCAT TGACAGAGAA TGGGTTGCCA GGGAGCTGGG GTTTGAAAAG
ATAGCCCTCA ATTCTATAGA TGGGGTTAGC GACCGGGATT ATATTATAGA ATTTATGGGT
ATTGCGGCTT CAATAATGGT TCACCTGAGC AGGTTTAGTG CTGAATTAAT CTTGTGGTCT
TCAAGTGAAT TTAGTTTTAT TGAACTGGAT GACAGTTTTA CCACCGGGAG TAGTATTATG
CCCCAGAAGA AAAATCCCGA TGTTGCTGAA TTGGTCAGGG GTAAAAGTGG CAGGATATTC
GGGAATTTAG TTCAATTACT GTCCCTGCTT AAGGGGTTAC CTCTGGCTTA TAATAAAGAT
ATGCAGGAGG ATAAAGAGGC CCTTTTTGAT ACAATTGATA ACCTCAAGAT TATTTTAGAA
ATATTCCCGC CGATGTTAAA AACGATGAAG GTAAACAAGG ACCGGTTATA CCGGGCCGCC
AACCGGGGAT TTGTTAACGC AACTGATTTA GCTGATTATC TGGCCAGGAA AGGGGTTCCC
TTTCGTGAAG CACACGGGAT GGTGGGAAAG GCTGTCTTAT ATGCCCTGGA AAAAGATAAA
GAATTAAACC AGATAACTAT CGAGGAATGG AACCAGCTTT TTCCGGATTA CGGTGATATA
TTTGATAAAG GTTTGTCAGA AATTCTTGAT GTAAATACGA GTTTGAATAA TCGTAAGTCT
TCGGGAGGAC CAGCTCCTGA GGAAGTGGAG AGGGTAATCA GTATAGAGAG GGAGTGGATT
GAATCATTTA CTTAA
 
Protein sequence
MMKLWGGRFS KQTHKLMEEF NSSLSFDKRL YSYDIKGSIA HVKMLSRTGV LSKSEAETII 
KGLQEIKEEI DEGIISLEGR YEDIHSLVEK NLIDKVGAVG GKLHTARSRN DQVALDTRLY
LRDEIFNIQE LLIIFLKTLL ELGEKYKKVV MPGYTHLQRA QPVSMGHHLL AYYFKLKRDY
DRLNDNMKRV NVLPLGCGAL AGTTFPIDRE WVARELGFEK IALNSIDGVS DRDYIIEFMG
IAASIMVHLS RFSAELILWS SSEFSFIELD DSFTTGSSIM PQKKNPDVAE LVRGKSGRIF
GNLVQLLSLL KGLPLAYNKD MQEDKEALFD TIDNLKIILE IFPPMLKTMK VNKDRLYRAA
NRGFVNATDL ADYLARKGVP FREAHGMVGK AVLYALEKDK ELNQITIEEW NQLFPDYGDI
FDKGLSEILD VNTSLNNRKS SGGPAPEEVE RVISIEREWI ESFT