Gene Htur_4055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4055 
Symbol 
ID8744683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp307678 
End bp308685 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content50% 
IMG OID646514620 
ProductArginase/agmatinase/formiminoglutamase 
Protein accessionYP_003405567 
Protein GI284167289 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTAATG ACTACTCACG CGCGCGTGAA TTCCGAGAAA GCCACGAGGG AGCGGAAGTT 
GAACTCGCAT ACACAGGCAA GCAGACGTTT CTCAAGGGAG ATCCTCGAAA CGTAGATGAT
CTCCAAGATA TAGATGTGGC AGTGCTCGGT GCACCGCTTG ATACTGCTGC AAGCAACAGA
CCTGGTGCCC GATACGGTCC GGAAGCAGTC CGAAAGGCTA GCACTTGGTG GGCTTACCTC
TCGGGATACA AGAGCGGTCT CACAAACATG AATACTCGTG CTCAAGTCGA TTATGATGAC
TTGAAAATTG CTGATTGTGG GGATATCCCT GTATTTCCGC AGGATCAAAA GCAATCCGCG
GATAGTATCA CTGCTCACGT TGCTACTGCG GCTGAGCAGG CGTTTCCAGT ACTGATTGGC
GGCGACCATT ATTGTACCTA CCCGTCCTTC TGTGGATTTG CTGAAGCTAT CGACGCCGAC
AATGTTGGTC TGGTGCAGAT TGATGCCCAT AGTGACACCT CAGATGGAAG CCCAGTTTTC
GGAGATCACT TTCATGGGTC GAGTACACGG TTGATTGCAG AGTCAGAGTA TTCTGACTAC
GAGCATATCA GTCAGATAGG GATTCGAGGG TACGAAGCAC CGGGATTCTT CGAGTTCGCC
GAAGAAACTG GATTGAATCT TTACACTATG CGGGACATTC AAGCGCAGGG AATTCGCAAC
GTTGTAACAG AAGCAATACA GAACGCATCT GAAGATACAG ACGCTGTCTA CGTAACGTTC
GACATTGATT CGGTTGATCC TAGTACCGCA CCTGGAACAG GAACTCCGGA ACCAGGTGGC
CTGAATAGTC ATCAGGCTCT TACAATTATG GAAATCCTCG GTACTCACGA GGCAGTCGGT
GCTGCAGATT TGATGGAGGT TGCTCCAAAT TACGATCCAA CCCAGTCAAC TCAGACACTC
GCAGCATACC TACTTGTAAC ACTCGTTGAG CGACAGTTCG CTGAGTAG
 
Protein sequence
MSNDYSRARE FRESHEGAEV ELAYTGKQTF LKGDPRNVDD LQDIDVAVLG APLDTAASNR 
PGARYGPEAV RKASTWWAYL SGYKSGLTNM NTRAQVDYDD LKIADCGDIP VFPQDQKQSA
DSITAHVATA AEQAFPVLIG GDHYCTYPSF CGFAEAIDAD NVGLVQIDAH SDTSDGSPVF
GDHFHGSSTR LIAESEYSDY EHISQIGIRG YEAPGFFEFA EETGLNLYTM RDIQAQGIRN
VVTEAIQNAS EDTDAVYVTF DIDSVDPSTA PGTGTPEPGG LNSHQALTIM EILGTHEAVG
AADLMEVAPN YDPTQSTQTL AAYLLVTLVE RQFAE