Gene Htur_4105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4105 
Symbol 
ID8744733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp369048 
End bp370331 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content67% 
IMG OID646514662 
Productimidazolonepropionase 
Protein accessionYP_003405609 
Protein GI284167331 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCCG CGACACCAGA CCCCAGGGGG CCGCTCTGTA TCGTGTACAA CGCGGGGGAA 
CTCGTCGTTG GGCCGGTCGA TGACACAGCC GACACGGCAC CAGCCGACGA CGCGCGCACC
GACGGCCCGC TGGAGATTCG CGAGGATGCG GCGTTCGCCG CGATCAACGG CGAGGTTGTC
GCCGTCGGCC CAACCGACGA GATCACGCGG GCGTATCCGC CCGACAATGC CGCCACCGCG
ATCGACGCGG ACGGGAACGC CGTCGTCCCG GGATTCGTCG ATCCGCACAC GCACGCTCTC
TTCGCGGGAG ACCGATCGGA CGAGTTCGCG GCCAAGGTGC GCGGCCGGAG CTACCAGGAG
ATCCTCGCAG AGGGTGGCGG GATCCTCCGA ACCGTCCGTG CGGTCCACGA CGCGAGCGAC
GACGACCTCC TCGCGAACCT GCGATCACAC CTCGACGTCA TGCTCGCCCA CGGGACCACT
ACTGTTGAGG TCAAATCCGG CTACGGCCTC GAGGCCGAGA CCGAGCTCCG ACTGCTCGAG
ACGATCGATC GCGCCGCCGC CAAACACCCG ATCACTCTCG TACCGACGTT CATGGGAGCA
CACGCGGTTC CGGCGGACAC GGACACCGAA GACTACGTTG ACCGCGTCAT CTCGGACCAG
CTCCCCGCCG TCGCCGACCA AGGGATCGCC GAGTTCTGTG ACGTCTTCTG TGAGGCGGAT
GTGTTCGATG TCGACCAGTC TCAGCGCGTG CTCAAGGCGG GTGCAGCCGC GGGACTCACG
CCGAAAATCC ACGCCGAGGA GTTTACCCGT CTCGGGGGCG CACAGCTCGC CGCCGAGCTC
GAGGCTGCGA GCGCCGATCA CCTGTTGCAC GCGACGGCCG AGGACGTCGC GGCGCTCGTC
GAGGCAGCCG TCGTTCCCGT GCTCTTGCCC GGGACGGCGT TCGGCCTCGG TGCAGCGTAC
GCCGACGCGC GAGCGTTTCT AGAGGCGGGG GCTTCCGTTG CGCTCGCGAC AGACTTCAAT
CCAAACTGTC ACGTACGGAC GATGGAGTTC GTCCAGACCC TCGCCGTCAT GAAGATGGAC
CTCACGCCGG CCGAAGCGCT GCTCGCAGCG ACGCGCAACG CGGCACTCGC GATCGACCGA
GACGACGGCA CTGGGACGCT TCGCGAAGGG GCCCCTGCGG ATGCCGCGGT GCTGGCAGCG
CCATCGTACG TCCACCTCGC GTACCGGTTC GACACCACGG CCATCGAGAC GGTCGTCAAA
AACGGCAGGG AGGTAGGGAC GTGA
 
Protein sequence
MTAATPDPRG PLCIVYNAGE LVVGPVDDTA DTAPADDART DGPLEIREDA AFAAINGEVV 
AVGPTDEITR AYPPDNAATA IDADGNAVVP GFVDPHTHAL FAGDRSDEFA AKVRGRSYQE
ILAEGGGILR TVRAVHDASD DDLLANLRSH LDVMLAHGTT TVEVKSGYGL EAETELRLLE
TIDRAAAKHP ITLVPTFMGA HAVPADTDTE DYVDRVISDQ LPAVADQGIA EFCDVFCEAD
VFDVDQSQRV LKAGAAAGLT PKIHAEEFTR LGGAQLAAEL EAASADHLLH ATAEDVAALV
EAAVVPVLLP GTAFGLGAAY ADARAFLEAG ASVALATDFN PNCHVRTMEF VQTLAVMKMD
LTPAEALLAA TRNAALAIDR DDGTGTLREG APADAAVLAA PSYVHLAYRF DTTAIETVVK
NGREVGT