Gene Huta_2897 details


Gene Information

Locus tag: Huta_2897
Symbol:
ID: 8385206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2977420
End bp: 2979555
Gene Length: 2136 bp
Protein Length: 711 aa
Translation table: 11
GC content: 65%
IMG OID: 644973975
Product: ATP-dependent protease Lon
Protein accession: YP_003131791
Protein GI: 257053958
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1067] Predicted ATP-dependent protease
TIGRFAM ID: [TIGR00764] lon-related putative ATP-dependent protease
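
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database tool.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive on the replicon.
START_BP = 2977420
END_BP = 2979555


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2136
print(protein_length(length_bp))  # 711
```

Both values agree with the Gene Length and Protein Length fields of the record.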


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.512346
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGAAA ACAAGGATAC TGCCGACGAT GCGCCGCCCG AGCACGAGGA GGCTACCGTC 
TCGGACGACG GCCAGACGGA CGGGTCCGAC GCGTCGGCCG AGCGATCGAC GGGTGCGTCC
AGCATTGGGA CGGACGACCG GCCCGTCGAA GAGACCGACG CTGCGGACGT GGCCGAGGAG
TCGGATACCG ACGACGAAGT GGCCGATCTC GGGAGCGACG TCACACTCGA CGACGAGGAG
TCCACAGTCG TCGACGACGA AGACGATGTA CTAGGTGGAT TGAACATCGA CTCGACCGAG
GATATCGAGG TGCCCGAGCG GTTGGTCGAT CAGGTCATCG GGCAGAGTCA CGCCCGTGAC
ATCGTCCTGA AGGCGGCCAA ACAGCGCCGC CACGTGATGA TGATCGGCTC GCCCGGGACG
GGCAAGTCGA TGCTCGCAAA GGCGATGAGC CAGCTCCTCC CCAAAGAGGA CCTCCAGGAC
GTTCTGGTCT ATCACAACCC CGACGACGGC AACGAGCCGA AAGTCCGGAC GGTCCCCGCG
GGCAAGGGCG AGCAGATCGT CGACGCCCAC AAGGAGGAGG CCCGCAAGCG CAACCAGATG
CGGTCGTTCC TGATGTGGAT CATCATCGCG ATCGTGATCG GGTACGCGCT GTTCGCCGGC
AATCCGCTGC TGGGCGTGCT CGCAGCGGGC GTCATCTATC TGGCGTTCCG CTACGGGGCG
CGTGGTGGCG ATTCGATGAT CCCGAACCTG CTGATCAACA ACGCCAACGA GCAGACCGCG
CCCTTCGAGG AGGCGACGGG TGCCCACGCC GGTGCACTGC TCGGCGACGT CCGCCACGAC
CCCTTCCAGT CCGGCGGCAT GGAGACGCCC AGCCACGACC GCGTCGAGGC TGGTGCGATC
CACAAGGCCA ACAAGGGCGT GCTGTTCGTC GACGAGATCA ACACGCTGGA CATCCGTTCC
CAGCAGAAGC TGATGACCGC AATCCAGGAG GGCGAGTTCT CGATCACTGG CCAGAGCGAA
CGCTCCTCCG GCGCGATGGT CCAGACCGAA CCCGTTCCGA CTGACTTCAT CATGGTCGCG
GCGGGGAACA TGGACGCGAT GGAGAACATG CACCCGGCGC TTCGCGACCG GATCAAGGGC
TACGGGTACG AAGTCTACAT GGACGATACC ATCGAAGACG ACCCGGAGAT GCGCCGGAAG
TACGCCCGCT TCGTCGCCCA GGAGGTCGAG AAGGACGGGC GGCTCCCGCA CTTCACCGAG
GACGCCGTCG AGGAGATCAT CCTCGAAGCG CGTCGCCGGG CCGGCCGCAA GGAGCACCTC
TCGCTGAAAC TCCGGAACCT CGGCGGACTC GTCCGTGTCG CCGGCGACAT CGCCCGCGCA
GCGGACAAGG AGTTCACCGA ACGCGAAGAC GTGCTGCAGG CCAAGGATCG CTCGCGTTCG
ATCGAACAAC AGCTCGCGGA CAACTACATC GAGCGCCGCA AGGACTACAA GATGACCGTC
AACGAGGGCA GCGCCGTCGG TCGCGTCAAC GGCCTGGCCG TCATGGGCGA GGACAGCGGG
ATCGTCATGC CCGTCATGGC CGAGGTCGCG CCCTCCCAGG GTCCCGGCGA GGTCATCGCG
ACCGGAAAGC TCCAGGAGAT CGCGATGGAG GCCGTCCAGA ACGTCAGCGC GATCATCAAG
AAGTTCTCAG ACGAGGACAT CTCAGAGAAG GACATCCACA TCCAGTTCGT CCAGTCCTAC
GAGGGCGTCG AGGGCGACTC CGCGTCGGTG ACGGTCGCGA CGGCCGTCAT CTCCGCCTTA
GAGAACATCC CCGTCGAGCA GAACCTCGCG ATGACCGGCT CGCTGTCGGT TCGCGGTGAC
GTCCTGCCCG TCGGCGGTGT CACCCACAAG ATCGAGGCCG CCGCCAAGAC CGGCCTCGAC
ACGGTGATCA TCCCGAAGGC CAACGAACAG GACGTGATGA TCGAGGACGA GTACGAGGAC
CAGATCGAGA TCATCCCCGT CAGCCACCTC TCGGAAGTGC TGGAAGTCGC GCTGGCTGGC
GAACCCGAAA AGGACAGCCT GGTCGATCGG CTGAAGTCGA TCACCGGCCA GGCACTCGAA
CGGAAGATCG GCCAGACGAA CCCCAGCCTG CAGTAA
 
Protein sequence
MSENKDTADD APPEHEEATV SDDGQTDGSD ASAERSTGAS SIGTDDRPVE ETDAADVAEE 
SDTDDEVADL GSDVTLDDEE STVVDDEDDV LGGLNIDSTE DIEVPERLVD QVIGQSHARD
IVLKAAKQRR HVMMIGSPGT GKSMLAKAMS QLLPKEDLQD VLVYHNPDDG NEPKVRTVPA
GKGEQIVDAH KEEARKRNQM RSFLMWIIIA IVIGYALFAG NPLLGVLAAG VIYLAFRYGA
RGGDSMIPNL LINNANEQTA PFEEATGAHA GALLGDVRHD PFQSGGMETP SHDRVEAGAI
HKANKGVLFV DEINTLDIRS QQKLMTAIQE GEFSITGQSE RSSGAMVQTE PVPTDFIMVA
AGNMDAMENM HPALRDRIKG YGYEVYMDDT IEDDPEMRRK YARFVAQEVE KDGRLPHFTE
DAVEEIILEA RRRAGRKEHL SLKLRNLGGL VRVAGDIARA ADKEFTERED VLQAKDRSRS
IEQQLADNYI ERRKDYKMTV NEGSAVGRVN GLAVMGEDSG IVMPVMAEVA PSQGPGEVIA
TGKLQEIAME AVQNVSAIIK KFSDEDISEK DIHIQFVQSY EGVEGDSASV TVATAVISAL
ENIPVEQNLA MTGSLSVRGD VLPVGGVTHK IEAAAKTGLD TVIIPKANEQ DVMIEDEYED
QIEIIPVSHL SEVLEVALAG EPEKDSLVDR LKSITGQALE RKIGQTNPSL Q
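
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of joining the blocks, computing GC content, and translating the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons); the helper names `join_blocks`, `gc_percent`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
# Assumption: table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above, used as a small check.
first_line = "ATGAGCGAAA ACAAGGATAC TGCCGACGAT GCGCCGCCCG AGCACGAGGA GGCTACCGTC"
print(translate(join_blocks(first_line)))  # MSENKDTADDAPPEHEEATV
```

The translated fragment matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence to the same functions would allow the GC content and protein length fields to be checked in the same way.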