Gene Huta_0500 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0500 
Symbol 
ID8382767 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp505330 
End bp506508 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content66% 
IMG OID644971562 
Productpeptidase M24 
Protein accessionYP_003129420 
Protein GI257051587 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCCAG ATCTTTCTGC AGTCGACGAA CGGCTGGCCG AGTTCGAGGC CGACGGATAT 
CTCCTTGATG CCGATGGAAC CGACCCGAAC CAGCAGTATC TCTCCGGGTT TGACGCCCCC
GATCCGTTCG TGACGCTGTA CGCTGGGGAG ACGCATCTCC TGTTTGTCCG GAGTCTGGAG
TTCGGCCGCG CAAAGCGGGA GGCGCGTGCC GACACCGTCG AACGGTTCGT CGACTTCGAC
TACGACCGAC TTCGCGAGGA ACACGACCGT CGTGAAGCGG CCGCCCGCGT TCGCGCCACG
TTCCTTCGTG AGCACGATGT CGAGCACGTC GCCGTCCCGC CGCGGTTCCC GACGGGAACG
GCCGACGCAC TGCGCGAACA GAACATCGAG GTCACGGTCG ATCACGACGA CGCGATCGAG
ACCGCTCGGG CGACGAAGAC CGCCGCGGAG ATCGACCATA TCCGGACTGC CCAGCGAGCC
AACGAGGCCG CGATGGCGGC CGCCGAGGGC CTCATCAGGG GAGCCGCTGT CGACGACGAG
GGGCGACTGC TCGCCGAAGG TGAGGTGTTG ACCAGCGAAC TGGTCCGCGA GGAGATCGAA
GTAACACTGC TCCGGAACGG CTGTGCGCTC GACGAGACGA TCGTCGCCTG TGGCGCGGAC
GCCGCCGATC CCCACGATCG CGGAAGCGGC CCCCTCGTGG CCGACGAGCC CATCATCGTC
GACATCTTCC CCCAGGACAA GGACTCAAAA TACCACGCCG ACATGACCAG GACGTTCCTG
GTCGGCGAAC CGGACGAGAC GGTCGAGGAG TGGTTCGAGC TGACCGATCA GGCTCGTAAG
GCAGCCATCG ACGCGGTCGA ACCGGGCGTC ACGGGCGCCG AAGTTCACGA TATCGTCTGT
GACGTCTACG AGGACGCCGG CCTGCCGACG CTCCGGAGCG ACGGGAGCGC CGAGACGGGA
TTCATCCACT CGACCGGCCA CGGCGTCGGG CTGGCAGTCC ACGAACAGCC GAGCGTGAGC
CAGCGCGGCG GGGAACTCGA ACCGGGCCAC ATCATTACGA TCGAGCCCGG CCTCTACGAT
CCGGCGGTCG GCGGCGTCCG GATCGAGGAT CTGCTGGTCG TGACCGACGA CGGTGCGGAG
AACCTGACCG AGTACCCGGT GGCACTCACC GGGGAGTAA
 
Protein sequence
MEPDLSAVDE RLAEFEADGY LLDADGTDPN QQYLSGFDAP DPFVTLYAGE THLLFVRSLE 
FGRAKREARA DTVERFVDFD YDRLREEHDR REAAARVRAT FLREHDVEHV AVPPRFPTGT
ADALREQNIE VTVDHDDAIE TARATKTAAE IDHIRTAQRA NEAAMAAAEG LIRGAAVDDE
GRLLAEGEVL TSELVREEIE VTLLRNGCAL DETIVACGAD AADPHDRGSG PLVADEPIIV
DIFPQDKDSK YHADMTRTFL VGEPDETVEE WFELTDQARK AAIDAVEPGV TGAEVHDIVC
DVYEDAGLPT LRSDGSAETG FIHSTGHGVG LAVHEQPSVS QRGGELEPGH IITIEPGLYD
PAVGGVRIED LLVVTDDGAE NLTEYPVALT GE