Gene Huta_1834 details

Gene Information

Locus tag: Huta_1834
Symbol:
ID: 8384124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 1843543
End bp: 1844643
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 63%
IMG OID: 644972902
Product: peptidase M29 aminopeptidase II
Protein accession: YP_003130737
Protein GI: 257052904
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
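
The gene and protein lengths listed above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated 1101 bp) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1843543
end_bp = 1844643

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the last codon is the stop codon
# and does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1101 0 366
```

Both results agree with the Gene Length and Protein Length fields of the record.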


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCCAC GCATCCGCGA ACACGCACAG CTGGTCGCCG ACGCGCTCGA TCTGAGCGAG 
GGCGACAACC TGCTCATCAA GAGCGAGCCC GCCGCCGAGG ATTTGGTTGT TGCACTCTAC
GAGATCGCCG GTGATCGCGG CGCACACCCC GTTTCGATGC GTACCAATCG AAGCGGACGG
GCCATCCGAA GCTATCTCCA GGCGGCCGAC GCCGCCGACG TCGACTTCGA GACACCGGCC
CACGAACAGG CCCTGGTCGA GGCGGCCGAC TGTCACGCCG TCATTCGCTC TCACGTAAAC
GTCACCGAAC TGGAGGACGT CGCTCCCGAG GTCAACTCCG ATTACGAGAA GGCCCACCAG
CCGATTCTCA ACGAGCGGCT GACCGACCGC TGGACGCTCA CCCAGCATCC CACGCCCGCC
GACGCCCAAC TCGCCGAGAT GAGCACCGAA GCCTACGAGA ACTTCGTCTA CGATGCGATC
CTCAAAGACT GGGACGAACA ACGCGAGTTC CAGGCTCAGT TGGTCGAGAT CCTCGAGGAC
GCCAGCGAGG TCAGGATCGC GAGCGGCGAC ACCACCGACG TCACCATGTC CGTCGACGGC
AACCACGTCA TCAACGACAC GGACACGCAC AACCTCCCCG GCGGCGAGGT GTTCACCGCC
CCGATCCCCG ACAGCGTCGA AGGTGACGTC CTCTTCGACA AGCCCGTCTA TCGCCGTGGG
CGGGAGATTA CCGACGCACG ACTCGTCTTC GAAGACGGCG AAGTCGTCGA GCACAGCGCC
TCGAAGAACG AGGAGCTGCT GACGAGCATT CTCGACACCG ACGAGGGAGC CCGCCGCCTG
GGCGAACTCG GGATCGGGAT GAACCGCGAT ATCGACCGCT TCACCTACAA CATGCTCTTC
GACGAGAAGA TGGGCGACAC CGTCCACATG GCCGTCGGCC GCGCCTACGA GGACAACGTC
GGCGTTGGCA ACGAGCAAAA CGAGTCGGCC CAACACGTCG ACATGATCGT CGACATGAGC
GAGGACTCAT TCATCGAAGT CGACGGTGAA GTGGTTCAGG AAGACGGGAC GTTCGTGTTC
GAAGACGGCT TCAAGGGATA A
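
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before any analysis. The following is a minimal sketch of how the blocks could be joined and a GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity, so the printed value describes that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above (60 of the 1101 bases).
first_line = "ATGGACCCAC GCATCCGCGA ACACGCACAG CTGGTCGCCG ACGCGCTCGA TCTGAGCGAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full 1101 bp sequence to the same two functions would give a value to compare with the GC content field of the record.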
 
Protein sequence
MDPRIREHAQ LVADALDLSE GDNLLIKSEP AAEDLVVALY EIAGDRGAHP VSMRTNRSGR 
AIRSYLQAAD AADVDFETPA HEQALVEAAD CHAVIRSHVN VTELEDVAPE VNSDYEKAHQ
PILNERLTDR WTLTQHPTPA DAQLAEMSTE AYENFVYDAI LKDWDEQREF QAQLVEILED
ASEVRIASGD TTDVTMSVDG NHVINDTDTH NLPGGEVFTA PIPDSVEGDV LFDKPVYRRG
REITDARLVF EDGEVVEHSA SKNEELLTSI LDTDEGARRL GELGIGMNRD IDRFTYNMLF
DEKMGDTVHM AVGRAYEDNV GVGNEQNESA QHVDMIVDMS EDSFIEVDGE VVQEDGTFVF
EDGFKG
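
The relationship between the gene sequence and the protein sequence can be illustrated by translating a few codons. The sketch below builds a codon table from the amino acid string published for NCBI translation table 11, whose codon-to-amino-acid assignments are the same as the standard code. It is an illustration under simplifying assumptions: it does not handle the alternative start codons that table 11 allows (this gene starts with ATG, so that does not matter here), and the function name `translate` is a hypothetical helper.

```python
from itertools import product

# Codon order follows the NCBI convention: bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGACCCAC GCATCCGCGA ACACGCACAG"))  # MDPRIREHAQ
# Last 21 bases of the gene sequence, which are in frame and end with TAA.
print(translate("GAAGACGGCT TCAAGGGATA A"))  # EDGFKG
```

The two outputs match the first ten and last six residues of the protein sequence above.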