Gene Huta_0741 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0741 
Symbol 
ID8383011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp726859 
End bp727950 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content67% 
IMG OID644971804 
Productpeptidase M29 aminopeptidase II 
Protein accessionYP_003129659 
Protein GI257051826 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2309] Leucyl aminopeptidase (aminopeptidase T) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGATC GCATCCACGA ACACGCCGCA GTGCTGGTCG ACTGGAGCGC ACGGATCGAG 
TCGGGTGACG ACGTCGTTCT CTCCGTGGAC GAGGGCGCAC ACGATCTCGC TGTCGCGGTC
GCCGAAAAGC TGGGTGACCG GGGCGCGAAC CTCGTGAACG TCTATCGCTC TGACGAGATT
CAGCGTGCAT ATCTCCAAGC GCACGACGAC GATTTTGACG ACGATCCTGA GTACGAACGC
ACACTCTACG AGAACGCCGA CAGCGTCCTC GTGCTGAAGG GTACACGCAA CACCGCTGGA
ATGGCCGACG TCCCCGACGA CCGCCAGCAG GCGTTTGCCC GCGCCAGAGA AGAGGTCCGG
GAAGCGCGCC TGGCGACCGA CTGGGTCTCG ACGCTGCATC CGACCCGCGC ACTCGCCCAG
GGGGCCGGGA TGGCGTTCGA GGAGTACCGC GAGTTCGTCT ACGACGCCAC GCTCCGGGAC
TGGGAATCCC TCTCCGAAGA GATGGATCGA CTCAAGACGA TTCTCGACCA GGGCGACGAG
GTCCACATCG ACGCCCCTGG CACCGATCTC ACGCTCTCGA TCGCGGGGCG GACGGCAGTC
AACAGCGCCG CGTCGGTGGC CTACGACTCC CATAACCTCC CCAGCGGCGA GGTCTTCACC
GCGCCCGCCG ACGCCGAGGG CGAGGTCACT TTCGACGTGC CGATGACGGT CCGGGGCAAC
ACCCTGCGGG ACGTCCACCT CGTCTTCGAG GACGGCGACG TCGTCGAGCA CGCGGCCGCG
GCCGGCGAGG AGACGCTGGC AGCGCTACTG GAGACCGACG CTGGCGCTCG TCGGCTCGGC
GAGCTCGGCG TCGGCATGAA TCGCGGCATC GACCGCTACA CGGACAATAT CCTCTTCGAC
GAGAAGATGG CCGAGACCGT CCACCTGGCG CTGGGCCGGG CCTACGACGC CTGTCTGCCC
GAGGGCGAAT CCGGCAACGA CAGCGCGATC CACGTCGACC TGATCGCCGA CACGAGCGAG
GACGCGACGC TGTCGGTCGA CGGCGAAGTG ATCCAGCGGG ACGGCGTCTT CCGGTGGGAA
GACGGCTTCT AG
 
Protein sequence
MDDRIHEHAA VLVDWSARIE SGDDVVLSVD EGAHDLAVAV AEKLGDRGAN LVNVYRSDEI 
QRAYLQAHDD DFDDDPEYER TLYENADSVL VLKGTRNTAG MADVPDDRQQ AFARAREEVR
EARLATDWVS TLHPTRALAQ GAGMAFEEYR EFVYDATLRD WESLSEEMDR LKTILDQGDE
VHIDAPGTDL TLSIAGRTAV NSAASVAYDS HNLPSGEVFT APADAEGEVT FDVPMTVRGN
TLRDVHLVFE DGDVVEHAAA AGEETLAALL ETDAGARRLG ELGVGMNRGI DRYTDNILFD
EKMAETVHLA LGRAYDACLP EGESGNDSAI HVDLIADTSE DATLSVDGEV IQRDGVFRWE
DGF