Gene Huta_2114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2114 
Symbol 
ID8384408 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2151197 
End bp2152327 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content64% 
IMG OID644973183 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_003131014 
Protein GI257053181 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4608] ABC-type oligopeptide transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAA CTGAATCAGA CGAACCACTG CTCTCGATGG AGGACGTCGA GGTACACTTC 
AAACCCGGCG GCGTTATCGA GAAAGCGCTC TCCGAGGAGA TCGTTCGGGC CGTCGACGGA
ATCTCCCTCG AAGTCGAGGA GGAGGACATC GTCGCACTGG TCGGCGAGAG CGGGTGTGGC
AAAACCACGC TCGGGAAGGC CGCGATCGGC CTCCAGCGAC CGACCGGCGG TTCGATCAAG
TACCGTGGCC AGAACATCTG GAAGGCGAAG GACCGCTCGA GCGACGCCAA GATCTCGAAG
GACGAGATCC AGCAGGCGTT GCAGATCATC CACCAGGACC CGGGGAGTGC GCTCAACTCT
TCCCGGCGCG TTCGGGCGAC CCTGGCTGAC CCACTCAAGC GGTGGCGCAA GGAACTCGGC
CCCGACGAGC GCCTCGAGAC GATCTATCAC TTCCTCGAGT ACGTCGGGAT GACGCCGGTC
GAGGACTACG CCGAGCGGTT CCCCCACCAG CTCTCGGGCG GCGAACAACA GCGGGTCGTC
CTCGGGCGGG CGCTGTTGAC GAATCCCGAC CTCGTGCTCG CGGACGAGGC GGTGTCGGCG
CTGGACGTCT CCCTGCGCGT CGAGATGATG GACCTGCTGC TCGAACTCCA GGACATGTTC
GGGACCTCGT TCGTGTTCGT CTCCCACGAC CTGGCGAACG CCCGCTATCT CACGAAGAAG
TCCGACGGCC GCATCGCCGT GATGTACCTC GGCGACATCG TCGAAATCGG TGATCCCGAC
GAGCTCATCG AGAACCCGAC CCACCCCTAC ACGAAGGTGC TGCGGTGGTC GACGCCGCCG
GCCGACCCGG ACGTGGCCAG CGAGACCATG CACATGCAGC CGCCGGTCCG CCGGATCGAC
ATCCCCGACC CCGCAGATCC GCCGGAAGGC TGTAAGTTCC ACACCCGGTG TGAGCACGCT
CGCGAGGTGT GTAAACAAGA GGACCCGGAC CTCTACGACG CCGATGGCAC CGATGCGAAG
TGCTTCCGGG CGCTGGACAA CCACGAGTAC TGGCACAGTG AGGAACTCAC GGATCGCGAG
GAACTCGGCT TCACCTCAAG CCTGGACGAG GAAGAGCCGG CGGACGACTG A
 
Protein sequence
MSETESDEPL LSMEDVEVHF KPGGVIEKAL SEEIVRAVDG ISLEVEEEDI VALVGESGCG 
KTTLGKAAIG LQRPTGGSIK YRGQNIWKAK DRSSDAKISK DEIQQALQII HQDPGSALNS
SRRVRATLAD PLKRWRKELG PDERLETIYH FLEYVGMTPV EDYAERFPHQ LSGGEQQRVV
LGRALLTNPD LVLADEAVSA LDVSLRVEMM DLLLELQDMF GTSFVFVSHD LANARYLTKK
SDGRIAVMYL GDIVEIGDPD ELIENPTHPY TKVLRWSTPP ADPDVASETM HMQPPVRRID
IPDPADPPEG CKFHTRCEHA REVCKQEDPD LYDADGTDAK CFRALDNHEY WHSEELTDRE
ELGFTSSLDE EEPADD