Gene Huta_1895 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1895 
Symbol 
ID8384186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1905643 
End bp1906893 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content69% 
IMG OID644972963 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003130797 
Protein GI257052964 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2223] Nitrate/nitrite transporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGAGG AGGTCGACCA CGCCGCCCGT GCCCGTGACG TCCTCGGGTT CTCGCGGTGG 
TGGCTGGTGC TCGCGGCCGC GGCCATGATG GCCGTCGTCG GCCCCTACCA GTACGTCTGG
AGCAGTCTCC GGGATCCAGT CGCGACCAAC CTGGGGATCG ACAGCGCCGC ACTGTCGACG
GTGTTCACGC TGTTCGTCGT CGTCCAGGCC GGGAGCCAGT TCCCGGTCGG GTGGTGGCGT
GATCGACACG GTCCCCGCGC GGTGAGTGTC GCCGCGGCTA TCCTCGCCGG CGGCGGCTAC
CTCGGCCTCT CAGTGGCCGA GACGACCTGG GAGATCTATC TCGCCTATTC GCTGGGAGCC
CTGGGCGTCG GCATCGTCTA CACCGTCGCC GTCAACACCG CGTTGAAGTG GTTCCCTGAC
CGGCGTGGAC TCACCGCGGG GGTCGGTACG ATGGCCTTCG CGGGGGGGAG CGCCGCCTTG
GTCCCCTACG TTCGGGCGAA TACCGGCGCG GGTGCGCCCG TGACGGCATA CGTCGGCGTC
CTCCAGCAGG TCGGCGTCGT GATCTTCGCG GTCGTGCTCG TCGGTGCGCT GGTCCTCCGG
GACCCACCCG AGGGATGGCT GTCCGACGGC AGCGTCACCG ACACGGGCCC ACAGTTCACC
TGGCGGGAAA TGGTCCGAAC CCGGCAGTTC TGGCTCATGT ACGCCATGTT CGTCGCCGTC
TCTGGGGCCG GCCTCATGCT GACCGAGAAG ATCGTCTCCT ACGCCGACCA CCTGGGACTC
GCGGGCGTGA TCGCCACGGC AGCCGCGACC CTCCTTCCGC TGGCCGGCGG CATCGGCCGA
CTCGTGCTGG GTGAGGTCAG CGACCGGGTC GATCGGACCA ACGCGATGGC CGGGGCGTTC
ACGCTCTGTG GCCTCGGCCT GTTCGCCGTC GCGTACTTCG GCGTCAGCGG GATGGGGAGT
GCCTTCGTCG TGGCAGTCGT CGTCGCCACC TTCTTCTGGA GCCCGCAGTT CACGCTGTTC
CCCAGCGTCG TCGGTGACTA CTACGGCGAG AAACACTCGA GTGCGAACTA CGCCCTGCTT
TACTCCGGGA AGATCTGGGG CGGCGTCTTC GGCGGGACCG TCACGGGTGC CGCCGTCGTC
GCCGTCGGCT GGACGGAAAC GTTTCTGCTC GGGGGCACAC TGGCCGTCCT GGCAGGTGTG
GCTGCCCTCG GACTCGACGC GCCGTCGCCA CCTGACGCCG ACAAACGCTG A
 
Protein sequence
MNEEVDHAAR ARDVLGFSRW WLVLAAAAMM AVVGPYQYVW SSLRDPVATN LGIDSAALST 
VFTLFVVVQA GSQFPVGWWR DRHGPRAVSV AAAILAGGGY LGLSVAETTW EIYLAYSLGA
LGVGIVYTVA VNTALKWFPD RRGLTAGVGT MAFAGGSAAL VPYVRANTGA GAPVTAYVGV
LQQVGVVIFA VVLVGALVLR DPPEGWLSDG SVTDTGPQFT WREMVRTRQF WLMYAMFVAV
SGAGLMLTEK IVSYADHLGL AGVIATAAAT LLPLAGGIGR LVLGEVSDRV DRTNAMAGAF
TLCGLGLFAV AYFGVSGMGS AFVVAVVVAT FFWSPQFTLF PSVVGDYYGE KHSSANYALL
YSGKIWGGVF GGTVTGAAVV AVGWTETFLL GGTLAVLAGV AALGLDAPSP PDADKR