Gene Huta_2507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2507 
Symbol 
ID8384809 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2579092 
End bp2580162 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content65% 
IMG OID644973581 
ProductProtein of unknown function DUF1119 
Protein accessionYP_003131404 
Protein GI257053571 
COG category[S] Function unknown 
COG ID[COG3389] Uncharacterized protein conserved in archaea 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0629064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCGGT ACCGCGGACT CACGCTTTCG ATTTCGGTGA TCGTCTCGAT CTTCCTGTTC 
GTTCAACTCG GCGCGCTTGC GCTGGTCGAT CCGTTCAAGA CTGCCGGACT CCAGGCGGTC
GAGGATCCCC AGAACCCGGT CAATAGCCTG CTGTACATCG CGGCGATCCT CGTGATGACC
GGCGTGATGC TCGCCGCGTT CAAGTACGAG GTCCAGTGGG CGATCCGTGG CCTGATCGTC
GCGACCGGCG CGTACATCGC CCTGCTCGTG TTCTCGATCC TGCTGCCGCC CGTCGTGACG
CTGCCAGTCG GGGACGGCCT CCACGGGCTT GCGTGGGTCG GCGCGATCGG CCTCGGCGTC
GCACTGTACG CCTATCCGGA GTGGTACGTC ATCGATGCCA CGGGTGCCGT CATGGGTGCG
GGAGCGGCCG GCCTGTTCGG TATCACCTTC GGTGTGTTCC CGGCCCTTGT CTTGCTCTCC
GTCCTGGCTG TCTACGATGC CATCAGCGTC TACGGCACCG AACACATGCT GACGATCGCT
TCGGGCGTGA TGGATCTCAA AGTCCCCGTC GTACTCGTCG CGCCGATGTC CGTCGGCTAC
TCCTTCCGGG AGGATACGGC AGGGCTCGAC GAGGAGTCCG ACAATGAGCA AGCGGATCCG
ACTGCGGACG ACGCCACCAC TGAGCCGGAG GACACCGACG TAACTGCCGA ATCCGGATCA
GCCGAGGCTG CTGAGGGCGA CAGCGCCGAT CCACTCGAAG ACCGTGAGGC GCTGTTCATC
GGTCTCGGCG ACGCGATCAT TCCGACGGTG CTGGTCGCAT CAGCCGCATT CTTCGCGGAT
GCGTCCGTTC CGACCGTCGA TATCGGCGCG TTCTCGGTCG CCGTGCCCGC AGCCACTGCC
GTGGTCGGGA CGTTCCTCGG GCTGGCCGTG TTGTTACGGA TGGTTCTGGC CGGGCGCGCA
CACGCCGGGC TCCCACTGTT GAACGGTGGG GCCATCGCGG GGTACCTCGT CGGGGCACTC
GCCAGCGGGA TGACCCTCGT CGAGACGCTC GGACTGGGGC CGTATCTTTA G
 
Protein sequence
MARYRGLTLS ISVIVSIFLF VQLGALALVD PFKTAGLQAV EDPQNPVNSL LYIAAILVMT 
GVMLAAFKYE VQWAIRGLIV ATGAYIALLV FSILLPPVVT LPVGDGLHGL AWVGAIGLGV
ALYAYPEWYV IDATGAVMGA GAAGLFGITF GVFPALVLLS VLAVYDAISV YGTEHMLTIA
SGVMDLKVPV VLVAPMSVGY SFREDTAGLD EESDNEQADP TADDATTEPE DTDVTAESGS
AEAAEGDSAD PLEDREALFI GLGDAIIPTV LVASAAFFAD ASVPTVDIGA FSVAVPAATA
VVGTFLGLAV LLRMVLAGRA HAGLPLLNGG AIAGYLVGAL ASGMTLVETL GLGPYL