Gene Huta_2131 details

Gene Information

Locus tag: Huta_2131
Symbol:
ID: 8384425
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2170771
End bp: 2173716
Gene Length: 2946 bp
Protein Length: 981 aa
Translation table: 11
GC content: 64%
IMG OID: 644973200
Product: Laminin G sub domain 2
Protein accession: YP_003131031
Protein GI: 257053198
COG category: [R] General function prediction only
COG ID: [COG1287] Uncharacterized membrane protein, required for N-linked glycosylation
TIGRFAM ID:
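
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions: that Start bp and End bp are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2170771
END_BP = 2173716
GENE_LENGTH_BP = 2946
PROTEIN_LENGTH_AA = 981


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))     # 2946
print(coding_length_aa(GENE_LENGTH_BP))  # 981
```

Under those assumptions both results agree with the Gene Length and Protein Length values listed in the record.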


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACGG GCCGCGACGA GGTCGAGTCG TTCCTCGAAG CCAATCCAGA GCTGGAAGCC 
GAACTCGAGG CACTCCTCAC AATCGACGCC CGCGGCCCGT GGGAGTTCGA CGACATCCCG
CTCGATTCAG GCGCGTTCGG TGAATGCGTG TCCCGGGGGA TCGCGGTCGA ACACGACGAC
GGGGGCTATC GCCTCGCCAA CCCTGACGCG GTGCGGGCCG CACTCGACCA CGACGTCGAT
TCCGCGGCCG AGTCCGAACA GTCGTCCAGG ATCCCGGACG TCAATGTTAC ACTCGATGTC
CCGCGGGAAA CGGTGCTGGC CTTGCTCGGC GCCCTCTCGC TACTCGTCGC ATTCCGTGTG
GTCTTCGTCT ACCAGGGAGT TTTCCGGGAG GTGATCACGC TGCTGGGGAA CGACCCGTAC
AAGCGACTGT ACTGGGTCGA ACAGCTCCAG GCGTTCCCGG CATTCGATCC AGGTGCGCTC
GGCTCGATTC CGGAGGGCGT CCGCCTCACG GGTGATACGA TGATGCTCGT GACGCTCTGG
TGGGTGTCAG AGCTGTTCGG TGGGACCCCG GCGGCGGCTC GCGTTACGCT CGCGATCTAT
CCGGTCGTCG CTGCCGTCGT GACTGGGCTG TTCGTCTACG CGACGACGAA GCTCCTGACG
GACGACATTC GGATCGCGCT TGCGAGTGTG CTCATGCTCG CGGTGACGCC CATCAACGGA
TACCGGATGG CACTCGGGTT CGGCGATCAT CACGCCTTCG ATTACGTCTG GATCGCGCTG
ACAGTGCTGC TCGTCGTCTG GTGGGAACGC TCGGCCGACG GGATGGCCAC TGGCCCGATC
CGGAACCGAC TCCGGTCGCG ACGCTTGTGG GTAGCCATAG CAGCACTCGG GGGCACGCTT
GCGGTCCAGA CGATGAGCTG GGTCGGTTCC CCGATGGTGT TGGTCCCCTT CGGTCTGCTG
ATGTTCGTTC GGGGGGCAGT TGCGTTCCGG CAGTCGCGTT CTCCGGCAGT TGCGGCACTG
CCGTACCTCG GCGGAATCGG CGTCGCCAGC ACGATCGTCG GCACAATGCA TCTCGCGTTC
GGGTGGTTGC CGGTCTCGCG GGTGATCGTG CCGCTAGCCG TGTTCGTCCT CGGGGCGGGA
GCAGTCGGGT ACTTCGAGGC CTGTCGGTCA CTATCGTTGT CAGCGACAGT GACGGTGGCA
AGTAGTGTCG TCTTCTCGGT AGTTGGGACG GTCGGGTTCG CCCTCGTTGG ACCCGGAATG
GACGCCGTCG TCGAACGAGC AGTTGGGTTG TTCAGGGGGA AACAGATCGC GGAGTCAGCC
GGTCTGTTCA GTGCCGAATC CGGACTGGTC GTCCAGCCGA TCCTGTTCTT CGGATTCGTG
CTGTTTCTCG CGTTGCCGTT CATCGTCCTC GGCGCGTGGC ACGTCTTCGA GAACAACGAC
CCACGCTGGA CGGCACCCGT GGTCTATGCG GTGTACTTCC TGTTCTGGGC TGGCGTGAAA
GTTCGGTTCG GGGGGCCGCT GACCATATTC ACGGCAATCT TTGGCGGCAT CGGGTTCGTC
AAAGCTGCCG CATGGGTCGA CCTTGCCAGG CCAGTCACCA GCTTCACCGA GAAAACGCCG
ATCGCCCGTT TCGAACGACC CGAGGGCGGG CAGCTGGTGT CGCTGTTCCT GCTGTTTCTG
CTGGTGGGTA GTCTCGGGAT GGTCCAGCTC CCCATCAAGC AAAGTCAACT CGTCGTTTCC
GAGGATACCT ACGAGACGGC ACAGTGGATC GACGGCTACA GCGACGAACG GAACCTGGAG
TATCCCGAAA ACGGCGTCTT CACCGGGTGG AGTTCGACGC GCATCTACAA TTACTTCGTG
AACGGCCACG CGGACTCCTA CTGGTTCGAA CAACAGTATT TCGAGTCGTT CCTCGGGTCG
ACTACTCCGG ACGCGTGGTA CGAGCGACTG CGGGACCGGT ACGGGTTCGT CGTCTACAGC
AGGCCGATGA ATGGGTCGAG CGGCCCGACC GTCGAAAGAC AGCTGGAAAC CGGGGACTCC
ACGCCGGGTT TCGCCCAGTA TCGTCTGGTC CATACGAGCG GTCCGAAACG GGTGTTTCAG
CTGGTCGAAG GAGCGACGAT CGTCGGGATC GACCGGACCA GTGACACCGT CACGGCCGAG
ACGAGTCCGA CCGTTTCGGG GAAATCCCAT ACCTACGAGC GGAACGCAGC ACCCAACCCC
TACGGAACGT ATGCGGTGAC CGTCCCGTAC CCTGGATCGT ATTCGATCGC CGGCGACCAG
GTCGACGTTG CTGCCTCCGC CGTCGAGAAC GGGACCAGAG TGGTACGCCA CGCCAGAGAC
GGGCTTGCCC ACTGGCCCTT CGATGCGACC GACGGAACCG TCGCCTACGA CCGGGTCGGC
GGCATTCAGG GGGACATATC GAACGCGACG GTAGCCGAAA ACGGCGTCAA CGGCACTGCC
CTCGAATTCA CCCGCGAGAA CGACAGCCAG GTCCGGGCGG CCGTCGAGTC GCCCCCGGAG
TTCACGGTAA GCATGTGGCT CAAGCCCCAG GCGCTGGACA CGACCGAGGC GAACGATTAT
CGCATCCTGG CGCGGAGTGG ACGCGGACTG GTGCTCAACG TCGAGGAGAG TGGACGCCTT
ACCTTCCGGC TGCCGGGAAC TGACGCGAAG GCACTGGGTG GTGGGTCCGT CCCGGTCGGC
AACTGGACGC ACGTGGCCGC GACCTACGAC GGCAGCCAGC GAACGCTGTA TGTCGACGGG
GCGGCCGTTG CGACCGACAC CGTCGATGTC GGGCCTCCCT CCTGGGGTGG GCAACTGACG
TTCGGCGGAG GTGGTGACCC GACGCATACG TTCGACGGGA CGATCGACGA GATCCGGCTC
TATGAGCGTG CGTTGAACGA CACGGAACTT TCGGCCCAGG CAGTACAGTC CCGGAACGAT
CAGTGA
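
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be removed before computing anything from it. As a minimal sketch, the helper below (a hypothetical `gc_percent`, not taken from any library) joins the blocks and reports the percentage of G and C bases. The sample input is only the first two blocks of the sequence, so its result is not the GC content of the whole gene.

```python
def gc_percent(grouped_sequence: str) -> float:
    """GC percentage of a nucleotide sequence given as whitespace-separated blocks."""
    # Remove spaces and line breaks between the 10-base blocks.
    bases = "".join(grouped_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence, used only as sample input.
sample = "ATGGAAACGG GCCGCGACGA"
print(round(gc_percent(sample), 1))  # 65.0
```

Passing the full 2946 bp sequence to the same function would give a value to compare against the GC content field of the record.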
 
Protein sequence
METGRDEVES FLEANPELEA ELEALLTIDA RGPWEFDDIP LDSGAFGECV SRGIAVEHDD 
GGYRLANPDA VRAALDHDVD SAAESEQSSR IPDVNVTLDV PRETVLALLG ALSLLVAFRV
VFVYQGVFRE VITLLGNDPY KRLYWVEQLQ AFPAFDPGAL GSIPEGVRLT GDTMMLVTLW
WVSELFGGTP AAARVTLAIY PVVAAVVTGL FVYATTKLLT DDIRIALASV LMLAVTPING
YRMALGFGDH HAFDYVWIAL TVLLVVWWER SADGMATGPI RNRLRSRRLW VAIAALGGTL
AVQTMSWVGS PMVLVPFGLL MFVRGAVAFR QSRSPAVAAL PYLGGIGVAS TIVGTMHLAF
GWLPVSRVIV PLAVFVLGAG AVGYFEACRS LSLSATVTVA SSVVFSVVGT VGFALVGPGM
DAVVERAVGL FRGKQIAESA GLFSAESGLV VQPILFFGFV LFLALPFIVL GAWHVFENND
PRWTAPVVYA VYFLFWAGVK VRFGGPLTIF TAIFGGIGFV KAAAWVDLAR PVTSFTEKTP
IARFERPEGG QLVSLFLLFL LVGSLGMVQL PIKQSQLVVS EDTYETAQWI DGYSDERNLE
YPENGVFTGW SSTRIYNYFV NGHADSYWFE QQYFESFLGS TTPDAWYERL RDRYGFVVYS
RPMNGSSGPT VERQLETGDS TPGFAQYRLV HTSGPKRVFQ LVEGATIVGI DRTSDTVTAE
TSPTVSGKSH TYERNAAPNP YGTYAVTVPY PGSYSIAGDQ VDVAASAVEN GTRVVRHARD
GLAHWPFDAT DGTVAYDRVG GIQGDISNAT VAENGVNGTA LEFTRENDSQ VRAAVESPPE
FTVSMWLKPQ ALDTTEANDY RILARSGRGL VLNVEESGRL TFRLPGTDAK ALGGGSVPVG
NWTHVAATYD GSQRTLYVDG AAVATDTVDV GPPSWGGQLT FGGGGDPTHT FDGTIDEIRL
YERALNDTEL SAQAVQSRND Q