Gene Huta_2419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2419 
Symbol 
ID8384719 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2484965 
End bp2486881 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content66% 
IMG OID644973493 
Productprotein of unknown function DUF58 
Protein accessionYP_003131318 
Protein GI257053485 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGAAC GCTCTCGGCG CCCCGACGAG GAGACGGAGG GGGCCGCCGC AGTCGGATTG 
GGCCGGTTTC TGAGTGTGCT CGGTGTGTTC GCGGTCGCGG TCGGCTTCCT GCTTTTGGTT
TCGCCGGATC TCGGGGAGTC GCTCTCGGTG CAATACATCG TCGTGCTGGC GATCGGTGGG
CTGTTGCTCT TGTTCGCCGC TCGCCGGTGG TTCAAACGAC TGACTTCGTC GATCGACACC
GCCGAAACCC CGCCTGTCGA GCGTCGAACG ACGGTCTCGG TTCCGGGCGA CGGTTTCGAC
AAACTGCTCG TCGACACCGC ATCGAACGCG ATGGGCCGGT TGCAGGTGAA GGCGACGGCC
CGCGAGCGCA TTCGTGCGGT TGCCAAAGTG GTGGTTAGTG GGGATCCCGA GGATATCGAC
GACCAACTTG CAGCCGGGGC CTGGAGTGAT GATCCCGACG CCAACGCGCT GTTCTCGAAC
GGAACTGCCA GCGTCCGGGA TCGGGTCTCG TCGTTCGTCA GCGGGACATC CATCCTGAAA
CGGCGGGTTG TCAGCGCAAT CGAAGCCCTG GCCCGTCTCG CTGACGAGGA TGTCGAATGG
GAGGCCGAGC CGGTCGTCCC GGAAGTAACG CAAGAGCCCG CCGAGGAAGG CGACCATGCG
ACCGGCCGCT GGAACGGACT GACTGCCGTG GCACTGACGG TAGTCGGGTT CGGTGTTCTT
CTGGCCCGGC CCGGTTTGGT TCTTTCAGGC GCTGTTCTGT CAGGGCTCGG CGCGTACGCG
GTCGCCGGGT CCTCCCCCTC GACGGCGGTC CGCATCTCCC GGGAGATCCA GCCGGCCGCT
CCGCGACCCG GCGAACCCGT CGACGTCACC GTCGAAGTCG AAAACATCGG TGAGCAGTTT
CTCCCTGACC TCCGGATCGT CGACGGCGTG CCGGCTGATC TCACAATCGA GGCGGACAGC
CCACGTCACG GGACCGCGCT TCGCCCCGGC GCGACGATGG AATACACGTA TACAGTCCGT
GGGATCCGCG GCAGCCACAC GTTCGAGGAC GCGTTTCTCG TCTCCCGAAA CCTTCCGGGG
ACGCTCGAAC GAGTCGAGGA ATTCGGTGTC GATGGCGACC GGACCGTCAC GTACGATGTC
TCCTCGGCCC TCGATCTGTC GGTCCCGCTT CGCAAACAGG CCTCGATGCA CGTTGGGCGT
GTCTTGACTG ACTCAGCCGG GAGTGGCCTG GAGTTTCACT CGGTTCGGGA ATATCGAAGC
GGTGACCCAC TGACACGTAT CGACTGGAGT CGGGCGGCCC GTGGCGAAGG GCTGGCGACG
CTGCAGTTTC ACGAGGAGCG AGCGGCGACT GTCGTCCTGT TGATCGACGC TCGCAAGGAG
GCCTACGTCG CCAACGACGA CGATTCCCCC TCGGCCGTCG ACCGGAGTGT CCTCGCGGCG
GCGAAGCTCG CGTCGGCGTT GCTCGCGGCG GACGATCGAG TCGGCTTGGC CGCCCTCTCG
CCCAGACAGT GCTGGCTCGC GCCCGGAGCA GGGCATACGC ACCTCGCACG CCTGCAGGAC
GTGCTCGCGA CTGACGGGGC CTTCGCTCCG TCGCCACCGA CGCTCCCGTA CTACCAACGG
ATCAACCTGC CTGCGCTCCG GAAACGGTTA TCGTCCGACA GCCAACTGGT CGTGTTCTCG
CCGCTGGTCG ACGACGAAGT AGTCGACATC GTCCGCCAAC TCCAGGCTAG CGGCCACCCG
GTAACGATCA TCAGTCCGGA CGCTTCTGGC AGTGGAACGC CGGGTCGGAC GCTCGCGCGA
CTCGAACGCC GGAAACGACT CTCGGAGCTT CGGGGAGCCA ACGTTCGCGT GGTCGACTGG
GACGCCGATG AATCGCTCGC ACTCGCGCTG ACGAACGCCG GACGGCGGTG GTCATGA
 
Protein sequence
MRERSRRPDE ETEGAAAVGL GRFLSVLGVF AVAVGFLLLV SPDLGESLSV QYIVVLAIGG 
LLLLFAARRW FKRLTSSIDT AETPPVERRT TVSVPGDGFD KLLVDTASNA MGRLQVKATA
RERIRAVAKV VVSGDPEDID DQLAAGAWSD DPDANALFSN GTASVRDRVS SFVSGTSILK
RRVVSAIEAL ARLADEDVEW EAEPVVPEVT QEPAEEGDHA TGRWNGLTAV ALTVVGFGVL
LARPGLVLSG AVLSGLGAYA VAGSSPSTAV RISREIQPAA PRPGEPVDVT VEVENIGEQF
LPDLRIVDGV PADLTIEADS PRHGTALRPG ATMEYTYTVR GIRGSHTFED AFLVSRNLPG
TLERVEEFGV DGDRTVTYDV SSALDLSVPL RKQASMHVGR VLTDSAGSGL EFHSVREYRS
GDPLTRIDWS RAARGEGLAT LQFHEERAAT VVLLIDARKE AYVANDDDSP SAVDRSVLAA
AKLASALLAA DDRVGLAALS PRQCWLAPGA GHTHLARLQD VLATDGAFAP SPPTLPYYQR
INLPALRKRL SSDSQLVVFS PLVDDEVVDI VRQLQASGHP VTIISPDASG SGTPGRTLAR
LERRKRLSEL RGANVRVVDW DADESLALAL TNAGRRWS