Gene Huta_0921 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0921 
Symbol 
ID8383194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp887466 
End bp888797 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content61% 
IMG OID644971985 
Productprotein of unknown function DUF21 
Protein accessionYP_003129837 
Protein GI257052004 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAAACG TCGCGCTCTC GGCAGCACAA CTTGTCTTGG CGCTGATTCT GGTGGTTTTA 
AACGGCTTCT TCGTCGCTGC GGAGTTCGCC TTCGTTCGGG TTCGGGGGAC GTCGGTCGAC
CAGCTCGCCG AGGAAGGGCG GCCCGGCTCG GCGACGCTGC AAGAAGTGAT GACGAATCTC
GATAACTACC TCGCCACGAC GCAACTCGGC ATCACCATCG CCTCGCTCGG ATTGGGGTGG
GTCGGCGAAC CCGCCGTGGC GGCGCTCATC GAGCCAATAC TGGAATCGGT TCTCCCGGCG
AGTCTCATCC ATCTCGTGGC GTTCGCAATC GGATTTAGTA TCATCACGTT CCTCCACGTC
GTCTTCGGTG AACTCGCGCC GAAGACGATC GCAATCGCCC AGACCGAGCG ACTCTCGCTG
TTTCTCGCCC CGCCGATGAA GTTCTTTTAT TTCATACTCT ATCCGGGTAT TGTCGTCTTT
AACGGGGCGG CTAACGCGTT CACGCGGTCG CTCGGTGTGC CGCCCGCTTC CGAGACGGAT
GAGACACTCG GTGAGCGAGA GCTCCTCCGG GTACTCACAC GATCGGGTGA GGTGGGGGAC
ATCGACCTGG CAGAAGTGAC GATGATCGAG CGGGTCTTCG ATCTCGACGA CATCGTGGTG
CGGGAGGTCA TGGTTCCACG GCCTGACGTG GTGAGCGTTC GGGCGGATGC CGCGCTCTCG
GACCTCCAGT CGATCGTCCT CGAGGCTGGT CATACCCGCT ATCCAGTGCT TGCCGCCGAG
GACGGCGATC AAGTGATCGG ATTCGTGGAC GTCAAGGACG TGTTACGAGC GGAGGTGGAG
GGTGGGGACG CCGAGTCAGT CGGCGACATC GCCCGCGAGA TCGCGATCGC CCCCGAGACG
ATGGCACTCA GCGATCTCCT GAGACAGTTC AGGGAAGACC AACAGCAAAT GGTTGCAGTT
ATCGACGAGT GGGGGGCGTT TGAGGGGATC GCAACGGTTG AAGACGTCGT CGAGGCACTC
GTCGGGGACC TCCGGGATGA GTTTGATATG GACGAGCGCG AACCCTCGAT TCGCCCGCGT
GATGATGGGG GATACGACAT TGATGGGGGC GTCCCGTTGT CAAAAATCAA CGACATGATC
GAGGGGGAGT TCACGAGTGA CGAGGTCGAA ACGATCGGTG GGCTGGTACT CGAGCAACTC
AACCGTGCGC CGGAACGTGG CGATCGCGTT GCGGTCGCCG GGTACGTCGT CACGGTGACG
AGCGTCGAGG GGTCCCGAAT TTCGACGATC CGGGTCCAGG AACGTCAAGA GGGCGACTCA
GCAGTAGACT GA
 
Protein sequence
MVNVALSAAQ LVLALILVVL NGFFVAAEFA FVRVRGTSVD QLAEEGRPGS ATLQEVMTNL 
DNYLATTQLG ITIASLGLGW VGEPAVAALI EPILESVLPA SLIHLVAFAI GFSIITFLHV
VFGELAPKTI AIAQTERLSL FLAPPMKFFY FILYPGIVVF NGAANAFTRS LGVPPASETD
ETLGERELLR VLTRSGEVGD IDLAEVTMIE RVFDLDDIVV REVMVPRPDV VSVRADAALS
DLQSIVLEAG HTRYPVLAAE DGDQVIGFVD VKDVLRAEVE GGDAESVGDI AREIAIAPET
MALSDLLRQF REDQQQMVAV IDEWGAFEGI ATVEDVVEAL VGDLRDEFDM DEREPSIRPR
DDGGYDIDGG VPLSKINDMI EGEFTSDEVE TIGGLVLEQL NRAPERGDRV AVAGYVVTVT
SVEGSRISTI RVQERQEGDS AVD