Gene Huta_0995 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0995 
Symbol 
ID8383268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp961560 
End bp962993 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content61% 
IMG OID644972059 
Productexopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 
Protein accessionYP_003129911 
Protein GI257052078 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAACCC ACTGGCGGTA CCGGATTACG GGCGTGTTCG GAACCGCCAT CATCGCGTTC 
GTCGGGGTCG TCCTGTCGAA CGAGCCACTT ATCCGATCGG TTACCGGTCT GGTGCCGGTG
TTGTCGACAC TGCCGGCCGA TCGGGCGGTC GGCGGTGAAC TCCTCTTCGA GGCCACGACG
GCGACGGTCG TGGTGCTCGC GGCGATGGCA CCGCTGTACA AACCGCGCCC ACGCCGGATC
CTGGACACGT GGATGAAGGC CGTCCGTCGG ACGATACTGG CGTTACTTTC GCTTGCGACC
ATCGGCTACT TCGATTACAC CTATCAGCTT CCACGGGCGA CGTTGCTTGT AACCGGTGGC
TTCCTGCTGG TGGCCCTGCC GCTTTGGTTC GTGGCGATCC GGCGTCGACC GGCCGAAACC
GGCGAGCGCA CGATCATCAT TGGCGACGAC GCAGCGCCAA TCCGGGAGAT CTTGCGCGCC
GTGGATCGAC GTGTCATCGG CTACGTTTCA CCACCATCCG CGTACGTGGG AGACGACCAG
CCACCGGCGG TGCGCGCCCG AAAGACGGAC GGTGGAGAAG TGCCGAGCCC TATCGAGACC
CTGTCGTATC TCGGGGGGCT CTCACGGCTC GAAGAGATCC TGCTCGATTA CGACATCGAC
ACGGCCATCC TCGCGTTCGA CCGACCGGAC CGTGCGGAGT TTTTCGGAGC ATTAGACACC
TGTTATAAGC ACGGTGTCGC CGCGAAGGTA CACCGGGATC ACGCCGATGT CGTCCTGACC
AACGGGGCCG CAGGCAGCGA ACTCGTCGAC GTCGAACTCG AACCCTGGGA CTGGCAAGAC
CATCTGATCA AGCGCGCGTT CGATCTCGCG TTCGCCACAG TGGGATTGAT CGTCCTCTCG
CCGATGATCG GGCTGATCGC CGTCGCGATC AAACTCGAAG ATGGGGGCTC AGTTCTCTAC
AGTCAGGACC GGACGGCCAC GTTCGGGGAG ACGTTTACCA TCTATAAGTT CCGAAGCATG
GTGCCTGATG CCGAAAGTGA GACCGGCGTG AAACTGAGCG AAGAAGACAG CGGAGGACGC
GACCCCCGGG TGACCAGAAC GGGCCGGATC ATCCGTCAGA CACATCTCGA CGAGATCCCA
CAGTTGTGGT CAATACTGGT CGGAGACATG AGCGTCGTCG GCCCGCGTCC GGAGCGGCCC
ACACTGGACG ACGATATCGA GAGTACCGTC TCCGAATGGC GACGACGGTG GTTCGTGAAG
CCCGGATTGA CGGGACTCGC CCAGATCAAC GATCTGACCG GAACCGAACC CGCACAGAAG
TTCCGCTACG ACATGACGTA TATCCGCAAA CAGTCCTTTT GGTTCGACCT GCAGATCGTG
GTTCGGCAAA TTTGGAAGGT GCTGGGTGAC CTCTGTAAGC TAACCCACGG CTAA
 
Protein sequence
METHWRYRIT GVFGTAIIAF VGVVLSNEPL IRSVTGLVPV LSTLPADRAV GGELLFEATT 
ATVVVLAAMA PLYKPRPRRI LDTWMKAVRR TILALLSLAT IGYFDYTYQL PRATLLVTGG
FLLVALPLWF VAIRRRPAET GERTIIIGDD AAPIREILRA VDRRVIGYVS PPSAYVGDDQ
PPAVRARKTD GGEVPSPIET LSYLGGLSRL EEILLDYDID TAILAFDRPD RAEFFGALDT
CYKHGVAAKV HRDHADVVLT NGAAGSELVD VELEPWDWQD HLIKRAFDLA FATVGLIVLS
PMIGLIAVAI KLEDGGSVLY SQDRTATFGE TFTIYKFRSM VPDAESETGV KLSEEDSGGR
DPRVTRTGRI IRQTHLDEIP QLWSILVGDM SVVGPRPERP TLDDDIESTV SEWRRRWFVK
PGLTGLAQIN DLTGTEPAQK FRYDMTYIRK QSFWFDLQIV VRQIWKVLGD LCKLTHG