Gene Huta_1068 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_1068 
Symbol 
ID8383342 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp1042073 
End bp1043197 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content54% 
IMG OID644972133 
ProductCapsular polysaccharide biosynthesis protein- like protein 
Protein accessionYP_003129984 
Protein GI257052151 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4421] Capsular polysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGGGA GAAAGTACCT TAGTCAGGTT CTCAACCTTT CGTCTACCCT TCTATCTACG 
GTTGGGAAGC CGGTGGTGAT GCAGCGTCCC GAACTGAAAC AGTATGCACG GGAACGGGGT
CAGTTATTCC ACTTTGGTTC ATCTGAGACC TATTCTTTCG GCCCGCCCCA ATATCCCGAC
GAAGTGCTGT ACTCCCTGCA ACGCAGATTT GGCGAATATC AAACTCCGAA GCCGTTCGTA
GCCGAAGTGC GCGACGTTTC ACTAATCGGT CTCTATCCAA TTCCGTTCAA GAGGGGAAAT
GCCCTTGTAG AACCTGTCGT CAGCGAACGG AACTTAATCC TCAATCTCTT CTATTCGGTT
GTTGACTCAC GCCTGCAACA CCGCAGCCTG ACGTCGAAGG ATTTCGATTG CGCCTGCCTC
CTATTCAACT CCCAGAGCCG GGGGGTGTTT CATTGGATGG TAGAGGACGC CTTGAGAGCC
GAAGGCGTCC TGAGATACGA AGAGAAAACG GGTCGCCGTC CGACTCTTAT TATACCCCCA
AACCCGAAAT CGTGGCAGAC CGAGACACTG GAACTGCTTG GATTCGAGTC GGAAGACTGG
GTCGAATGGG ACGCTTTTAG AGGGAAGGTG GACCGATTGG TCGTCCCCTC CGTCCGGCGG
ATATACGATG ATGGAGTGGT CTCCCCGGCA CAAACGGAGT GGTTCAGTGA GCGGATGGTG
GGCGGTGCAG AAGGCCAAGT GGAGGCTTCC AAAACCTCCT CACGGGTGTA TATCTCCCGT
GACGATGCCG GGAGACGTCG GTTGACGAAC GAAGACGATC TTATGGATCA ACTCGGTGAT
CTCGGGTTCG AACGCCACTA TCTTGAGCGC ATGTCCACCG CCCAGATCGT GAGCCTGTTC
AATAACGCCA ACATAATAGT CGCTCCTCAC GGAGCAGGCT TGACCAACAT CATGTTCGCA
ACGGATGCCA GCGTCATCGA ACTGCGCCCC AACGACTCCT ATTCCTGGGT ATATTACGTA
TTGAGCGAGC AAAATGGACT CGACTACTGC TACGTGATGG GTGACGACGA TAAGGAAGGA
ACGGACTTCC GGGTAGAGCC TGCGAAGGTC ATTGATGCTC TGTAA
 
Protein sequence
MIGRKYLSQV LNLSSTLLST VGKPVVMQRP ELKQYARERG QLFHFGSSET YSFGPPQYPD 
EVLYSLQRRF GEYQTPKPFV AEVRDVSLIG LYPIPFKRGN ALVEPVVSER NLILNLFYSV
VDSRLQHRSL TSKDFDCACL LFNSQSRGVF HWMVEDALRA EGVLRYEEKT GRRPTLIIPP
NPKSWQTETL ELLGFESEDW VEWDAFRGKV DRLVVPSVRR IYDDGVVSPA QTEWFSERMV
GGAEGQVEAS KTSSRVYISR DDAGRRRLTN EDDLMDQLGD LGFERHYLER MSTAQIVSLF
NNANIIVAPH GAGLTNIMFA TDASVIELRP NDSYSWVYYV LSEQNGLDYC YVMGDDDKEG
TDFRVEPAKV IDAL