Gene Huta_0956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_0956 
Symbol 
ID8383229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp921135 
End bp922856 
Gene Length1722 bp 
Protein Length573 aa 
Translation table11 
GC content65% 
IMG OID644972020 
Productflagella protein 
Protein accessionYP_003129872 
Protein GI257052039 
COG category[N] Cell motility 
COG ID[COG3351] Putative archaeal flagellar protein D/E 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.451324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACGC TCTCGTTGCT CCAGTCCGGG ACAGTGGTCG AGGGGGTGGA CCCTACCCTC 
CTCGCGGCCA CCGGTGACTG GATCGTCGAG TCGGTCGGCT CGCTGTCGAC CGAACCGCTC
GCCGTCGGTG TCCTGACGGT GGGGGTGGTC GGCATGAGTA TCCGATCCGT CTTCGACTCG
ATCCTCTCGG ACGAGGACGG GGGCCCCGAC GTGGACGAGG GGGACGACGA CGGGTTGATG
CCCGAGGAAG GCGGGGATGA CCTCGGCGAC CTTGGCGGCG GGTTCGGTGA TGGCGGCGAC
GACTTCGGAG ATTTCGAAGA CGACGACTTC GGCGAGATGG ACGGGGGCGG TGCGGACACT
GACGAACTAC AGCACCGCCT CGACGAGCTC GAAACCGAGG TCGGCAGCCT CTCCTCGACG
GTCAACACGG TCCGCAACGA GAACGAGCAG ATCTCTGAAA CGGTCGACGA CGTCGAGGAA
AACGTTCGAA AACTGCTCGA CATCTACGAG ATGGTGACTC GGGGCGTCAA TCCCTTCGCC
GACGACATCG ACGCCGGCGG ATTGGGCGGC CCCGGAGAGG AGTCGTTTGG CCTCTTCGAC
AATGGTGACG ACCAGTCCGA AGAGGGAGAC CTCGACGAAG ACATCGCCAA CGCCGACGCG
GAAGGGTTCT TCGACGAAGA CCTCGTCGAG GACGACGAGC TGGAAGCCGA TGCCGACGTC
GGCGATGTCC TGGGCACGGG CGAAGACGAC GGCGGTGACG ACGGTGGGTT CGAAGACGAC
TTCGAGGACG ACTTCGACAT GGACGATGAT TTCGGCGACG CCGAAGACGA CTTCGACATG
GACGAGGGCG GCGACGGTGA CGCGGACAGC GGCGGTGAGG GCGGGAAATC CTTCGCCGAG
CTCAAAGACG AGTACGAGGC CGGCGACGCC GAATGGGCCG AAGGCGACGC CGAAGATCCG
GACGAAAGCA TCGAGGAGAC GACCGACGAC CTGGCGGCAG ACGACGACTC GCTTGATGAC
GATGAAGCCC TCGTGGACGA GGACGACGAC TTCGCCATGG AGGACGACGG TGGAGAGGCG
GGCGACGGGC TGGCCGACGA CGATCTCTTC GATACGGTTA TCGAGGACGA GGGCGACGAA
GAGAGCGCAG ACGAGGCTGT CGATGCGACT GAATCCGCCA CGAGCGTCGT CGAGGACGAC
ACAGAGACAG AAGAAACCGA GCCGGATCAG GAGACGATTA CCCCCGAGGA GGAGAGCGTC
ACGGAAACGG CGCAGACCGA GGAGGACGCG ACTGAGCCGG CGGCCAACGC CGAAGCGCCG
TCAGCCAAAG AAGAGGCCGA CGGGTCGGGA GCAACCGAAG GAGCGAGCGC CGAATCCGAC
GACGGAAAGC CATACCTCAC GTCGCTTCCG GACGGGTTCC TCGCCGACCT GATCGTCGTC
GAATGGCTGG AGTTCCTGGT CGAGGAAGTC GGGATCCGGG CCACCGCGGA GGCCATCGAC
TACTACGAAC GCATCGACTG GATCGACGAG TCCGTCGCCG ACCAGTTACA GGCGTACCTC
AAGGGCTTCG AGGAAGGCGG TGAGTCCGAG AGCCTGACCA TCGACCACCA CACCAAGAGT
CTGCGGTACG TGAGCCAGCT GAACGGCGGC GGGGCGGAGT CGATCGCCCT CCAACAGCTG
CCACGCCAGA CCGGAGGTGG CCCCGATGGG ATTCAGCGTT AG
 
Protein sequence
MSTLSLLQSG TVVEGVDPTL LAATGDWIVE SVGSLSTEPL AVGVLTVGVV GMSIRSVFDS 
ILSDEDGGPD VDEGDDDGLM PEEGGDDLGD LGGGFGDGGD DFGDFEDDDF GEMDGGGADT
DELQHRLDEL ETEVGSLSST VNTVRNENEQ ISETVDDVEE NVRKLLDIYE MVTRGVNPFA
DDIDAGGLGG PGEESFGLFD NGDDQSEEGD LDEDIANADA EGFFDEDLVE DDELEADADV
GDVLGTGEDD GGDDGGFEDD FEDDFDMDDD FGDAEDDFDM DEGGDGDADS GGEGGKSFAE
LKDEYEAGDA EWAEGDAEDP DESIEETTDD LAADDDSLDD DEALVDEDDD FAMEDDGGEA
GDGLADDDLF DTVIEDEGDE ESADEAVDAT ESATSVVEDD TETEETEPDQ ETITPEEESV
TETAQTEEDA TEPAANAEAP SAKEEADGSG ATEGASAESD DGKPYLTSLP DGFLADLIVV
EWLEFLVEEV GIRATAEAID YYERIDWIDE SVADQLQAYL KGFEEGGESE SLTIDHHTKS
LRYVSQLNGG GAESIALQQL PRQTGGGPDG IQR