Gene Huta_2172 details


Gene Information

Locus tag: Huta_2172
Symbol:
ID: 8384466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhabdus utahensis DSM 12940
Kingdom: Archaea
Replicon accession: NC_013158
Strand:
Start bp: 2222983
End bp: 2224278
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 65%
IMG OID: 644973241
Product: quinoprotein (ISS); K06485 integrin alpha 6
Protein accession: YP_003131072
Protein GI: 257053239
COG category:
COG ID:
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
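
The coordinate and length fields above are linked by simple arithmetic: the gene length is the inclusive span between the start and end positions, and the protein length is the number of codons minus the terminal stop codon. The sketch below checks that relationship for the values in this record. The function name `expected_protein_length` is a hypothetical helper introduced here for illustration, and the check assumes an unspliced CDS that ends in a single stop codon.

```python
def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon (assumption)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2222983, 2224278

# Coordinates are inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

print(gene_length)                           # 1296
print(expected_protein_length(gene_length))  # 431
```

Both results agree with the Gene Length and Protein Length fields listed in the record.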


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACATCT ACCGCCGGCG GGCCCTGGCA CTCCTCGGCT CGGCGAGTAT CACGGCAGTT 
GCCGGCTGCA GCGGGATCGC CGCTCCGGGC CAATCCTTCG AAGACCGAAC GGGACTCACA
GCTGACGACG GCGGCACTGG CGACAATTTC GGCCACGAGG TCGCGGTATC GGCGGACGGA
TCGACAGTAC TCGTCGGCGC ATCTCGAGCA GACACGCCCG ACGGGACGGA AACAGGCGCT
GCCTACGTCT TCGAACGAAC AGACGGATCG TGGACCCAAA ACGGGAAGCT CGTTCCCGAA
ACGATAGAGG AAAAGTCTAT GTTTGGATTC GGTGTGGCAC TGTCGGCTGA CGGCAGGACA
GCTGTCGTCG GATGCGCGTT CGACGAGCGA CGCGAGACAC GCCCATCCGG CGCGGTGTAC
GTCTTCGAAC GCGTCGACGG AGAGTGGATT CAACGGGCGA GATTGATCGA GGACATGGCA
GCCGACCCTG CAGACAGATC TGCAGCGTAT ACCGATGCAC TCGGTGAATC CGTCGCGGTC
TCGGCCGACG GCGAGACAGT CCTGGCCGGC GCACCGTTCC ACGCTGTCAG TGGGAGCAGC
GGTGCGCCGC CGGTGTCGGG CGGGTCGGAC TTCGAAAACC TGGTTGCGTC CGTCTGGAAC
CCGATCGGCC CGGCACCGGG GGCGGCGTTC GTCTTCGAGC GATCCGACGG CGAGTGGCAC
CAGACGGCGA AGTTCGCCAC CGGCGAATGG CAGGGACAGA GCCACGTGGG ATCGGCCGTC
GCACTCACGG CTGATGGGTC GAAAGCATTC GTCGCCTCCC AGGGACGCAG TTCGGTTTCC
GTGTTCGAGC GCGTTGACGG ATCCTGGACC GAAGCATCGG CGCTCCCGAT CGATGATGAC
GTCTTCCTGA GGCGCGACGG TGTGATCGGC ATATCGACTG ACGGAACGAC GGCGGTCACC
GGGGACTACG GCGGTCCGGC AGCCGTCTTC GAGTGGGCGG ACTGCGAGTG GACCCGATCG
GCGACACTCG AAGCCAGCCC CACGGAGGAC CCCTGGAGTA ACGCGAATGT CGCGCTCTCC
GGTGACGGCG ATACCGCCCT CATCGTCAGA GAGAGTGGCG AGCGGAAGAA TGCCGTCGAG
GCTTTCACTC GGTCGGGAAG CTCCTGGAGC CGAAAGACGG TACTCACGAC TGGCGATGAC
AGACCGGAAA TGGGTTTCGG GGCGTCACTG GCCCTCTCCG GTGACGCCAC GACGGCCGTC
GTCGGTGCTG CTGGCGCAGC ATTCGTCTTC GAGTGA
 
Protein sequence
MDIYRRRALA LLGSASITAV AGCSGIAAPG QSFEDRTGLT ADDGGTGDNF GHEVAVSADG 
STVLVGASRA DTPDGTETGA AYVFERTDGS WTQNGKLVPE TIEEKSMFGF GVALSADGRT
AVVGCAFDER RETRPSGAVY VFERVDGEWI QRARLIEDMA ADPADRSAAY TDALGESVAV
SADGETVLAG APFHAVSGSS GAPPVSGGSD FENLVASVWN PIGPAPGAAF VFERSDGEWH
QTAKFATGEW QGQSHVGSAV ALTADGSKAF VASQGRSSVS VFERVDGSWT EASALPIDDD
VFLRRDGVIG ISTDGTTAVT GDYGGPAAVF EWADCEWTRS ATLEASPTED PWSNANVALS
GDGDTALIVR ESGERKNAVE AFTRSGSSWS RKTVLTTGDD RPEMGFGASL ALSGDATTAV
VGAAGAAFVF E
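
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling, of how one might strip the grouping, translate the nucleotide sequence, and compute GC content. The helper names `clean`, `translate`, and `gc_percent` are assumptions made for this example. The codon table is the standard one; translation table 11 uses the same amino acid assignments and differs only in which codons may act as starts, which this sketch does not model.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; '*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence shown above.
first_line = "ATGGACATCT ACCGCCGGCG GGCCCTGGCA CTCCTCGGCT CGGCGAGTAT CACGGCAGTT"
last_line = "GTCGGTGCTG CTGGCGCAGC ATTCGTCTTC GAGTGA"

print(translate(first_line))  # MDIYRRRALALLGSASITAV
print(translate(last_line))   # VGAAGAAFVFE
```

The two outputs match the first 20 and last 11 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` is the natural next step for comparing against the Protein Length and GC content fields.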