Gene Htur_4474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4474 
Symbol 
ID8745103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp64016 
End bp65926 
Gene Length1911 bp 
Protein Length636 aa 
Translation table11 
GC content66% 
IMG OID646515011 
Productprotein of unknown function DUF1680 
Protein accessionYP_003405958 
Protein GI284172576 
COG category[S] Function unknown 
COG ID[COG3533] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0254713 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCCAC GAGATGCGGT TACGGACGGG ATTTCGCTCT CCGAAGTCGC GATCGACGAC 
GGATTTTGGT CGCCGCGACG GGAGCGAAAC AGGGACGTGA CGATCGAGTA CCAGTACGAA
CAACTCGAGG AGTCGGGCTC CCTCGAGAAC TTCCGGCGCG TCGCCGCCGG GGAGTCCGGC
GACTTTCAGG GGATGTGGTT TCAGGACACG GACGCGTACA AGTGGATCGA AGCCGCCTCG
TACGTGCTCG CCCAGCGCGA CGATCCGGAG CTGGAAGCGA AGGTCGACGG GGTCATCTCG
CTGATCGCCG ACGCACAGCA GCCCGACGGC TATCTGAACA CGTACTTCTC GCTGGTCGAA
CCCGAGAACC GGTGGACGAA CCTCCACATG ATGCACGAAC TCTACTGCGC CGGGCATCTG
ATTGAGGCCG CGGTCGCCCA CTACCGTGCA ACGGAGAAGG AGACGCTGCT CGAGGTGGCG
GTCGACTTCG CAGATCTTGT CGACGACGTC TTCGGCGACG AGGTCGAGGG CGTTCCCGGC
CACGAGGAGA TCGAACTGGC GCTCCTGAAG CTCTACCGGG TCACCGACGA GACGCGGTAT
CTCGAGCTCG CGAAGTACTT CATCGACCTC CGCGGGAAGG ACGACCGACT GGCGTGGGAG
ATCGACAATC CGGAGACGCT CGGCGGCGGC GAGTACGAGG ACGGTTCTAT CATTCCTGCC
GCACGGGACG TCTTCACCCA CGAGGACGGA ACGTACGACG GGCGGTACGC CCAGGCCCAC
GAGCCGCTCC GGGACCAGGA GACCGTCGAA GGCCACTCCG TCAGGGCGAT GTACCTGTTC
GCGGCCGCGA CGGACCTCGC CATCGAGACG GGCGAAGACG AGTTGATCGA GTCCCTCGAG
CGCCTCTGGA CGAACATGAC GACAAAGCGG ATGTACGTCA CCGGCGGGCT CGGTCCCGAG
GAAGCCCACG AGGGGTTCAC GACGGACTAC GACCTCAGGA ACGACGCCTA CGCCGAGACC
TGCGCCGCGA TCGGGAGCGT CTACTGGAAC CAGCGGCTGT TCGAACTCTC CGGCGAGGCG
AAGTACGCCG ACCTCATCGA GCGGACGCTG TACAACGGCT TCCTCGCCGG CGTCTCGCTG
GACGGGACCG AGTTCTTCTA CGAGAATCCC CTCGAGAGCG ACGGCGACCA CCACCGAAAG
GGTTGGTTCA CCTGCGCGTG CTGTCCGCCG AACGCGGCCC GGCTGCTCGC GTCGCTGGGC
GAATACGTCT ACAGCCAGCG GGACTCGGCG ATCTACGTCA ATCAGTACCT CGGCAGTAGC
GTCACGACGG CGGTCGACGG CGCGACCGTC GAGCTATCGC AGGACAGCTC CCTCCCGTGG
TCGGGCGAGG TGACCGTCGA CGTCGACGCC GACGGGGCGT CGGTGCCGCT CCGACTCCGC
ATCCCGGAGT GGGCCGAGTC GTCGACGGTG ACGGTAAACG GCGAGTCGGT CGAGACGCCG
TCCGAGGGCT ACCTCGAGAT CGAACGGGTG TGGGACGACG ACCGGATCGA GCTGACCTTC
GAGCAGACGG TCACGCGGCT GGAAGCCCAT CCTGACGTGG CGGCCGACGC CGGTCGCGTC
GCCCTCAAAC GCGGGCCGCT CGTCTACTGC CTCGAGGCGA TCGACAACGA CCGGCCGCTC
CACCAGTACG AGGATCCGTC GCCCACCTCG ACGACGCACC GACCGGACCT ACTCGAGGGC
GTCACGGTCA TCGAGGGCGA AGCGAGCGTG CCGGACCGGG CGGGTTGGGA CGGCCGACTG
TACCGGCCGG CCGACGAAAC GGCTCGAGAA CGAACCGAGT TCACGGCGGT CCCGTACTAC
GCGTGGGACA ACCGCGAGCC GGGAGCGATG AGAGTCTGGA TTCGGTCGTA G
 
Protein sequence
MGPRDAVTDG ISLSEVAIDD GFWSPRRERN RDVTIEYQYE QLEESGSLEN FRRVAAGESG 
DFQGMWFQDT DAYKWIEAAS YVLAQRDDPE LEAKVDGVIS LIADAQQPDG YLNTYFSLVE
PENRWTNLHM MHELYCAGHL IEAAVAHYRA TEKETLLEVA VDFADLVDDV FGDEVEGVPG
HEEIELALLK LYRVTDETRY LELAKYFIDL RGKDDRLAWE IDNPETLGGG EYEDGSIIPA
ARDVFTHEDG TYDGRYAQAH EPLRDQETVE GHSVRAMYLF AAATDLAIET GEDELIESLE
RLWTNMTTKR MYVTGGLGPE EAHEGFTTDY DLRNDAYAET CAAIGSVYWN QRLFELSGEA
KYADLIERTL YNGFLAGVSL DGTEFFYENP LESDGDHHRK GWFTCACCPP NAARLLASLG
EYVYSQRDSA IYVNQYLGSS VTTAVDGATV ELSQDSSLPW SGEVTVDVDA DGASVPLRLR
IPEWAESSTV TVNGESVETP SEGYLEIERV WDDDRIELTF EQTVTRLEAH PDVAADAGRV
ALKRGPLVYC LEAIDNDRPL HQYEDPSPTS TTHRPDLLEG VTVIEGEASV PDRAGWDGRL
YRPADETARE RTEFTAVPYY AWDNREPGAM RVWIRS