Gene Htur_5022 details

Gene Information

Locus tag: Htur_5022
Symbol:
ID: 8745828
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013748
Strand:
Start bp: 13327
End bp: 14592
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 53%
IMG OID: 646515636
Product: hypothetical protein
Protein accession: YP_003406583
Protein GI: 284176307
COG category:
COG ID:
TIGRFAM ID:
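
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive (which is consistent with the listed gene length) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; the names are illustrative.
start_bp = 13327
end_bp = 14592
gene_length_bp = 1266
protein_length_aa = 421

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons. The last codon is assumed
# to be the stop codon, which does not contribute a residue.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1266 0 421
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.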


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value: 0.00965074
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCACTCG GAATGCCGGC ATGGTTAGGC GGCTGGGGCG ACGAGTCCGA CACATTGGAA 
GATCTTGGTT TCTTCGACAC GCATTACGTC GCCAATAGTG GAAAGAAACT CAAAATCGAC
CCCAGCGTTC TAGACTTTGA CGAAAATCTG GAAGCGAGAC TTTCATCTGA GGGTGTGACG
ATTTCAGGCT CACGAACACC GACTTTAGAC GAAATCTCGG GCTCTATCGC TGGTGGTGGG
ACGATTTCCA ACCTCACGGG GACGAATTTG TATATCGACG ATACCGGGAC GCTCAATGCC
ACTGCAAGTA GCGACTCAAC CGACACGACT CTCAATACAA TCCATCTATC GGATTCAGAC
GACATTCACA ACGTCATCGA CTCTGCAGCC CCCTACACGC GGATTATCGG CGACAGAGAC
AACCAACATA CACTCTCAAA GCGGATTGAC ATCACAACTG ACGGCCTGAT TCTACAGGAC
TGCAACCTCA AACTCGGGGC AAGCGTCAAC GACGACGTCA TCTACGTCCA CGACTGCAAG
GACGCGAAGG TCCTCAACTG TTTCATCGAC GGAAACTACC AGAACCAAGA CTACACCAAC
AACGGCGTCA GTAACGGGGT TGAGGTCTCG AACGCGCATA ATATCGAGGT CGGAGACAAC
GAAGTCGTCC GCGCTGCTGG ACAGGGTATC ACGGCCACTT CGTACCCGCT CGCGCAAAAC
AATGATTACG GTGGCGACAA GCCGGGTGGC CCGATCTCAA ACATCTATAT CGAGGATAAC
GAACTCTCGG AGATTCAGAA CGGTGATATC CTTCTCTCCG GCGGCAACGG AGTTGCCGCT
GAGTACGGTT ATATCACCGG GAACGTTTGT ACGTCGACCC AGCAGGATAT TCTGAACGTC
ATTGACGGCT TCCAGCACGC GAAAGTCGAG GACAACTACT GTATCGGTGG CGGTGTCGGG
CTCGCCATCG AACAGCACGG GAGCCGTGGC GTTGACCGGA AGGTCCACGA TGTGACTGTC
CGGAACAATA CGTTCGAGGT GTCGGGCGCG AACGGCATCG AGTTCGACCA CGACACGTAC
CCGTTCCGAA ACATCAAGCT CAACGATAAC ACGTTCATTG GGAACAACAC TGGCGTGTAC
GTCCCCTCGA GTTTCGACCT TGATGGCTTC ATGGTTCGGA ACAATACGTT CGAGAGTTGC
AGCACGGACA TCAGCATCAA CTCCACAATC TCGAACCAGT CTGTAGGCGA CAACCTGACA
TGGTGA
 
Protein sequence
MSLGMPAWLG GWGDESDTLE DLGFFDTHYV ANSGKKLKID PSVLDFDENL EARLSSEGVT 
ISGSRTPTLD EISGSIAGGG TISNLTGTNL YIDDTGTLNA TASSDSTDTT LNTIHLSDSD
DIHNVIDSAA PYTRIIGDRD NQHTLSKRID ITTDGLILQD CNLKLGASVN DDVIYVHDCK
DAKVLNCFID GNYQNQDYTN NGVSNGVEVS NAHNIEVGDN EVVRAAGQGI TATSYPLAQN
NDYGGDKPGG PISNIYIEDN ELSEIQNGDI LLSGGNGVAA EYGYITGNVC TSTQQDILNV
IDGFQHAKVE DNYCIGGGVG LAIEQHGSRG VDRKVHDVTV RNNTFEVSGA NGIEFDHDTY
PFRNIKLNDN TFIGNNTGVY VPSSFDLDGF MVRNNTFESC STDISINSTI SNQSVGDNLT
W
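
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is not the tool used to produce the record. It is a minimal example, assuming the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons; this gene begins with ATG, so the sketch does not handle alternative starts). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: sufficient for table 11 when the gene starts with ATG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence as displayed in the record.
first_line = "ATGTCACTCG GAATGCCGGC ATGGTTAGGC GGCTGGGGCG ACGAGTCCGA CACATTGGAA"
last_line = "TGGTGA"

print(translate(first_line))  # prints: MSLGMPAWLGGWGDESDTLE
print(translate(last_line))   # prints: W
```

The first 20 residues and the final residue agree with the protein sequence shown above. Passing the full gene sequence to `gc_percent` would give a value to compare against the listed GC content.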