Gene Htur_4440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4440 
Symbol 
ID8745069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013745 
Strand
Start bp20789 
End bp22489 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content67% 
IMG OID646514977 
ProductRicin B lectin 
Protein accessionYP_003405924 
Protein GI284172542 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0151299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACAGA CACGACGAAC CTACCTGAAA GGAACGGCGG CATCGGCACT GATCGGAATC 
GGCGCGCTTA GCGGCCTCTC CGGGTCGGCG GCGGCGGAAT CGAACTTCGA CCTCGAGGCC
GGCTTCGCGG ACACGTCGTG GCTCGACGAC GACGTCGACG TCCACACGAT CACCGAACCG
ACGCGGAGCG CGGTCGAATC GGCGTTCAGC GCCAGCGGAG CGCGCGTGGT CGTCTTCGAG
ACGAGCGGAA CTATCGACCT CGGTGGAAAC GATCTGGCGA TCACCGAAGA CTACTGCTGG
GTGGCCGGCC AGACCGCGCC GTCGCCCGGT ATCACGTTCA TCAACGGACA GGTCCGGATC
AGCGCGAACA ACTGCGTCGT CCAGCACATC CGCTCGCGAA TCGGCCCCGG TTCCGACGGC
TCGATCCAGA GCAACGACGC GTTCAACACC GCCGACGGTA CCCAGAACAA CGTCGTCGAT
CACGTCAGCG CCTCGTGGGG CACCGATGAG TGCCTCTCCG TCGGCTACGA CACGCAGGAT
ACGACGGTAA CCAACTGTCT CATTTACGAG GGGCTGTACG ACCCCTACGG CAACGAGGCG
GACCACAACT ACGGGAGCCT GATCGGCGAC GGCGCCTCGA ACGTCACCCT CGCGGGCAAC
GTCTGGGGGA AGGTCCGCGG TCGCGCGCCG CGACTCAAGA GCGACACCGA GACCGTCGTC
GTCAACAACC TCCTGTACTT CTTCGACGAG TCGGCCAACG CCGACGACTC TGCGGTCACG
AGCTTCGTCG GTAACGCGGC GATCTGTGCG GACGACGATG ACGCCATTCT CGAGGGCAGT
CCGACCGCGT ACCACGCCGA CAACATTGCG TACGATCCGC CGATGGTCGA CGAGCAGCCG
ATCGCCGAAC CGGAGTCGAC GAGTTCGCCG CCGCTGTGGC CGAGCGGCCT CAGCGAGATG
CCGTCGGGTG ACGTCGAGAG CCACAACCTC ACCAACGCCG GGGCGCGGCC GGCCGATCGA
ACGCAAAACG ACGCGCGAAT CGTCCAGGAG ATCGCCGACC GCGCCGGGCT CGACTACCTC
GACTCGCCGT ACGACTACTG GGTCGGCCAC CACGACGAGG TCGGCGGCTA TCCGGAGCTC
CCCGTGAACA CCCACTCGCT CGAGGTCCCC GACAGCGGTA TCCGCGACTG GCTCGCCGGC
TGGGCCCAGG CCGTCGAGGA GGGCAGTTCG CCGCCCGACG GCGGTAGCGG CGACGACGGG
AGCAGCGGTC CGATCCCGAC GGGCACCTAC GAGATCGCCA ACGTCAACAG CGGGCAGCTG
CTCGAGGTGG CCGACGCGTC CACCGCGGAC GGCGCCAACG TCCAGCAGTG GTCCGCGACC
GATCACGCCA CGCAGCAGTG GTACGTCGAG GATACCGGGA ACGGCGAGTA CGTCCTCCAG
AACGCGAACA GCGGGCTGTT GCTCGAGGTC GCCGACGGCT CCACCGAGGA CGGCGCGAAC
GTCCAGCAGC ACGCGGACAC GGGTTGCGAC TGCCAGCGGT GGTCCATCAA CGACGTGGGC
AACGGAGAGT ACATCCTCGA GGCGGTCCAC AGCGGAAAGG TAGCCGACGT CGAGGGAGCG
TCGACCAGCG ACGGGGCGAA CGTACTCCAG TGGCCCGACA CCGGCGGCGC GAACCAGCGC
TGGACGTTCG ACTCGGTGTA G
 
Protein sequence
MKQTRRTYLK GTAASALIGI GALSGLSGSA AAESNFDLEA GFADTSWLDD DVDVHTITEP 
TRSAVESAFS ASGARVVVFE TSGTIDLGGN DLAITEDYCW VAGQTAPSPG ITFINGQVRI
SANNCVVQHI RSRIGPGSDG SIQSNDAFNT ADGTQNNVVD HVSASWGTDE CLSVGYDTQD
TTVTNCLIYE GLYDPYGNEA DHNYGSLIGD GASNVTLAGN VWGKVRGRAP RLKSDTETVV
VNNLLYFFDE SANADDSAVT SFVGNAAICA DDDDAILEGS PTAYHADNIA YDPPMVDEQP
IAEPESTSSP PLWPSGLSEM PSGDVESHNL TNAGARPADR TQNDARIVQE IADRAGLDYL
DSPYDYWVGH HDEVGGYPEL PVNTHSLEVP DSGIRDWLAG WAQAVEEGSS PPDGGSGDDG
SSGPIPTGTY EIANVNSGQL LEVADASTAD GANVQQWSAT DHATQQWYVE DTGNGEYVLQ
NANSGLLLEV ADGSTEDGAN VQQHADTGCD CQRWSINDVG NGEYILEAVH SGKVADVEGA
STSDGANVLQ WPDTGGANQR WTFDSV