Gene Htur_3808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3808 
Symbol 
ID8744436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp30207 
End bp31511 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content63% 
IMG OID646514396 
ProductExtracellular ligand-binding receptor 
Protein accessionYP_003405343 
Protein GI284167065 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACAGA CCGTCATGGT AGATAGTGGC AAAAGAGGCT CGAGAACCGC TGGGAGACGG 
CCCGATATCG ACCGACGATC GTTCCTCACG GCGTCCGGCG CGAGTGCATT GGCAGCAACG
GCGGGTTGCC TCGGTGGCGG CGGCGGAAGT AGCGACGGTA TCACGATCGG CGGCCTCTTC
GGACTGCCCG GCGATCACCC GGCCGGGACC GGGATGAAGC AGACGACCGA AGTGCTAGTC
GAGAATCTCA ACGCAGACGG TGGACTCCTC GGTGAAGAAG TCGAACTGGT CACTCGGGAC
ACCGAGTTCG ATCCGGCGAC CGCTCGCGAC GGCTACCGGG AACTGATTCT CGACGAGAGC
GTCGACGTAA CGACGGGAAT CTACTCGACC GAGGTCGGAA CGGCGATCTT CGACGAGGTA
CCCGAGTTCG AAACCATCCA CCTAACCGGT GGCTCCGCGC CGGATCGTCC GATGGAGAAC
TACGACGACT ACAAGTACTG GTTCCGCGGC ATTCACGGGG GGTACCTGGG TCGGGCGACG
GCGAACTACG CCGCGTCACA TTTCGAGGAT CTGGGTATTA CGGAGGTCGG CATCGCGGCC
GAGGACGTCG ACGGATTCGA TCCGGTCCTC GAGGAATTCG CTGCCGGCCT TCCGAGTTCC
GTCAACGTCC ACTTCGAAGA GCGGTTCTCG TCGGACACGA GCGATTTCAG TCCGATCCTC
GACCAGGGCG AACAGAACGA CATCCAGATG ATGGTCGGGT TCGTCTCCCA GGGCGGCTCC
GCCCTCATGA GTCAGTGGGC CCAGCGCCAG CCGAACTACA TGTTCGGCGG CGGGGACGTC
TTCTCCTCGA ATCCGGACCG GTGGGAGAAC ACGAGCGGCG AGGTCGAATA CGTCTGGTCG
TACATCGGCG GTGCCGCTCC GGGGCTCGAG GTCACTGAGA CGACCAGCCA GCTCATCGAA
GACCACAGAT CGATGTTCGA CGGGGCGGCC CCGCCTCACG CCCAGTCGTA CACGCAGTAC
GACGCGATTC TGACGTGGGC CGAAGCGGTG AAGGAGGCGG AGACGACGGA CATTGACGAG
GTCGTTTCGA CCATCGAAGA GATGACGCTC GAGGGATCGA CCGGCACGCT CGACTGGTAC
GGCGAGGACG GTGAACTCCC GCACACGCCT CAGTTCGGCG AGGAATACGT CCATCCGCCA
GTCATGCAGT GGCAGGAAGT CGATGGCGAG GGTCGGCAGA TCGGGCTCTA CCCCGATACC
GTTCGGAGCG GCGAGTTGCA GATCCCGCCG TGGGTAAGTC TGTAG
 
Protein sequence
MLQTVMVDSG KRGSRTAGRR PDIDRRSFLT ASGASALAAT AGCLGGGGGS SDGITIGGLF 
GLPGDHPAGT GMKQTTEVLV ENLNADGGLL GEEVELVTRD TEFDPATARD GYRELILDES
VDVTTGIYST EVGTAIFDEV PEFETIHLTG GSAPDRPMEN YDDYKYWFRG IHGGYLGRAT
ANYAASHFED LGITEVGIAA EDVDGFDPVL EEFAAGLPSS VNVHFEERFS SDTSDFSPIL
DQGEQNDIQM MVGFVSQGGS ALMSQWAQRQ PNYMFGGGDV FSSNPDRWEN TSGEVEYVWS
YIGGAAPGLE VTETTSQLIE DHRSMFDGAA PPHAQSYTQY DAILTWAEAV KEAETTDIDE
VVSTIEEMTL EGSTGTLDWY GEDGELPHTP QFGEEYVHPP VMQWQEVDGE GRQIGLYPDT
VRSGELQIPP WVSL