Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_5120 |
Symbol | |
ID | 8745668 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013747 |
Strand | - |
Start bp | 18114 |
End bp | 20060 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646515477 |
Product | type II secretion system protein E |
Protein accession | YP_003406424 |
Protein GI | 284176147 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0155589 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGGAG ACGGAGCCGC GGGCGGCCTT CGATCGGTTA TCGACGGATT CGGCGTCGAT CGCGTGTTCG ACAGCCTCGA GCGATTCGGC GACGCCGAGC CGCCGGACGC GTGTTCGTGT GGCGTCGCCA CAACGGGCGA GACCCTCGTC CTCGACGCCG GCGACTGCGA CGGCGACCTC GCGACCGCGC TCGCGTGTCG CCGGACGGCC GTCGACGCGC TGACCGACCG CGATGCCCGC CAGATCGTCG TTCGAACGAA CGGCCTCGAG TACCAGTATC GCGGCCGCGG CGTCGAACTG CTCGCCGCGG CCGGGCAGTT CGTCGACCGA CTGGGAGACC GGAACGAAGC GTTGCTCGCG ACTGCACGGC GGGATCCGCT CGCGATCGCC GACGAACTCG GACCGCGAAC CGGCGTCGAT ACCGACATCG TTCGCGAATC GACGCTCGTC GATGTCGCAC GAGGGATCGA GAACTACGAT GCAGTGCTGT CCCCGACGGT CGGACTCACG ATCGGCCACT ACCGGATCGA TCCGACGATC CCGGACGACA CGCGGTTGCG GGACGGTCGG TCGCTCGAGA CGGGCAGCGA CGTGCGGATC TACGACCGAC CCGGCGGCGT CACGGAGTAC GCGCTCGACG TCGTCGATCT GACCCTTTCG GCGACCGAAC GGTCGCGTCT CCTCGAGGGC TACGAGGCCG TCGCCGAGGG CGTGGTCGAC GGCGAACGCG CGGCCCCGCG GGCGATCGAA CACGTCACCG ACGGTCCGGC CGATTCACTG GAGATGGACA TTCTGACGAA GCATACGAGC GGGTACGGAA TTCTCGAGGA CCTGTTCGCC GATCCCCGGC TTTCGGACGT CTACGTGACG TCGCCCGTGG ATCGAAACCC GCTCCGCGTC GTCGTCGACG GCGAGTCGAT GGCGACGAAC GTCTATCTGA CGTCCGAGGG CGCACGCGCG CTCGCGTCCC GGGTGCGACG GACGAGCGGC CGGGCCTTCT CGCGTGCGAA TCCGACGGTC GACGCGACGG CGGTCCTCGA GAACGGCACC GGCGTCCGCG TCGCCGGCGT CACTGATCCC GTCGCTGACG GCGTCGCCTT CGCGTTTCGG GAGCGAACGG ACGATCGGTT CACGCTCCCC GAACTCGTCG CGAACGGGAC GGTGCCGGCG GAAGCAGCGG CGTTCCTCTC GATCGCCGTC GAGCGAAACG TCGCCGGCCT GATCGCCGGC ACCCGCGGCG CGGGAAAGAC GACGCTGCTC GGGACCCTGT TGTACGAACT GCACCCCGAG ACGCGAACGG TCCTGCTCGA GGACACGCCC GAACTCCCGG TCGCCGCGCT CCAGTCGGTC GGGCGAAACG TGCAGGCGCT GCGTACCGGC AGCGAGGATG GGCCGGAGAT CACCCCGACA GAGGCACTCA GAACCGCGCT TCGACTTGGC GACGGCGCGC TCGTGGTCGG CGAGATCAGG GGCGAGGAGG CCCGCGTGCT CTACGAGGCG ATGCGAGTCG GTGCCAACGC CAACGCCGTG TTGGGAACGA TCCACGGCGA CGGGGCCAGC GAGGTCTACG AACGCGTCGT CTCCGACCTC GGCGTCGCGC CCTCCTCGTT CGGCGCGACG GATCTAATCG TCACCGTTCA GTCCCGGCGC ACGGCCGAGG GCCGACGACG GCGGCTCGCC CGCATCGAAG AGGTCATCAG TGACGGCGCC GATCACTGGT TCGAACCGCT GTACGAACTC GAGGACGGAG TCGCCACGCC GACCGGCCGG ATCGATCGCG GCGAGAGCCG ACTCGTCGAT CGACTCGCCG GACCGGACGA GACCTACGCC GACGTTCGCC GTGCGCTCGA GACGCGAACG AACCAGCTGG CGGCGCTCGC CGCCGACGGG CGGACGAGCC CGCGCGAGGT CGCCACGGCC TGCGCCGAAC GGCAGTACGA CGGGTGA
|
Protein sequence | MTGDGAAGGL RSVIDGFGVD RVFDSLERFG DAEPPDACSC GVATTGETLV LDAGDCDGDL ATALACRRTA VDALTDRDAR QIVVRTNGLE YQYRGRGVEL LAAAGQFVDR LGDRNEALLA TARRDPLAIA DELGPRTGVD TDIVRESTLV DVARGIENYD AVLSPTVGLT IGHYRIDPTI PDDTRLRDGR SLETGSDVRI YDRPGGVTEY ALDVVDLTLS ATERSRLLEG YEAVAEGVVD GERAAPRAIE HVTDGPADSL EMDILTKHTS GYGILEDLFA DPRLSDVYVT SPVDRNPLRV VVDGESMATN VYLTSEGARA LASRVRRTSG RAFSRANPTV DATAVLENGT GVRVAGVTDP VADGVAFAFR ERTDDRFTLP ELVANGTVPA EAAAFLSIAV ERNVAGLIAG TRGAGKTTLL GTLLYELHPE TRTVLLEDTP ELPVAALQSV GRNVQALRTG SEDGPEITPT EALRTALRLG DGALVVGEIR GEEARVLYEA MRVGANANAV LGTIHGDGAS EVYERVVSDL GVAPSSFGAT DLIVTVQSRR TAEGRRRRLA RIEEVISDGA DHWFEPLYEL EDGVATPTGR IDRGESRLVD RLAGPDETYA DVRRALETRT NQLAALAADG RTSPREVATA CAERQYDG
|
| |