Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_2853 |
Symbol | |
ID | 8743470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 2923842 |
End bp | 2926013 |
Gene Length | 2172 bp |
Protein Length | 723 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646513438 |
Product | Fibronectin-binding A domain protein |
Protein accession | YP_003404395 |
Protein GI | 284166116 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCAA AGCGGGAGCT TACTAGCGTC GACCTCGCCG CCCTCGTTGG GGAACTCGGT GCCTACGAGG GAGCGAAGGT CGACAAGGCC TACCTCTACG GCGACGATCT CGTCCGGCTC AAGATGCGGG ACTTCGATCG GGGCCGCGTC GAACTCCTCC TCGAGGTCGG CGAGACCAAG CGGGCCCACA CGGTCGCCCC CGAGCGGGTG CCCGACGCGC CCGGCCGACC GCCGCAGTTC GCGATGATGC TCCGCAATCG ACTCTCCGGG GCCGATTTCG CCGGCGTCGA ACAGTACGAG TTCGACCGCA TCCTCGAGTT CGTCTTCGAG CGCGAGGACG GCACCACGCG GATCATCGTC GAACTGTTCG GTCAGGGGAA CGTCGCCGTC ACCGACGGCG AGTACGAGGT GATCGACTGC CTCGAGACCG TCCGCCTCAA GTCCCGAACC GTCGTGCCGG GCTCGCGCTA CGAGTTCCCC GACAGTCGGA CGAACCCGCT GACGGTCTCC CGGGAGGCGT TCGACCGCGA GATGGAAGAC TCCGACACGG ACGTCGTCCG GACGCTGGCG ACCCAGCTCA ACTTCGGCGG CCTCTACGCC GAGGAGATCT GTACCCGCGC CGGCGTCGAG AAGGCGATGG ACATCGCCGA GGCCGACGAG GACGTCTACG ATCGGATCTA CGGCGCCATC GAACGACTCG CGCTGGACCT GCGCAACGGG AACTTCGATC CGCGGCTGTA CGTCGCGGAC GACGACGGCG ACGAAGACGA GAGCGAATCG GGCGACGAAA ACGGCGACGA CTCGAGTTCC GACCGGGTCG TCGACGCGAC GCCGTTCCCG CTCGAAGAGC ACGTCGAACT GGCCTCGGAG CCGTACGACT CCTTCCTCGC GGCGCTGGAC GACTACTTCT ACCGGCTCGA ACTCGCCGAC GACGAGGAGG AAACCGATCC GACCACGCAA CGACCCGACT TCGAGGAGGA GATCGCCAAG TACGAGCGGA TCATCGAGCA ACAGCGGGGC GCGATCGAGG GGTTCGAGCA GGAGGCCGAC GCCCTCCGCG AGCAGGCCGA ACTGCTGTAC GCCGAGTACG GGCTGGTCGA CGACATCCTC TCGACGGTTC AGGAGGCCCG CGCCCAGGAT CGACCCTGGG ACGAGATCGA GGAGCGCTTC GCCGAAGGAG CAGACCGCGG CATCGCGGCC GCCGAAGCCG TCGTCAACGT CGACGGCAGC GAGGGGACCG TCACCGTCGA ACTCGACGGC GAGCGCATCG ACCTCGTGGC CAAGCAGGGC GTCGAACAGA ACGCCGACCG CCTCTACACC GAGGCCAAGC GCGTCGGGGA GAAAAAGGAG GGCGCGCTGG CGGCCATCGA GGACACCCGC GAGGACCTCG GGGAAGCCAA GGCCCGCCGA GACCGGTGGG AGGAAGCGGA CGCCGCCGAC GAGGGCGAGG ACGATGAGGA CGACGAGGGC GAGGAGCGCG ACTGGCTGTC GGAACCCTCC GTTCCGATCC GCGAGAACGA GCCGTGGTTC GACCGCTTCC GCTGGTTCCA CACCAGCGAC GGCTACCTCG TGATCGGGGG GCGCAACGCC GACCAGAACG AGGAGTTAGT GAAAAAGTAC CTCGAGCCCG GCGACAAGGT CCTCCACACG CAGGCCCACG GCGGCCCCGT CACCGTGCTC AAGGCGACTG ACCCCAGCGA GGCCTCCTCG TCGGACATCG AGTTACCCGA CTCGAGCATC GAGGAGGCCG CGCAGTTCGC GGTCTCCTAC TCGTCGGTCT GGAAGGACGG CCGCTACGCC GGCGACGTCT ACGCCGTCGA CTCCGATCAG GTCACCAAGA CCCCCGAGAG CGGCGAGTAC CTCGAGAAGG GCGGGTTCGC GATCCGCGGC GACCGCACCT ACTACCGGGA CACGCCGGTC GACGTCGCGG TCGGCATCCA GTGTGCGCCC TACACGCGCG TGATCGGCGG TCCGCCGTCG GCCATCGAGG GGCAGGCGGT GACGACGATC GAAATCGAGC CGGGACGGTA CGCACAGGCC GACGCGGCCA AACGGCTCTA CCGCCGGTTC CGCGAGCGCT TCGAGGACGA GTCGTTCGTC CGGAAGATCG CCAGTCCGGA CCGCATCCAA CACTTCATGC CGCCGGGCGG GAGTCGAATC AGCGAGGAGT GA
|
Protein sequence | MDPKRELTSV DLAALVGELG AYEGAKVDKA YLYGDDLVRL KMRDFDRGRV ELLLEVGETK RAHTVAPERV PDAPGRPPQF AMMLRNRLSG ADFAGVEQYE FDRILEFVFE REDGTTRIIV ELFGQGNVAV TDGEYEVIDC LETVRLKSRT VVPGSRYEFP DSRTNPLTVS REAFDREMED SDTDVVRTLA TQLNFGGLYA EEICTRAGVE KAMDIAEADE DVYDRIYGAI ERLALDLRNG NFDPRLYVAD DDGDEDESES GDENGDDSSS DRVVDATPFP LEEHVELASE PYDSFLAALD DYFYRLELAD DEEETDPTTQ RPDFEEEIAK YERIIEQQRG AIEGFEQEAD ALREQAELLY AEYGLVDDIL STVQEARAQD RPWDEIEERF AEGADRGIAA AEAVVNVDGS EGTVTVELDG ERIDLVAKQG VEQNADRLYT EAKRVGEKKE GALAAIEDTR EDLGEAKARR DRWEEADAAD EGEDDEDDEG EERDWLSEPS VPIRENEPWF DRFRWFHTSD GYLVIGGRNA DQNEELVKKY LEPGDKVLHT QAHGGPVTVL KATDPSEASS SDIELPDSSI EEAAQFAVSY SSVWKDGRYA GDVYAVDSDQ VTKTPESGEY LEKGGFAIRG DRTYYRDTPV DVAVGIQCAP YTRVIGGPPS AIEGQAVTTI EIEPGRYAQA DAAKRLYRRF RERFEDESFV RKIASPDRIQ HFMPPGGSRI SEE
|
| |