Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_0566 |
Symbol | |
ID | 8741148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | + |
Start bp | 595671 |
End bp | 597965 |
Gene Length | 2295 bp |
Protein Length | 764 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646511145 |
Product | amino acid permease-associated region |
Protein accession | YP_003402137 |
Protein GI | 284163858 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGACG AAGAACTCGC CAAGGACCTC GGACTGCTCT CGGCGCTGGC GATCGGGATG GGGACGATGA TCGGCGCCGG CATCTTCGTG TTGCCCGGCG TCGCGGCCCA GGAGGCCGGG CCGATCGTCG TCGTCTCGTT CGTGGTCGGC GGCATGATCG CGATGGTCAA CGCCTTGGCG GTGAGCGAAC TCGGGACGGC GATGCCGAAG GCCGGCGGCG GCTACTACTA CATCAACCGC GGACTCGGGC CGCTGTTCGG CTCGATATCC GGAATGGGTG ACTGGATGGG GCTGGCCTTC GCCTCGGCGT TTTACTGCAT CGGGTTCGGC GGCTACCTCA CCGATCTGCT GGCCGGAACG ATGCTCGCCC TGCCGACCCT TGAGCTCGGT CTCGTTGCCC TGTCGGACAT CCAGCTCGGC GCGCTGATCG CCGGCCTGCT GTTCGTCGGC GTCAACTACA TGGGCGCCAA GGAGACCGGG GGCGTCCAGA CAGTGATCGT TACGGTCCTG CTGGGGATCC TCACCGTCTT CGCTGCCTCC GGCTTCTTCC ATTTCGATTG GGCGACGCTC ACGACAGACG GACTCGCGCC GACCGATCGG GGGTACGGTG CGATCCTGCC GGGGACCGCC CTCGTCTTCG TCTCCTTCCT CGGCTACGCG AAGATCGCGA CGGTCGCCGA GGAACTGCAA AACCCCGGCC GAAACCTCCC GATCGCGGTC ATCGGCAGCG TCGCCGTCGT GACGGCGATC TACGCGATCC TCGTCGCCAC GATGGTCGGC ATCGTTCCGT GGCACTCGCT GGACGACAGC GTCCCCGTCT CCCAGGTCGC CGAGATCACC TTCGCCGGGG TCCCCGTCCT CGACGCCGTC GGCGTGACGC TGATCTCGCT GGCCGCAATG CTGGCGACCG CCTCGAGCGC CAACGCCTCG ATTCTCTCGT CGGCTCGAAT CAACTTCGCG ATGGGCCGCG ATAAGATCGT CACGGACAAA CTCAACGAGA TCCACCCGCG GTACGCGACG CCGTACCGGT CGATCCTGCT TACCGGGGGC GTCATCATCG TCTTCATCGC CGCGCTGGGC CAGGACCTCG AGATCCTCGC GAAGGCCGCC AGCGTGCTCC ACCTGATCGT CTACGGGCTC ATGAACCTCG CGCTGATCGC CTTCCGCGAG GCGGACGTCC CCGAGTACGA CCCCGACTTC CGGGTCCCGT TCTACCCGGT GACGCCGATC CTCGGTGCGC TGCTCTCCTT CGGTCTCGTC GCCTTCATGG ACACGATCGA GATCGCGTTA AGCCTCGCGT TCGTCGCCGT CGCGGTGCTG TGGTACGCGT TCTACGCTCG CGCGAAGACG CCCCGACAGG GCGTCCTCGG CGAGTACATC CTCGATCGCT CCGAGGAGCT GCCCGACGTC GCGGTCTCGG CGGCCAGCGC TGCTCGACCG GACGGCTCGG GCCAGTATCG CGTCATGGTG CCGCTGGCCA ACCCGCGCAC CGAGCGCCAC CTGATCGAAC TGGCCAGCAC GCTCGCCGCC GAAAACGACG GCGTCGTCCA CGCGGTCCAC ATCGTGCAGG TCCCCGACCA GACGCCACTG GATCGCGGTG CCGAGCACAT CGAGCGGATC GACGCCGAGT CCGAAGCGCT ACTCGAGCTG GCCCGCGAAC ACGCCGAAGA CCGCGGCGCC GAGATCGAGA CGACGACGAT CGTCTCCCAC CGCTCGTTCG AGGAGGTGTT CGACGCGGCC CGCCAGCACG ACGTCGATCA GGTCGTGATG GGCTGGGGCG ACGATCGGCC GTGGTCCGCG GGGCGGGCCG AGCGCCCGCT CGACGAACTC GCCGCTGACC TGCCGTGTGA CTTCCTCGTG TTGAAGGATC GCGGCTACGA TCCGTCTCGA ATTCTCCTGC CGACCGCCGG CGGTCCAGAT TCGGATCTCA GCGCCGAGGT CGCGAAGACG CTGCGCTCGG CGACGGGATC GACGATTCAG TTGCTCCACG TCGTCGACGA CGAGCGCGAG CGCGAATCGG GCGAGCAGTT CCTCGCGAAC TGGGCGGCCG AACACGGGTT CGGCGACGCC GTGTTGACCG TCGACGACTC CGGCGATGTC GAGGGCGCGA TCGCCCGCGA GGCCGAAGAC AGGACGCTCG TCGTCATCGG CGCGACCGAG CGCGGTCTGC TCTCGCGGCT CGTCCGCGGC TCGCTCGTCT TCGACGTCGT CGACGAAGTC GACTGTTCGG TCCTGCTCGC CGAACGGCCG GCGGAGCGCT CGCTCCGGGA ACGGCTCTTC GGACGCTCAG AATAG
|
Protein sequence | MSDEELAKDL GLLSALAIGM GTMIGAGIFV LPGVAAQEAG PIVVVSFVVG GMIAMVNALA VSELGTAMPK AGGGYYYINR GLGPLFGSIS GMGDWMGLAF ASAFYCIGFG GYLTDLLAGT MLALPTLELG LVALSDIQLG ALIAGLLFVG VNYMGAKETG GVQTVIVTVL LGILTVFAAS GFFHFDWATL TTDGLAPTDR GYGAILPGTA LVFVSFLGYA KIATVAEELQ NPGRNLPIAV IGSVAVVTAI YAILVATMVG IVPWHSLDDS VPVSQVAEIT FAGVPVLDAV GVTLISLAAM LATASSANAS ILSSARINFA MGRDKIVTDK LNEIHPRYAT PYRSILLTGG VIIVFIAALG QDLEILAKAA SVLHLIVYGL MNLALIAFRE ADVPEYDPDF RVPFYPVTPI LGALLSFGLV AFMDTIEIAL SLAFVAVAVL WYAFYARAKT PRQGVLGEYI LDRSEELPDV AVSAASAARP DGSGQYRVMV PLANPRTERH LIELASTLAA ENDGVVHAVH IVQVPDQTPL DRGAEHIERI DAESEALLEL AREHAEDRGA EIETTTIVSH RSFEEVFDAA RQHDVDQVVM GWGDDRPWSA GRAERPLDEL AADLPCDFLV LKDRGYDPSR ILLPTAGGPD SDLSAEVAKT LRSATGSTIQ LLHVVDDERE RESGEQFLAN WAAEHGFGDA VLTVDDSGDV EGAIAREAED RTLVVIGATE RGLLSRLVRG SLVFDVVDEV DCSVLLAERP AERSLRERLF GRSE
|
| |