Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2415 |
Symbol | |
ID | 8384715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2481247 |
End bp | 2483163 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644973489 |
Product | conserved repeat domain protein |
Protein accession | YP_003131314 |
Protein GI | 257053481 |
COG category | [R] General function prediction only |
COG ID | [COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) |
TIGRFAM ID | [TIGR01451] conserved repeat domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.470241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGCCA CGGATGGATC GACTGGCGAA ACCGAGTTGA ACGCACTGGG CGTCGGACGC GCCCTCGGTG TGCTCGGTCT GCTCGCGGTC CTGGGCGGTC TCGTGACGAT CCTCCAGCCC TCGCTCGCGT CGGATACAGC ATTCGGGTAC GGCTTCGTGA CGCTGCTAGG CGTCGTCATT CTGGCGGCCA GTCTGCGATA CTGGGTCGGA CTCGCACTGA CGACGGTCGA GTCGGCGACG CCACCGTCAG TCGAGGAACG ACGCGAGATC TCCATTCCGG GTGTCGCGTT CGATCGAACG CTCGCCGACC ATCGACTCAC GGCAGTCTCA CGCCTGCGGG CCAGGCGGGA CGTCCTCGAC CGACTACGAT CGGTTGCAGC CCGAATCCAC GAACGGGACC ACGGTGGTTC GGACGGGCAA CCGGACGCCG ACGTCTTCGG TGAGGACGCG GAGACGGTGC TGTTTGACGA CGCCGACCGC GAGCAGGGGC TGACAGGCCG TCTGGCGGCC GTCCGGGCGA AGGGCACGCC GCTACAGCGC CGCGTCAGCG ATGTCGTCGC CGAACTTGCA CGCCAGTACG ACGGCGAGCG CGTCTCGGTC GAATTCGACG TGCCCGAGCG GACGACAGCG ACACGGCCAT CCGCTCCCGG CCTGCGGGCG ACCAATCGTT TTCGACTCCT CAAACCGATG GCGCTGTCGG CGGTCGGCGT GGGCGTCGTG TTCGGTTCGG CGGCGTTGCT TCTCGTCGGT GCGCTGTTCG GCGGTGTCGC CGTCTACGCT GCGGCCGATT CGCCGCCGAC GGCTGCCGTT CGCGTCACTC GATCAGTCGA GACGGACCAG CCGGCCCCGG GTGAGTCCGT CCGGGTGACC GTCTCGATCG AGAACACGGG GGATCGGCTG CTCCCCGATC TCACGGTGAT CGATCGGGTT CCGGAGGGGT GTTCGGTCAC GGCCGGGTCA CCACGGCACG GGACGTCGCT CCGGGCCGGC GCGACCGCGA CCTTCGGGTA CGAACTCGAG GCCGTCAGGG GTCACCACGA GTTCGGTGAC GCGGCTGTCA TCGTCCGGAA CCTCTCGGGG TCGTTCGAAC GTGTCGCGGA TGTGTCCGCC GACGGTGACG CGTCGATCAC GTACGATGTG ACCGCGGTCA CGGATCATTC TATCCCCCTG CGAAAACAGA CGTCTCGAAG CGTCGGCCGT GTGGTCACTG ACGTCGGTGG CAGCGGCATC GAACTCCACT CCGTCCGGGA GTACCGGACC GGCGATCCGC TCAACCGGAT CGACTGGAAC CGGGCGGCCG CCGGCGAGGA TCTGGCGACG CTGCAGTTTC GCGAGGAACG ATCGGCGACG GTCGTCCTGC TGGTCGATGC CCGGGTGGAA GCATACGTCG CCCCCGAGGC GGACGCACCG TCGGCCGTCG ATCGTAGCGT CCTCGCGGCG GCCGAAAGCG CCTCGGCGCT GCTGGCTGCC GACGATCAGG TCGGGCTGGC CGCGCTCTCG CCCCGCACGT GCTGGATCGC TCCGAGTTCC GGGCACGCTC ATCGGACCCG CATACACGAA CAGCTGGCGA ACGCGGAGGC CTTCGATCCA CGACCGCCGG ATCGGTCGTT CAAGCCCGCG ATTCGCCTTC AGTCGATCCG CAAGCGGCTC CCGAGTGCTG CCCAGCTGCT GGTGTTCTCG CCGGTCTGTG ACGACGAAGT CGTCGATGTC GTTCGCCGGC TGGGGGCCAG CGGACATGCG ACCACGGTGA TCAGCCCGAA TGCCACGGGC GGCAGGACAC CGGGTCGAAC GCTGGCCCGG ATCGAACGCA GTCTCCGCCT CTCACGACTT CGGGAGGCCG GTGTTCGCGT CGTCGACTGG GACGCCGAGG AATCACTCGC ATTGTCTCTG GACAAAGCCG GCAGGCGGTG GTCGTGA
|
Protein sequence | MTATDGSTGE TELNALGVGR ALGVLGLLAV LGGLVTILQP SLASDTAFGY GFVTLLGVVI LAASLRYWVG LALTTVESAT PPSVEERREI SIPGVAFDRT LADHRLTAVS RLRARRDVLD RLRSVAARIH ERDHGGSDGQ PDADVFGEDA ETVLFDDADR EQGLTGRLAA VRAKGTPLQR RVSDVVAELA RQYDGERVSV EFDVPERTTA TRPSAPGLRA TNRFRLLKPM ALSAVGVGVV FGSAALLLVG ALFGGVAVYA AADSPPTAAV RVTRSVETDQ PAPGESVRVT VSIENTGDRL LPDLTVIDRV PEGCSVTAGS PRHGTSLRAG ATATFGYELE AVRGHHEFGD AAVIVRNLSG SFERVADVSA DGDASITYDV TAVTDHSIPL RKQTSRSVGR VVTDVGGSGI ELHSVREYRT GDPLNRIDWN RAAAGEDLAT LQFREERSAT VVLLVDARVE AYVAPEADAP SAVDRSVLAA AESASALLAA DDQVGLAALS PRTCWIAPSS GHAHRTRIHE QLANAEAFDP RPPDRSFKPA IRLQSIRKRL PSAAQLLVFS PVCDDEVVDV VRRLGASGHA TTVISPNATG GRTPGRTLAR IERSLRLSRL REAGVRVVDW DAEESLALSL DKAGRRWS
|
| |