Gene Information |
Locus tag | Hlac_1865 |
Symbol | |
ID | 7400058 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 1869740 |
End bp | 1871707 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643708935 |
Product | hypothetical protein |
Protein accession | YP_002566513 |
Protein GI | 222480276 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
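The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the terminal stop codon. Neither assumption is stated in the record, although both are consistent with its values. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1869740, 1871707)
print(length, protein_length_aa(length))  # prints: 1968 655
```

Both results agree with the Gene Length (1968 bp) and Protein Length (655 aa) fields of this record.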
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0131162 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCGGG CGGTGTCTCT CATAAGTATG ACCGAGCAGG AGGCGATCCA CGTCGCAGAC GTCAGTGAGG GGATCGGCGG CGACGCGACG GCGGAGCCGG GCTCGCCGGT AGAGCTTCCG GTCGTCGACG TGCTCACCGG CCGATCGTTT ATTACGGGAA AAAGCGGATC CGGCAAGAGC AACACGGCGT CCGTCGTCAT CGAGAACCTC CTCGATAACG GCTTCCCCGT GATGATCGTG GACACGGACG GGGAGTATTA CGGCCTTAAG GAAGAGTACG AAATTCTGCA CGCCGGCGCC GACGACGAGT GCGACATCGT CGTCTCACCC GAACACGCCG AGAAGCTCGC CAATCTCGCC TTAGAGCAGA ACGTTCCGAT CATCCTCGAC GTCTCGGGCT ACCTCGAAGA AGACACCGCC AACGAGCTCC TCTTGGAGGT CGTTAAACAG CTGTTCGCGA AGGAGAAGCG CCTCAAGAAG CCGTTCCTGC TCGTCGTCGA GGAGTGTCAC GAGTACATCC CGGAGGGCGG CGGGATGGAC GAGACGGGGA AGATGCTGAT CAAGGTGGGC AAGCGCGGCC GGAAACACGG CCTCGGCATC GTCGGGATCA GCCAGCGCCC GGCCGACGTC AAAAAGGACT TCATCACCCA GTGCGACTGG CTCTGCTGGC ACCGGCTCAC CTGGGACAAC GACACGAAGG TCGTGGGACG CATCCTCGGC TCGAAGTACG CGAGCGCCGT CGAGGATCTG GGCGACGGCG AGGCCTTCCT GATGACCGAC TGGGACGAGT CGATCCGCCG GATCCAGTTC CACCGCAAGC AGACGTTCGA CGCGGGCGCG ACGCCCGGGC TCGACGACTT CGAGCGCCCG GAGCTCAAGT CGGTCTCCAG CGATCTCGTC GGGGAGCTCC AGTCCATCTC CGACGAGGAG GAGCGTCGCG AGTCCGAGAT CGCCGACCTC AAACAGGATC TCGACAAACG GGACCAGCGC ATCCAGGAGC TCGAACGCGA GCTAGAGGAC GCCCGCGACC TGAGCAACAT GGCCGACCAG TTCGCGCAGG CGCTTCTCGG GAAGGCCGAA GCGCCGTACC GCGGCGGAAA CGGCCCGTCC GTGGCCGCCG GCGGAGAGGG AACGACTGGC GACGACGGCG ACCAGTCCGT GCTCAAGTCG TACGACGAGG CGGTCGCCGC GACCGAGAGA AGCGAGGACG CCGACGGAGA CGCCGATGTC GACACAGACA CCACCGCGGG CAACCGCGAC ACAGCCCCGA ACGACACCGA CTCCGACGCC CCGGCCGACG ACTCCCCGAC CGGCGAGACC CCAACCGACG ACTCGAGCGA CGGCGCCCCC GTCGATGACG TCGATCCGGT CACGCGCGCC GACGTGGCCG AGAGCGCGCT CCGGTTCGAC GACGCGGTCG AGCTCGGCAC CCGCGAGGCC GTTATCGAGG AGCTTCGGAG TCGGATCGAG GCACTTCCGG AGCTCTCACA GGGGATGCTC CGACACTACC GACGCGAGGG CGTCAGCGAC CCCGTCGCCG CCCACATCGA CGCCGGCGGC GACGGCGACT CCGGCCACGC GTACAGCCGT CACCGCCCCA TCCGACGGGC GGGGGTCATT CGTCACGCCG GTCGAGGCCA CTACGCCTAC GCCGTCCCCG ACCTGATCCG CGAGGCGTAC GCCGACCGAC TCGACGAGGA GAGCGTCCGT GAGATCGTCC GCGCGGTCGA GACCGCCTTC GTCCCGCCGG CCGAGCGGAG CTACCCGCCG GACGCCGATC CGGCCGAGGT CGGCGTCGGA 
AACGGGTCGG CAGAGACCGA TTTGGACGGT GAGGTCGTTT CGCCCGGCGA AGGTGACGAC AGCGACGGAA GCGCGTCGGA AACCGAGGCG AGTGCGGATC AGCCGACGAG CCACCTGAGC GACGCCGCAC GGAAGATCGC GGAGCGCGGG CACACCGTCG ACGAGTAA
|
Protein sequence | MTRAVSLISM TEQEAIHVAD VSEGIGGDAT AEPGSPVELP VVDVLTGRSF ITGKSGSGKS NTASVVIENL LDNGFPVMIV DTDGEYYGLK EEYEILHAGA DDECDIVVSP EHAEKLANLA LEQNVPIILD VSGYLEEDTA NELLLEVVKQ LFAKEKRLKK PFLLVVEECH EYIPEGGGMD ETGKMLIKVG KRGRKHGLGI VGISQRPADV KKDFITQCDW LCWHRLTWDN DTKVVGRILG SKYASAVEDL GDGEAFLMTD WDESIRRIQF HRKQTFDAGA TPGLDDFERP ELKSVSSDLV GELQSISDEE ERRESEIADL KQDLDKRDQR IQELERELED ARDLSNMADQ FAQALLGKAE APYRGGNGPS VAAGGEGTTG DDGDQSVLKS YDEAVAATER SEDADGDADV DTDTTAGNRD TAPNDTDSDA PADDSPTGET PTDDSSDGAP VDDVDPVTRA DVAESALRFD DAVELGTREA VIEELRSRIE ALPELSQGML RHYRREGVSD PVAAHIDAGG DGDSGHAYSR HRPIRRAGVI RHAGRGHYAY AVPDLIREAY ADRLDEESVR EIVRAVETAF VPPAERSYPP DADPAEVGVG NGSAETDLDG EVVSPGEGDD SDGSASETEA SADQPTSHLS DAARKIAERG HTVDE
|
| |
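The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table from the NCBI amino-acid string for translation table 11 (identical to the standard code for elongation codons) and translates only the first 30 and last 18 nucleotides of the gene sequence shown above, not the full sequence. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch does not model alternative start codons, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-nt block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGACGCGGG CGGTGTCTCT CATAAGTATG"))  # prints: MTRAVSLISM
print(translate("CACACCGTCG ACGAGTAA"))               # prints: HTVDE
```

The two outputs match the first ten and last five residues of the protein sequence listed in the record.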