Gene Information |
Locus tag | Hlac_1479 |
Symbol | |
ID | 7400307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | - |
Start bp | 1486305 |
End bp | 1488506 |
Gene Length | 2202 bp |
Protein Length | 733 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643708541 |
Product | Fibronectin-binding A domain protein |
Protein accession | YP_002566137 |
Protein GI | 222479900 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
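The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes one untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1486305
end_bp = 1488506
reported_gene_length = 2202    # bp
reported_protein_length = 733  # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions both comparisons hold, so the reported span, gene length, and protein length agree with one another.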
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCAGA AGCGGGAGCT GTCGAGCATC GACCTCGCGG CCCTCGTTAC CGAGCTCAAT CGGTACGAGG GCGCGAAGGT CGACAAGGCG TACCTCTACG ACGACGACCT GCTCCGGCTG AAGCTCCGCG ACTTCGACCG CGGGCGGGTC GAACTCATGA TCGAGGTGGG AGACATCAAA CGCGCCCACG TCGCGGATCC CGAGAACGTC GCCGACGCCC CCGGTCGGCC GCCGAACTTC GCGAAGATGC TGCGAAATCG GATGTCCGGC GCCGACTTCG CGGGCGTCGA GCAGTACGAA TTCGACCGGA TCCTCACGTT CGAGTTCGAG CGCGAGGACG AGAACACGAC GCTCGTCGCC GAGCTGTTCG GACAGGGGAA CGTCGCCGCG CTCGACGAGA CCGGAGAGGT CGTCGGGTCG CTCCAGACCG TCCGGCTCAA GTCCCGGACG GTCGCGCCCG GCGCGCAATA CGAGTACCCC GCTTCGCGGC TCAACCCCCT CGACGTGAGC CTCGGCGGGT TCAAACGACA CATGCGCGAG TCCGACAGCG ACGTGGTGCG GACGCTGGCG ACTCAGCTCA ACCTCGGCGG GCTCTACGCC GAGGAGGTGT GTACCCGCGC AGGCGTCGAG AAGGAGACGC CGATCGACGA CGTGACCGAC GACCAGCTCC GTGCGCTCCA CGAGGCCCTC GAACGCATCG GCGAGCGGCT CCGCTCGGGC GATGTCGACC CACGGGTGTA CGAAGAGGAA CTGTCCGACG ACGAGGCGGA GGACCGCGAC CCCCGCGTCG TCGACGTCAC GCCTTTCCCC CTCTCGGAGC ACGAGGGGCT GCCGAGCGTC GGCTTCGACT CCTTCAACGC CGCCGTCGAC GAGTACTTCT ACCGGCTCGA TCGCGACGGG AGCGAGGAGG GCGAAGCCCC GGCCGACGCT AGCCCGTCTC GGCCCGACTT CGAGGAGGAA ATCGGTAAAC AAGAGCGGAT CGTCGAACAG CAGCAGGGGG CGATCGAGGG GTTCGAAGAG CAGGCCGAAG CGGAGCGCGA GCGCGCGGAG CTGCTGTACG CCGAGTACGA CCTCGTCGAC GAGGTGCTCT CGACGGTACA GGAAGCCCGC GAGGCGGAGG TGCCGTGGGA CGAGATCGCC GAAACCCTCG ACGCCGGCGC GGAGCAGGGA ATCCCGGCGG CGGAGACGGT GGTCGATGTC GACGGCGGCG AGGGCACGGT GACGGTCGAA CTCCGCGGCG GTGACGGTGA GGACGACGAC GGAGAGACGA CTCGGATCGA GCTCGACGCG AGTGCGGGCG TGGAGGTCAA CGCCGACCGC CTCTATCAGG AAGCCAAGCG CATCGAAGGG AAGAAAGAGG GCGCGATGGA GGCGATTAAG TCGACTCGCG CGGAGTTGGA GGCCGTCAAA GAGCGGAAAG CCGAGTGGGA GGCGAAGGAG GCGGCTGCCG ACGAGACCGC CGGGGACGGA GCCGACGACG GGGAAGAAGA GGAGGACGGC GAGGAGTACC AGACCGACTG GCTCTCCCGC TCCTCGATCC CGATCCGGAG CCCCGACGAC TGGTACGACC GCTTCCGGTG GTTCTACACT TCGACGGGCT ACCTCGTCAT CGGCGGACGC AACGCCGACC AGAACGAGGA GCTTGTCAAG AAGTACATGG GCAAACACGA CCGGTTCTTC CACACGCAGG CGCACGGGGG CCCGGTGACG CTCCTGAAGG CCGCGGGGCC CTCGGAGTCG GCCGATCCGG TCGACTTCTC GGAGGAGACC TTACGCGAAG TCGCGCAGTT CGCCGTCTCC 
TACTCGTCGG ACTGGAAGGA CGGGCGCGGC GCGGGCGACG CGTACATGGT CGAGCCCGAT CAGGTGTCGA AGACTCCCGA GAGCGGCGAG TACATCGAGA AGGGGAGCTT CGTGATCCGC GGTGACCGCA CCTACTTCGA GGACGTGCCC TGTCGGATCG CCGTCGGCGT CCAGTGCGAG CCCGTTACGC AGGCCATCGG CGGGCCGCCC TCGGCGATCG TCGATCGCGT GGCGACGAGC GTCACGCTGG AGCCGGGGAT GTACGCCCAA AACGACGCCG CGATGATGGT GTACCGCGAG CTAAAGGGGC GCTTCGCCGA CCAATCGTTC GTTCGGAAGG TGGCGAGCGC CGACCAGCTG CAGGAGTTCA TTCCGGCGGG CGGCTCCGAC ATCGTGGACT GA
|
Protein sequence | MDQKRELSSI DLAALVTELN RYEGAKVDKA YLYDDDLLRL KLRDFDRGRV ELMIEVGDIK RAHVADPENV ADAPGRPPNF AKMLRNRMSG ADFAGVEQYE FDRILTFEFE REDENTTLVA ELFGQGNVAA LDETGEVVGS LQTVRLKSRT VAPGAQYEYP ASRLNPLDVS LGGFKRHMRE SDSDVVRTLA TQLNLGGLYA EEVCTRAGVE KETPIDDVTD DQLRALHEAL ERIGERLRSG DVDPRVYEEE LSDDEAEDRD PRVVDVTPFP LSEHEGLPSV GFDSFNAAVD EYFYRLDRDG SEEGEAPADA SPSRPDFEEE IGKQERIVEQ QQGAIEGFEE QAEAERERAE LLYAEYDLVD EVLSTVQEAR EAEVPWDEIA ETLDAGAEQG IPAAETVVDV DGGEGTVTVE LRGGDGEDDD GETTRIELDA SAGVEVNADR LYQEAKRIEG KKEGAMEAIK STRAELEAVK ERKAEWEAKE AAADETAGDG ADDGEEEEDG EEYQTDWLSR SSIPIRSPDD WYDRFRWFYT STGYLVIGGR NADQNEELVK KYMGKHDRFF HTQAHGGPVT LLKAAGPSES ADPVDFSEET LREVAQFAVS YSSDWKDGRG AGDAYMVEPD QVSKTPESGE YIEKGSFVIR GDRTYFEDVP CRIAVGVQCE PVTQAIGGPP SAIVDRVATS VTLEPGMYAQ NDAAMMVYRE LKGRFADQSF VRKVASADQL QEFIPAGGSD IVD
|
| |
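As a quick sanity check of the Sequence section, the first 30 bases of the gene sequence can be translated and compared with the start of the protein sequence. This is a minimal sketch: `CODONS` is a hypothetical, deliberately partial codon table that covers only the ten codons in this prefix (these assignments are the same in the standard code and in translation table 11), and `translate` is an illustrative helper, not a library function.

```python
# Partial codon table (assumption: only the codons used in this prefix).
CODONS = {
    "ATG": "M", "GAC": "D", "CAG": "Q", "AAG": "K", "CGG": "R",
    "GAG": "E", "CTG": "L", "TCG": "S", "AGC": "S", "ATC": "I",
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces between blocks."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))

# First three 10-base blocks of the gene sequence, as printed in the record.
gene_prefix = "ATGGACCAGA AGCGGGAGCT GTCGAGCATC"
# First 10 residues of the protein sequence, as printed in the record.
protein_prefix = "MDQKRELSSI"

print(translate(gene_prefix) == protein_prefix)
```

For the full 2202 bp sequence a complete codon table would be needed; an established library such as Biopython would be a more robust choice than extending this dictionary by hand.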