Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hlac_0722 |
Symbol | |
ID | 7400195 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorubrum lacusprofundi ATCC 49239 |
Kingdom | Archaea |
Replicon accession | NC_012029 |
Strand | + |
Start bp | 736694 |
End bp | 738595 |
Gene Length | 1902 bp |
Protein Length | 633 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 643707788 |
Product | ATPase |
Protein accession | YP_002565394 |
Protein GI | 222479157 |
COG category | [R] General function prediction only |
COG ID | [COG1855] ATPase (PilT family) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0212836 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.622935 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGTAG TACCGGACAC CAGTGCGGTC GTCGACGGCC GCGTGTCCGA ACGGGTCGAG GACGGGACCT ACGCGGCGGC GCGGGTGCTG ATCCCGGAGG CCGTCGTCGG CGAGCTGGAG TCGCAGGCCA ATGACGGGCT CGAATCGGGC TGGGACGGAT TGAGCGAGCT CAAACGGCTC GCCGACTACG CCGACGAGGG GACGATCGAA CTGGAGTACG TCGGCGAGCG AGCCGACGGC GACGCCCGCT CGCGCGCCCA CAAGGGCGAC GTGGACGCGT TGATACGCGA GGTCGCCGAC GACTACGACG CCACGCTCCT CTCCAGCGAC ATCGTACAGG CCGAGGTCGC CCGCGCGAAG GGGATCGACG TCGAGTACGT CGAGCCGGTC GCCCGCGGCG TCGTCGACGA GCTGCCGATT CTGGAGTTCT TCACCGACGA GACGATGTCG GTCCACCTCA AGACGGGGAC GAAGCCGAAG GCGAAGCGGG GAGCGCTCGG CGAGATGCGG TACGTCGTCA TCGACGAAGA GGAGACGAGC GAGGAGCAGA TGGACGAGTG GGCCACCGCC ATCGTCGATC TGGCCCGCCA GTCCACCGAT GGGTTCATCG AACTCTCCGA CGACGGGATG GACATCGTCC AGTTCCGTAA CTACCGGATC GCCGTTGCGC GACCCCCGTT CGCTGACGGG ATCGAGATCA CGGCCGTCCG CCCCATCGCG AAAACGACGC TCGACGACTA CGAGTTCGCC GACGAGCTCC GCGAGCGGTT CCTCGAACAG AAACGCGGCG TGCTCATCTC CGGATCCCCC GGCGCCGGGA AGTCGACGTT CGCGCAGGCG GTCGCGGAAT TCCTGAACGA CAACGACTAC GCGGTCAAGA CGATGGAGAA GCCGCGGGAC CTGCAGGTCG GTCCGGAGAT CACCCAGTAC GGCGCGCTCG GCGGCCAGAT GGAGAAGACG GCGGACTCCC TCCTGTTGGT TCGCCCCGAC TACACCATCT ACGACGAGGT GCGGAAGACG AACGACTTCG AGGTGTTCTC GGACATGCGG ATGGCCGGCG TCGGGATGGT CGGCGTCGTC CACGCCTCCC GCGCCATCGA CGCGCTCCAG CGGCTCGTCG GCCGGGTCGA GCTCGGGATG ATCCCGCAGA TCGTCGACAC CGTGGTGTAC ATCGAGGCTG GCGAGGTCCA CACCGTCTAC GACGTGACCA CCGAGGTGAA GGTGCCTGCG GGGCTCACCG CCGAAGACCT CGCGCGGCCG GTCATCCAGG TGTCGAACTT CGAGACGGGT CGGCCGGAGT ACGAGATATA CACGTTCAAC CGGCAGGTCG TCACGGTTCC GCTGAACGAC GAGGACGGTG AGGGCGAGGC GGAGACCGGC GTCGGCCGGA TCGCGAAACA GGAGATCGAA CGCGAGATTC GGTCGGTCGC ACACGGCCAC GTTGACGTCG AGCTCAAGGG GAACAACAAG GCGATCGTGT ACGTCTCTGA GGGGGACATC GGCACCGTCA TCGGTAAGGG CGGCGGCCGC ATCAGCGACA TCGAGAACCG CCTTGGGATC GAGATCGACG TCCGCACTCA CGACCAGAAG CCCGGCGGCC GCGAGGGCGG TTCGGGCGGC GACGGGCGCG GCGCGAACGG ACAGGGAGAC GGTCAGGGCG GGCAGGTCGG AGAGGAGCGC GGCACCGTTG TGACCCCCGA GATCACCTCT CGACACGTCG TCATCGCCGT CGACGCCGGC GTCGGGGAGA CGGTCGAGGT GCGCGCGGAC GGCGAGTACC TCTTCACCGC CACCGTCGGG CGCGGCGGCG AGGTGCAGGT CTCCCGCGGC TCCGCAATCG CCGAAGAGTT GGAGGACGCA ATCGACCGGA AGCGGACCGT AACGGTCGTT CCGGCTCGCT GA
|
Protein sequence | MNVVPDTSAV VDGRVSERVE DGTYAAARVL IPEAVVGELE SQANDGLESG WDGLSELKRL ADYADEGTIE LEYVGERADG DARSRAHKGD VDALIREVAD DYDATLLSSD IVQAEVARAK GIDVEYVEPV ARGVVDELPI LEFFTDETMS VHLKTGTKPK AKRGALGEMR YVVIDEEETS EEQMDEWATA IVDLARQSTD GFIELSDDGM DIVQFRNYRI AVARPPFADG IEITAVRPIA KTTLDDYEFA DELRERFLEQ KRGVLISGSP GAGKSTFAQA VAEFLNDNDY AVKTMEKPRD LQVGPEITQY GALGGQMEKT ADSLLLVRPD YTIYDEVRKT NDFEVFSDMR MAGVGMVGVV HASRAIDALQ RLVGRVELGM IPQIVDTVVY IEAGEVHTVY DVTTEVKVPA GLTAEDLARP VIQVSNFETG RPEYEIYTFN RQVVTVPLND EDGEGEAETG VGRIAKQEIE REIRSVAHGH VDVELKGNNK AIVYVSEGDI GTVIGKGGGR ISDIENRLGI EIDVRTHDQK PGGREGGSGG DGRGANGQGD GQGGQVGEER GTVVTPEITS RHVVIAVDAG VGETVEVRAD GEYLFTATVG RGGEVQVSRG SAIAEELEDA IDRKRTVTVV PAR
|
| |