Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Strop_1939 |
Symbol | |
ID | 5058402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora tropica CNB-440 |
Kingdom | Bacteria |
Replicon accession | NC_009380 |
Strand | + |
Start bp | 2212219 |
End bp | 2213877 |
Gene Length | 1659 bp |
Protein Length | 552 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640474213 |
Product | ATPase central domain-containing protein |
Protein accession | YP_001158779 |
Protein GI | 145594482 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0464] ATPases of the AAA+ class |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.208257 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.282616 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGAGGC CTTTTCGCGA GGCACTGTCG CAACTGCTCA AAGCGCGCTT CCCGATCCTC TACGTAGAGT CGTACGAAGA GCATCGGGTG GTCGCGGAGG TGACGGCCGT AGCCCGCGAT GTGGCCCTGG TTCGTACTCC CAGGGCCGTC TGGACCTGGT CCGCGACAGA AGGACTGGTT CAGCCGGACG GTACCCCGAG GAAGGGCACG ACGGACCCGG AGGATGCGCT CAACGCCGTG CTTCGGATCG ACGAGCCCAG CGTGCTCATG TTCAGGGACC TACACGCGGC GCACGGGGGT GGCGACAGGC CTGGGAGCCC GGGTGTGGTT CGGCGGTTGC GGGATGTGGC GGCGGCCTTC AAGTCCGGCC CGGTCGCGCG GGCCTTGGTG CTTATCTCTC CCATGCTGCG GATTCCGGTG GAGCTGGAAA AGGACGTCAC GATCGTTGAC TTTCCACTGC CCACTGAGCA CGAGATCCGC ATGGTGCTTG AAGGGATGAT CGCGGCCAAC TCCGCCAGCG GCCGGATCCG CATCGGCCTG GATGAGGTGG GGAGGGAGCG GTTGGCGAAG GCAGCTCTCG GCCTCACGCT GCAGGAGGCG GAGAACGCCT TCGCTCGTGC GATGGTCAAC GACGGTGTGC TTGACCTTGA CGACATCGCG GTGGTGCACG AGGAGCAGCG GCAGACGGTA CGCAAGTCGG GACTGCTCGA GTTCGTCGAT GTCGATGTCG ACCTGGCCGA CGTCGGTGGT CTGGAAAATC TGAAGCGGTG GCTGGCCAAA AGGGATAGTT CCTGGTTGGC AGAGGCGGCG GAGTACGGGC TTCCCGCGCC GCGCGGTGTG CTGATCACTG GCGTGCCCGG CTGCGGCAAG TCGCTCACCG CAAAGGCCGT CGCCGCTAGC TGGGGCTTGC CGCTGCTGCG CTTCGACATC GGTCGGGTCT TCTCCGGACT GGTCGGATCC AGCGAGCAGA ACGTGCGCAA CGCTATCCGC ACCGCCGAGG CCACCTCACC GTGCGTTCTC TGGGTCGACG AGATCGAGAA GGGCTTCGCC AGCGGCGGCG CCGCTGGTGA CTCGGGAACG TCATCTCGCG TGTTCGGCAC CTTCCTCACC TGGCTGCAGG AGAAAACGGA GCCCGTCTTC GTCATCGCGA CCGCGAACAA CATTGAAAGC CTCCCGCCTG AGATGCTGCG AAAGGGGCGC TTCGATGAGA TCTTCTTCGT CGATCTCCCG ACCATCGCGG AGCGCGCGTC GATCTGGGCC ATGCACATCG CAAAGCGGAT GACGCATCCT GCTGTCGCCA GAGATCTCAC CGTCGACGAC GCACTGCTGA AGGAGCTCTC GGGGCTGAGT GAGGGCTATT CGGGAGCCGA GCTCGAGCAG GCGGTCATCG CCGGCCTTTT CGACGCCTTC TCGGAAAGAC GCCCGTTACG CAAGGACGAT CTAATCCACG GCGTGGTCAA CATGGTGCCG CTCAGCGTGA CCCAGGCGGA GCGGATCAGC GCCATCCGCG CCTGGGCGAA TATGCGTGCT GTGGCGGCCA CGGCCGCGGA AGACTGGGAC CCTCCTGGCC GCTCCGGGAC CACCATTGCG TCCACGGCTC CTACCCAACC CGGCGATGCA CCACCCCCTC ACATTGGTGG AAGATCGGTC GAGTTTTGA
|
Protein sequence | MARPFREALS QLLKARFPIL YVESYEEHRV VAEVTAVARD VALVRTPRAV WTWSATEGLV QPDGTPRKGT TDPEDALNAV LRIDEPSVLM FRDLHAAHGG GDRPGSPGVV RRLRDVAAAF KSGPVARALV LISPMLRIPV ELEKDVTIVD FPLPTEHEIR MVLEGMIAAN SASGRIRIGL DEVGRERLAK AALGLTLQEA ENAFARAMVN DGVLDLDDIA VVHEEQRQTV RKSGLLEFVD VDVDLADVGG LENLKRWLAK RDSSWLAEAA EYGLPAPRGV LITGVPGCGK SLTAKAVAAS WGLPLLRFDI GRVFSGLVGS SEQNVRNAIR TAEATSPCVL WVDEIEKGFA SGGAAGDSGT SSRVFGTFLT WLQEKTEPVF VIATANNIES LPPEMLRKGR FDEIFFVDLP TIAERASIWA MHIAKRMTHP AVARDLTVDD ALLKELSGLS EGYSGAELEQ AVIAGLFDAF SERRPLRKDD LIHGVVNMVP LSVTQAERIS AIRAWANMRA VAATAAEDWD PPGRSGTTIA STAPTQPGDA PPPHIGGRSV EF
|
| |