Gene Information |
Locus tag | Sare_2931 |
Symbol | |
ID | 5705236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3317705 |
End bp | 3320860 |
Gene Length | 3156 bp |
Protein Length | 1051 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641272380 |
Product | ATPase-like protein |
Protein accession | YP_001537748 |
Protein GI | 159038495 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
| |
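The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 3317705
END_BP = 3320860
GENE_LENGTH_BP = 3156
PROTEIN_LENGTH_AA = 1051


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon (assumed present).
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected protein length (aa):", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields in the record.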
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.111498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGAGCC ACCTGATAGG CCGTGATCAA CAGGTCCGGA TCCTCCAGGA CCTCATCACC CGCACCCTCG ACAGTCACGG CGGTCTCGCA CTCATATCCG GCGATGCCGG CGTCGGCAAG ACCGCCCTCG CCGCGGCCGC GGCCCAACAT GCACAGAGTC GGGGCTTGCT TGTGCTGAAC GGGGCCTGCT GGGACCCCGG GAGCGCGCCG GACTACTGGC CGTGGGTACA GGTGGTTCGG GCCCTGCGCA GCGGGGCTTC GGCCACGGCC TGGGAAGCGG GTAACGAGAT CGGTAGCAGT TTGTCAATCA TGCTTGGTGA AACCGTGATG GCCGAGCCCG ACGAGTTCCG GCTCCACGAC GGCGTCACGA CAGCGTTGGT CTCCGCCGCC CACCACCAAC CACTGGCAGT GGTGCTGGAA GATCTGCACT GGGCCGACGC CGCGTCCGTG CGCCTTCTCG ACTTCGTGGC CCGGCACACG TGGTTCGAAC GTATCCTCCT CCTCGGTACG TATCGCGACC TCGAACCTGT CTGGCACGAG CACCCCGTTC ATGCCGTGGC GGACACACTG CTCGGCAAGG CGACAACCAT CACCCTGACC GGTTTGGGCC GCGACGAGGT CGGTACGTTG ATGGCGCGCC ACACGGGCCA CGAACCGACC TCGGATCTGG TGTCCGAGGT GCATCGGCGT ACGGGAGGGA ACCCCTTCTT CGTCGAGCAA ACCGTTCATC TGCTGGCCGC CGGTGACGAG ATCACCGCGA TCCCACCCGG TGTCGCCGAC GTCGTGCACC AACGGCTGGC GCGATTGCCA GCAGCGGAGC AACGTCTACT GGAGACCGCC GCCGTGGCCG GACACGAGTT CCATCGTGAT GTGATCGCGG CGTTGTCGTC GAACCCAGGG TCGGTGGAAC GCCTGCTGCC TGACCTGGTG GCTGCTCGTC TGCTCGACGC CCTCGGACAC GGCCGTTTCG CGTTCGCGCA GGATCTCGTG CGCGAGACCA TCTACCGGGA ACTCCCACCG CGGCAGGTGT GTGCGGCCCA CGCCGCGATT GTCCATGCGG TCACACGTCA CCCTCATCTC GGCGCGAGCA CCCTGCCTAG CGAGCTGGCC CGGCACGCTC AGCTAGCAGG TACCGCCATA CCTGCCGGCC GTGCGGTGGA ACTGCTTCTC GCGGCCGCCG ATGACGCCGA CCGGCGGATG GCGGTCGACG AGTCCGTCGA GCATCATCGT CGGGCGCTCG ACACGGTACC CGCCGACGAC GCAAGGACGA TCGTGGACGT CGCGCTGAGA CTCGGCACGA GGCTGCGTCG GCGCAGGGGC AGGTCCGCCG CCGACGCGGC GTTCACCCGG GCGGTCGCCG CGGCTCGTGA GGTGGGCGAC GCCCTCCTGC TGACGCGGGT CGCACTGACC GTGTACCGCC GTGACCTGAC CGGTGAGGAC CGCGTCGGCC TGCAGCTGAT CAGAGAGGCC CACGACGCCC TGGGCCTGGC CGAAGCGCCC GGTCGTCGCG AGTCAGACCG TGATGGCGGC GAAGAGCCCT CCACCGATCG GCTCCTGCAG GAGCTGGCGA CTCACGGAGT GGTCGAGGCC CGCCGCCACC GGGACGACAA CGCGTTGACC TTCGCCCTGC GGGCTCTCCA CGACGTGACG TGGGGGCCGG GCACCGCGCC ACAGCGGGAG GCGGTGACAA CTGAGCTCGC TCAGGTTGGG CACCGGACGG GCAGCACCGA ATCCGACCAG TTCGCCCTGT CACTTCGGTG GGTCGCCCTG CTCGAGCTCG GCGACGCCCG ATACCTCGGT 
GCCCATCATG AGTTCGTGGC GCTGGCGATG AGCCGTGAGT CGCCCCGGTA CGCGCTCGCC GCGATCCTCG ACCGCGCTAT TATCGCCACC TTCCGTGGTC AGTTCGACGG CGCTGACGCA CTGCTTGATC AGGCGACCGG ACACGCCGGG TTCGATCAGC ACTGGACCTT CGTCGTCGAC CACCTCCGCT GGGCGCTCAG ACTCATGCAG GGACGCTACG CCGAACTCGA CGGACTGCAC GCGAGACTGG CTGCTCACGG CCCTCGCCAT GGGCAGCTGG TGGCGGCGAT CTCCACGGTG CAGCGTGGGG ACGTCCCCAC GGCACTGCGC CACCTCGTCG AGTCGTCCGG ACGAGCCCAC CCGTACCCCC GCATCTACGT TCCGTTGTGG TTACGGTTCC AGGCCCAGGT GGCGGCGGCC TCGGGCCGCC GCGAGCTCTG CCAGCAGATA CGTGCGAGCC TGCAACCCTA TGTCGGCCAG TGGGCAGTTT CCCTGTACGG ATGCGACATC AGCGGCCCCT ACCAGCTCTG GTGCGCCGTC CTCGACGCAG CCCTGGAGCA CTGGGAGGAC GCCATCGTCG GGTTCACGGC CGCGGGAGAG GCGGCTGAGC GACTGCATTC ACGGCCGTGG TCGCTCCAGG CCCGGGCCGG GCTGGCCCTG GCCCTGATCG GACGCGGCCA GGCGACGGAC GCCGAGGCCG CGGCAACGCT GTTGAACGAC GTGGAACGCG AATCCATCGA GATCGGCATG AATGGACTGG TCGAGCACAT CCGGCAGGCG AGAGCCCCGG CGATGCGCGC CCCGGTCACC GCCGCCGACG GTGCCTTCCG TTTCGATGGC CAGGTGTGGC AGGTGAGCTA CGCGGGACGC AGTGTGTACC TTCCCGACGC GAAGGGCTTG GGTGACCTGC ACCAGCTACT GAGCTGCCCT GGCCAGGAGG TGGCGGCCCT CAGGCTGGTC GAGCGTGACC ATGTCGCCAC CACGGCGCAC CTCGGGGCGG ATCCGGTGCT CGACGACGCG GCCCTGGCAC ACTACCGCCG GCGGCTGGCC CAGCTCGACG AGCACATCGA CTGGGCGGTA GCCAGAGGTG ACGACGGTGT CGCCGCGCGT CACGACGGGG AGCGTGCCGC GCTGCTCGCT CAACTACGCT CCGCCGTCGG CCTTCACGGC CGCAGCCGCC GACTCGGCGA TGACGCGGAG CGAGCACGTA AGGCGGTCAC AGGTCGGATT CGGGCGGCGC TGCGTAGGCT GGAGGACCAT CATCCCGAAC TCGCGAACCA CCTGAACGCC ACGATATCGA CTGGACTCAC CTGCACGTAT CGGCCGCCAA CGAAGGTGTC GTGGGAACTT CGATGA
|
Protein sequence | MTSHLIGRDQ QVRILQDLIT RTLDSHGGLA LISGDAGVGK TALAAAAAQH AQSRGLLVLN GACWDPGSAP DYWPWVQVVR ALRSGASATA WEAGNEIGSS LSIMLGETVM AEPDEFRLHD GVTTALVSAA HHQPLAVVLE DLHWADAASV RLLDFVARHT WFERILLLGT YRDLEPVWHE HPVHAVADTL LGKATTITLT GLGRDEVGTL MARHTGHEPT SDLVSEVHRR TGGNPFFVEQ TVHLLAAGDE ITAIPPGVAD VVHQRLARLP AAEQRLLETA AVAGHEFHRD VIAALSSNPG SVERLLPDLV AARLLDALGH GRFAFAQDLV RETIYRELPP RQVCAAHAAI VHAVTRHPHL GASTLPSELA RHAQLAGTAI PAGRAVELLL AAADDADRRM AVDESVEHHR RALDTVPADD ARTIVDVALR LGTRLRRRRG RSAADAAFTR AVAAAREVGD ALLLTRVALT VYRRDLTGED RVGLQLIREA HDALGLAEAP GRRESDRDGG EEPSTDRLLQ ELATHGVVEA RRHRDDNALT FALRALHDVT WGPGTAPQRE AVTTELAQVG HRTGSTESDQ FALSLRWVAL LELGDARYLG AHHEFVALAM SRESPRYALA AILDRAIIAT FRGQFDGADA LLDQATGHAG FDQHWTFVVD HLRWALRLMQ GRYAELDGLH ARLAAHGPRH GQLVAAISTV QRGDVPTALR HLVESSGRAH PYPRIYVPLW LRFQAQVAAA SGRRELCQQI RASLQPYVGQ WAVSLYGCDI SGPYQLWCAV LDAALEHWED AIVGFTAAGE AAERLHSRPW SLQARAGLAL ALIGRGQATD AEAAATLLND VERESIEIGM NGLVEHIRQA RAPAMRAPVT AADGAFRFDG QVWQVSYAGR SVYLPDAKGL GDLHQLLSCP GQEVAALRLV ERDHVATTAH LGADPVLDDA ALAHYRRRLA QLDEHIDWAV ARGDDGVAAR HDGERAALLA QLRSAVGLHG RSRRLGDDAE RARKAVTGRI RAALRRLEDH HPELANHLNA TISTGLTCTY RPPTKVSWEL R