Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NSE_0725 |
Symbol | |
ID | 3931927 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Neorickettsia sennetsu str. Miyayama |
Kingdom | Bacteria |
Replicon accession | NC_007798 |
Strand | + |
Start bp | 647332 |
End bp | 649038 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637900881 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_506601 |
Protein GI | 88607990 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.967446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTGAGTC TAACAATTAA AGTGATTGTA TCTAGCCCGA ATTGGTTGCC TATGAAAAAT TTGATCCCGT TTTTTTTAGT CGTTATTGCT TTTTTCGCTT TTTCTGATTG TTACGCTGCC TCACCAACCA AGAAGGAAAT ATTAGAATTT CTAAAACTAC AGAATAAAGA TCACAACGTT GTGTTGGATT TTAAGGAGCG CTTTGGTGAT AGACTGACAA ATGTTGATTT CTCTGGACTG GATCTTGGTA AAGTAACGTT CGATGGAATG ATAATAGAGA ACTCTTCTTT CGATCGTGCA GTTTTCACAA GTCTTACAAT CAAAAATAGC GTCGTAAATA ACTCTACTTT TTACAGTACA ATTATCTACA AAGGACATAT CGTAAGCAGC CAGTTGGATG GCGTGTCTAT TATCGAATCG GATTTAAACA ACACACAAAT AGAAAAATCT GAGCTAAAAA ATGTCAGAAT TCTGAAGACA CATGCTCCCT CGCTTATACT CGATGATAGC AAGCTAAACA GCATCATAAT CAGAAACTCC GATATTTTGG AGGCCAAATT TGGAAGAACC GTCCTTGCTG AAAGTGAAAT TTCTGGAAGT GATCTCTCCA ATATGAGAAT GGAAGATTCG AAAATTACCG ATTCTCGACT AACGGTCTCA AAGCTCATAA ATGCAAAGAT GTCCAAGAGT ACGCTTACAA ACAGCACCTT ACACGGAATA GAATTATCAG GTTCTCACAT GCGTGATACT CAGATATTTT CCTCAGAGAT AACTAACTCT GACCTTCATC GAAGTAGACT GTACAACTGT CATTTAGAGA AAGTAAGCAT GCTAAACACC GATTTTGGCT ATGCATCAGT TGAAGGTACC TCATTTATCA AAGCCGACTT CTCCTCGGCA TCTTTAGAGG GATTACATAT AGGAAGCGCA ACTTTTTCAG AATGCTGTTT GTGTAACTTC AGCTCACAAA ACATTACGAT CGATTCATCA GCAATAACCC ATTCGTCTAT AAGCAATGTA AAATTGTATG ACAGCGAAAT TGCAGATACC TCGCTAAAAG ATTCTAGTAT TGCAAATCTT AGTATCTCTA ACTCAGGGTT TCTTGATACC GTACTTTTGG ACGTCAATGG CAAGAGTATT GCAATCAAGC ATACGCAAAT AGATTCTTTG CTTCTCCAAG GGGATTTTTC AGATATAACA ATAGAGGACT CACAGATCGT GAAGAGTTCT TTAAGGAATC TCAAATTACA ACTGCTGTGG CTGCTGCATT CTCATATAAA CAACACGCAG GTCCAAGATG GCACGATATC TAAAAGCAAT TTCTTAGCTA ATACCTTTAT AGACAGCTCA GTGGACAATT TAACGATCGC CAAATCAAGT TTCACTGAAA ACAATTTTGT TGGAACAAAT GTAGGAAATA TTTCATTTAC AAAGACACTT TTCACAGAAA AATTTATAGA GGGCGTTTCT TCCAAACTCG CCCAAATGGG AGCAATTGTC GGATTGTCGA ATTTTGAAAA GCTCATTGCA AGTGGAACGT ATGATTTCAC AGATGTAAAC TACTCAAATA TCGACTTCAG TAAAATTGAT TTGGGAAAGG TAAATTTCAA GGGAGCAATA TTAAGGGAAA ACATTTTTAG TGAAAATAAA CTACACGATG TAGACCTTAC GAAAGCTGAT CTAGAAGGAA GTACCTTCCA TAAATAA
|
Protein sequence | MLSLTIKVIV SSPNWLPMKN LIPFFLVVIA FFAFSDCYAA SPTKKEILEF LKLQNKDHNV VLDFKERFGD RLTNVDFSGL DLGKVTFDGM IIENSSFDRA VFTSLTIKNS VVNNSTFYST IIYKGHIVSS QLDGVSIIES DLNNTQIEKS ELKNVRILKT HAPSLILDDS KLNSIIIRNS DILEAKFGRT VLAESEISGS DLSNMRMEDS KITDSRLTVS KLINAKMSKS TLTNSTLHGI ELSGSHMRDT QIFSSEITNS DLHRSRLYNC HLEKVSMLNT DFGYASVEGT SFIKADFSSA SLEGLHIGSA TFSECCLCNF SSQNITIDSS AITHSSISNV KLYDSEIADT SLKDSSIANL SISNSGFLDT VLLDVNGKSI AIKHTQIDSL LLQGDFSDIT IEDSQIVKSS LRNLKLQLLW LLHSHINNTQ VQDGTISKSN FLANTFIDSS VDNLTIAKSS FTENNFVGTN VGNISFTKTL FTEKFIEGVS SKLAQMGAIV GLSNFEKLIA SGTYDFTDVN YSNIDFSKID LGKVNFKGAI LRENIFSENK LHDVDLTKAD LEGSTFHK
|
| |