Gene NSE_0725 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNSE_0725 
Symbol 
ID3931927 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNeorickettsia sennetsu str. Miyayama 
KingdomBacteria 
Replicon accessionNC_007798 
Strand
Start bp647332 
End bp649038 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content37% 
IMG OID637900881 
Productpentapeptide repeat-containing protein 
Protein accessionYP_506601 
Protein GI88607990 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.967446 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTTGAGTC TAACAATTAA AGTGATTGTA TCTAGCCCGA ATTGGTTGCC TATGAAAAAT 
TTGATCCCGT TTTTTTTAGT CGTTATTGCT TTTTTCGCTT TTTCTGATTG TTACGCTGCC
TCACCAACCA AGAAGGAAAT ATTAGAATTT CTAAAACTAC AGAATAAAGA TCACAACGTT
GTGTTGGATT TTAAGGAGCG CTTTGGTGAT AGACTGACAA ATGTTGATTT CTCTGGACTG
GATCTTGGTA AAGTAACGTT CGATGGAATG ATAATAGAGA ACTCTTCTTT CGATCGTGCA
GTTTTCACAA GTCTTACAAT CAAAAATAGC GTCGTAAATA ACTCTACTTT TTACAGTACA
ATTATCTACA AAGGACATAT CGTAAGCAGC CAGTTGGATG GCGTGTCTAT TATCGAATCG
GATTTAAACA ACACACAAAT AGAAAAATCT GAGCTAAAAA ATGTCAGAAT TCTGAAGACA
CATGCTCCCT CGCTTATACT CGATGATAGC AAGCTAAACA GCATCATAAT CAGAAACTCC
GATATTTTGG AGGCCAAATT TGGAAGAACC GTCCTTGCTG AAAGTGAAAT TTCTGGAAGT
GATCTCTCCA ATATGAGAAT GGAAGATTCG AAAATTACCG ATTCTCGACT AACGGTCTCA
AAGCTCATAA ATGCAAAGAT GTCCAAGAGT ACGCTTACAA ACAGCACCTT ACACGGAATA
GAATTATCAG GTTCTCACAT GCGTGATACT CAGATATTTT CCTCAGAGAT AACTAACTCT
GACCTTCATC GAAGTAGACT GTACAACTGT CATTTAGAGA AAGTAAGCAT GCTAAACACC
GATTTTGGCT ATGCATCAGT TGAAGGTACC TCATTTATCA AAGCCGACTT CTCCTCGGCA
TCTTTAGAGG GATTACATAT AGGAAGCGCA ACTTTTTCAG AATGCTGTTT GTGTAACTTC
AGCTCACAAA ACATTACGAT CGATTCATCA GCAATAACCC ATTCGTCTAT AAGCAATGTA
AAATTGTATG ACAGCGAAAT TGCAGATACC TCGCTAAAAG ATTCTAGTAT TGCAAATCTT
AGTATCTCTA ACTCAGGGTT TCTTGATACC GTACTTTTGG ACGTCAATGG CAAGAGTATT
GCAATCAAGC ATACGCAAAT AGATTCTTTG CTTCTCCAAG GGGATTTTTC AGATATAACA
ATAGAGGACT CACAGATCGT GAAGAGTTCT TTAAGGAATC TCAAATTACA ACTGCTGTGG
CTGCTGCATT CTCATATAAA CAACACGCAG GTCCAAGATG GCACGATATC TAAAAGCAAT
TTCTTAGCTA ATACCTTTAT AGACAGCTCA GTGGACAATT TAACGATCGC CAAATCAAGT
TTCACTGAAA ACAATTTTGT TGGAACAAAT GTAGGAAATA TTTCATTTAC AAAGACACTT
TTCACAGAAA AATTTATAGA GGGCGTTTCT TCCAAACTCG CCCAAATGGG AGCAATTGTC
GGATTGTCGA ATTTTGAAAA GCTCATTGCA AGTGGAACGT ATGATTTCAC AGATGTAAAC
TACTCAAATA TCGACTTCAG TAAAATTGAT TTGGGAAAGG TAAATTTCAA GGGAGCAATA
TTAAGGGAAA ACATTTTTAG TGAAAATAAA CTACACGATG TAGACCTTAC GAAAGCTGAT
CTAGAAGGAA GTACCTTCCA TAAATAA
 
Protein sequence
MLSLTIKVIV SSPNWLPMKN LIPFFLVVIA FFAFSDCYAA SPTKKEILEF LKLQNKDHNV 
VLDFKERFGD RLTNVDFSGL DLGKVTFDGM IIENSSFDRA VFTSLTIKNS VVNNSTFYST
IIYKGHIVSS QLDGVSIIES DLNNTQIEKS ELKNVRILKT HAPSLILDDS KLNSIIIRNS
DILEAKFGRT VLAESEISGS DLSNMRMEDS KITDSRLTVS KLINAKMSKS TLTNSTLHGI
ELSGSHMRDT QIFSSEITNS DLHRSRLYNC HLEKVSMLNT DFGYASVEGT SFIKADFSSA
SLEGLHIGSA TFSECCLCNF SSQNITIDSS AITHSSISNV KLYDSEIADT SLKDSSIANL
SISNSGFLDT VLLDVNGKSI AIKHTQIDSL LLQGDFSDIT IEDSQIVKSS LRNLKLQLLW
LLHSHINNTQ VQDGTISKSN FLANTFIDSS VDNLTIAKSS FTENNFVGTN VGNISFTKTL
FTEKFIEGVS SKLAQMGAIV GLSNFEKLIA SGTYDFTDVN YSNIDFSKID LGKVNFKGAI
LRENIFSENK LHDVDLTKAD LEGSTFHK