Gene NSE_0167 details


Gene Information

Locus tag: NSE_0167
Symbol: dnaE
ID: 3932104
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Neorickettsia sennetsu str. Miyayama
Kingdom: Bacteria
Replicon accession: NC_007798
Strand:
Start bp: 134937
End bp: 138170
Gene Length: 3234 bp
Protein Length: 1077 aa
Translation table: 11
GC content: 41%
IMG OID: 637900323
Product: DNA polymerase III, alpha subunit
Protein accession: YP_506064
Protein GI: 88608539
COG category: [L] Replication, recombination and repair
COG ID: [COG0587] DNA polymerase III, alpha subunit
TIGRFAM ID: [TIGR00594] DNA-directed DNA polymerase III (polc)
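
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that contributes no residue; the function names are illustrative only and do not come from the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(134937, 138170)
print(length, protein_length_aa(length))  # prints "3234 1077"
```

Both values match the Gene Length and Protein Length listed in the record.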


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGCGGC AGACATTCGT ACATCTTAGA GTACGAAGCG ATTATTCGCT CTTGAGGGCA 
GTAACCAAAC CTAGGAGAAT TGTTGATCTG GCCCGTGCTG CTGGTATGCC AGCTGTGGCG
CTCACTGATA TTGATACAAT GGCAGGGGCG CTTGAGTTTG CTGAGTATGC TAAGGCTTCT
GGTATTCAGC CGATTATTGG GATTGATCTT TCGATTTCAC ATAATCGTGT TGATTCACGT
GTACTCTTGA TTGCAAAAAA CGAAGAGGGC TATCACAACA TGATTAAGCT TTCTGGGATA
GTTTCCTTGC GGAGAGTTGC GCTGAATGAG GTTTTCACAC ATAGCAAGGG TCTAATTCTT
CTGCTAGGAG AGTTCTTTAT TGATGCTATC AACAGTTACT CGATTGATGC AGAAACTTTG
GTATCCGATC TAAAGCAAGT TTTTGGTAAG GACCTTTTTG TCGAAATACA ACGTTTGGGT
AATCAATCTG CAGAGGAAAA ACTTATTCAA CTTGGATATG AGCATAATAT TCCTTTTGTT
GCTACGAACG ATGTTCTTTT TCCAGATACG GATTTCAGGG AGGCGCATGA CGTATTGACT
TGCATAGGAA GCCTTACGTA TGTCAACGAT ATAAACCGCG TACATTACTC TCCAGAATCG
TATTTTAAAA GTGCAGAGGA AATGATTGCA CTTTTTGCTG ATTTGAAGGA GGCGCTGCAC
AATACTGTCT TGATAGCGCG TCGTTGTAGC TTTATGCCAG AAGTGAGAAG TCCTATTCTG
CCCAAGTTTG AATGCGAAAT GGACGAGAGC GATGAGTTGA GGCGTCTAAG TTACATTGGC
TTACACAGAA GGATGGGGGC TGATGTTCCA CAAAATTATT TGGACAGGCT TGAGTACGAG
CTTGGTATTA TCATTGACAT GAATTATGCT GGCTACTTTT TGATAGTGGC TGATTTTATA
CGTTGGAGTA AGGCCAACGG CATCGTCGTT GGTCCTGGAA GAGGATCTGG AGTCGGATCA
ATAGTTGCCT GGTGCCTCGG TATTACCGGT TTAGATCCAC TGAAATTTGC CCTTTTTTTT
GAGAGGTTTT TGAATCCGGT TCGTGTATCG ATGCCGGATT TTGATGTGGA CTTCTGTCAG
GAAAAGCGCC ATCTTGTAAT AGAGTACGTG AGAAAAAAAT ATGGTCACGT TGCTCAAATA
ATGACTTTTG GTACCCTCCA GCCCAGAGCT GCGTTGCGAG ATGTTGGAAG GGTGCTCCAG
CTACCGTATG GTCGAGTCGA TAGGATCTGT AAAATGATTC CTAACAATCC CGCTAATCCG
ATTTCTCTCC AGGAAGCAAT AAATCTTGAC AAAGATCTCC AAAGAGAAAG TGAAGATGAT
GAATCTATAG CAAAGTTGCT TGATCTTGCT CTTAAGTTGG AAGGAACGCT AAGACATACC
TCTACCCATG CTGCTGGAAT AGTTATTAGT GATACTCCGA TAGAGGATTA TCTTCCCGTT
TACCACGACA AGGAGTCAGA TATTCCAGTT ACCCAATATT CGATGAAATA TGTTGAGAAA
GCAGGATTAA TAAAGTTTGA TTTTCTTGGT TTGAAAACGT TAACAGTGAT CAATCAGGCG
TGCTTGTTGG TACGCTTAAA AAATCCCAAT TTTGATATAG ATTCTATTCC TCTTGACGAT
AAGAGAACGT ATGACCTTCT TTCAGCAGGA AATGCGATAG GTGTATTCCA GTTGGATAAC
GCATACATGT GTGAGACGCT AAAACGTCTG CACCCGGACT GTTTTGAAGA TATAATTGCC
CTGATCTCGC TTAATAGGCC TGGTCCTATG GCGAATATTC CGACATATAT AGCAAGAAAA
CATGGAAAAG AGGCAGTCAA ATATCCACAT CCGCTACTTG AGTCTTCATT GAAGGAAACT
TTTGGTGTTG TCATCTATCA AGAGCAGGTA ATGGAAATGG TTAGGCTTCT TGCTGGGTAT
ACACTAGCCG AAGCAGATAT TCTCCGACGT GTTATGGGTA AGAAAATACG CGCGGAAATG
TCAGAGCAAG CCGAGAAATT TGTCGAGGGT GCAAAACGTA ACGGAATCGA GACAGAAAGA
GCTCAAGAAA TTTTCGAGAT GATAGAAAAA TTTGCTGGCT ATGGTTTTAA TAAGTCGCAT
GCTGCTGCAT ACGCGCTAAT CTCATATCAA ACAGCCTTCC TCAAGGCTAA TTTTCCAATC
GAATTTACCA CGGCCGCTCT TAACCTTGAA TTGCACCATA CAGACAAGCT CGCAATCCTA
ATACAAGATG CAAAAAATTA TGGTATCACA ATTCTGCCTC CAGATATAAA TAAGTCAAAA
ATTTTATTCT CTATTGAAGA TAATGCGATA AGGTACTCGC TTGCAGCTTT GAAGAATGTT
GGAGAAGCTG CAGCTTCAGC AATACAGAGA AGAAGTCCGT TCGCAACAAT TCCAGAACTT
CAAAGGTGCT TGGATGGAAA AATAGTTCAT AGAAAGGCTG TTGAAAGTCT CATAAAGTCA
GGTGCTACGG ATGTACTTTC TCCTGATAGG AGTGGACTTT ATTCTTGTCT GAAGGAATTA
GTGGGAAAAG AGGATGATGG TGCACAGATG ACATTATTTG ATATCTCTGT GAACAAACCG
GAAAAGACCG TAAAGCCCTG GAATTTTTTT GAAAGGACAC AGTGTGAATT TGATGCCTTT
GGCTTTTTTC TTTTTGATCA CCCTCTCTTT CCATATCGAA AGTTTTTAAA GCTTTCTCCG
AATCAGATTG CTGGTGTGAT AACTGAACTA AGGATAAGGT CCAGAGGGGA CAGAAAGTTT
GCTGTGATGC ATGTTTCAAC AGTAAACGAC ATATACACTG TTATGTTTTA CGAGTCCGAC
GTTATAGATT CCAGACGTGA ACTTTTTGTA GTTGGTGTGA AAGTCGTACT GACTCTCTTA
AAGAGTGATA ATGGCTATGT ATGTACCAAT TTAGCTGAAT TACACGAATT TATCTTTTCG
AATTATAACG GAAGATTTGC AATTCTCGTA AATCGCAAAG AACAGGTCAC TGCACTGAAG
GATGTTCTAA GGCGTGGTGG TAAGTATGAG GTTACACTCG TTGTGCGTAA CGATTCTCAA
TACACGAGAA TTGTTCTAGG ACATGACTTT GATCTTGCAG TTGACAAGCT TGCATTAATT
GAAGGTGTGG AGATTCTCAA GTACAGGGTA TGCTGCAGCG ATATGATCAA TTAG
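
The gene sequence above is printed in space-separated blocks of 10 bases, 60 per row, so the whitespace has to be stripped before any computation. The following is a minimal sketch of that cleanup and of a GC-content calculation of the kind summarized by the GC content field; the helper names are hypothetical, and only the first row of the sequence is used as sample input, so the printed fraction describes that row rather than the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "ATGCGGCGGC AGACATTCGT ACATCTTAGA GTACGAAGCG ATTATTCGCT CTTGAGGGCA"
print(f"{len(clean_sequence(first_row))} bp, GC = {gc_content(first_row):.0%}")
```

Passing the full gene sequence to `gc_content` in the same way gives the gene-wide value.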
 
Protein sequence
MRRQTFVHLR VRSDYSLLRA VTKPRRIVDL ARAAGMPAVA LTDIDTMAGA LEFAEYAKAS 
GIQPIIGIDL SISHNRVDSR VLLIAKNEEG YHNMIKLSGI VSLRRVALNE VFTHSKGLIL
LLGEFFIDAI NSYSIDAETL VSDLKQVFGK DLFVEIQRLG NQSAEEKLIQ LGYEHNIPFV
ATNDVLFPDT DFREAHDVLT CIGSLTYVND INRVHYSPES YFKSAEEMIA LFADLKEALH
NTVLIARRCS FMPEVRSPIL PKFECEMDES DELRRLSYIG LHRRMGADVP QNYLDRLEYE
LGIIIDMNYA GYFLIVADFI RWSKANGIVV GPGRGSGVGS IVAWCLGITG LDPLKFALFF
ERFLNPVRVS MPDFDVDFCQ EKRHLVIEYV RKKYGHVAQI MTFGTLQPRA ALRDVGRVLQ
LPYGRVDRIC KMIPNNPANP ISLQEAINLD KDLQRESEDD ESIAKLLDLA LKLEGTLRHT
STHAAGIVIS DTPIEDYLPV YHDKESDIPV TQYSMKYVEK AGLIKFDFLG LKTLTVINQA
CLLVRLKNPN FDIDSIPLDD KRTYDLLSAG NAIGVFQLDN AYMCETLKRL HPDCFEDIIA
LISLNRPGPM ANIPTYIARK HGKEAVKYPH PLLESSLKET FGVVIYQEQV MEMVRLLAGY
TLAEADILRR VMGKKIRAEM SEQAEKFVEG AKRNGIETER AQEIFEMIEK FAGYGFNKSH
AAAYALISYQ TAFLKANFPI EFTTAALNLE LHHTDKLAIL IQDAKNYGIT ILPPDINKSK
ILFSIEDNAI RYSLAALKNV GEAAASAIQR RSPFATIPEL QRCLDGKIVH RKAVESLIKS
GATDVLSPDR SGLYSCLKEL VGKEDDGAQM TLFDISVNKP EKTVKPWNFF ERTQCEFDAF
GFFLFDHPLF PYRKFLKLSP NQIAGVITEL RIRSRGDRKF AVMHVSTVND IYTVMFYESD
VIDSRRELFV VGVKVVLTLL KSDNGYVCTN LAELHEFIFS NYNGRFAILV NRKEQVTALK
DVLRRGGKYE VTLVVRNDSQ YTRIVLGHDF DLAVDKLALI EGVEILKYRV CCSDMIN
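
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below shows the idea in plain Python without external libraries. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons; alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The function and constant names are illustrative, not part of any database tool.

```python
from itertools import product

# Codon table in the NCBI layout: bases ordered T, C, A, G, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks codons with ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in the record.
print(translate("ATGCGGCGGC AGACATTCGT ACATCTTAGA"))  # prints "MRRQTFVHLR"
print(translate("GATATGATCAATTAG"))                   # prints "DMIN"
```

The two outputs match the first ten and last four residues of the protein sequence listed above.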