Gene Apar_0531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0531 
Symbol 
ID8413385 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp614647 
End bp616752 
Gene Length2106 bp 
Protein Length701 aa 
Translation table11 
GC content48% 
IMG OID645022104 
Producttranscription termination factor Rho 
Protein accessionYP_003179553 
Protein GI257784336 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000460455 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGTCGGACG AAACATCTTC GGTCCAAAAT GATTCTTTAA ACGATGTAGA AATTACTCCA 
GCACCACGTA GACGTGGTAG ACCTCGCAAG AGTAAAAATA TTAATTCGGA AAATACTACT
TCTGAAAATA ACAATTCTGA TACAACTGAA GAAAAGACAT CTAATTCGTC AGCTGTGAAG
CGTCGTCGTG GTCGTCCTAC TAAAAAGGAG ACTGAAGCTC GTCGTGCAGT AGAGGCAGTT
ATGGTTGAAG CTTTAAATGC AGCTGGCGTT GCAGAGCCTC CTAAGCGTCG TCGCGGTCGT
CCAAGCAAAG AAGAGATGGA GGAGCGCGCT TTACGGGAGC AGACTCTTGA GGCTCTTCAA
GCGGAAGTTT TGGCTCAGGT CCAGGCAGGC GGCAAGGTCT CTTCTGACGT TGTAGATACT
GATCGACTTA CCAAGATTGT TCAGGATGCA ACTCAGCCTT CTGTAGAGGT AACCCCTGTT
CCAGCTTTTA CCAAAGCAGA GGCAGAAAGT ACAGCGGACC AGAATGACAA TGCATTCCAG
AAGAGCATTG TTGAACATCA AGACAGTGCT GAGCAGCAGG AAAGTCCAGT TCAAGAGCAG
ACTTCTGGCG CAGAGGCAGA TGCTGAGGTA GCTCCAACAG GGCGTCTAGG ACGTTCACGT
ACGGGAGTGG CACGTCGCCG GCACCGTAAT GCTGAGCCAA AAGAAGCAGC TCAGGAGGGC
GAGAAGACAC AGGAAAACGC ACACGCTAGC AATCAAGAGC ATTCTTCTCG CGATCGTAGG
CAGAACAACC AACGCAACAA GAATCGTAAG CAAGGTCAGG CAAGCAAACC TTCACTTTCC
AAAGAAATGC TTCAAGAGAT GAAGCTTTCG GAGCTTCGTG CTAAAGCAAC TGAGTTGGGT
ATTGAGTATG CGGGCGTTCA CAAGTCTGAT TTGATTGAGC TTGTGTATGC TGCATCTGCA
GCCGCTGAAG GCTTCAAGAC TGTTGAGGGT ATTCTTCAGA TTAGCAATGA AGGCTACGGC
TGGATTCGTA CAGGTAATTA TATGGAAGGC GATAACGACG CCTTTGTTCA CCAGCAGATT
ATTAGAAATC TTGGTCTGCG TCCTGGCGAT AGCGTTTTGG GTATGGTTGG ACCTGCTCGC
GTAAATAGTA AGTATCCACC ACTACTTCAA GTTACCCAGG TTAATGGTGG GGAAGTTGAG
GGTCTTAAGG ATCGTCCTCG TTTTAAAGAC CTGACACCTA TTTTCCCAAA TCAACCTCTC
ACTATGGAGC ACGGCAAAGA CTCCATTACG GGGCGTGCAA TTGATTTAGT AGCTCCTATC
GGTAAGGGTC AGCGTGGTTT GATTGTCTCG CCTCCAAAGG CAGGTAAGAC CACTATTTTG
AAGCGTATCT GCCAGTCCAT TACCATCAAC AATCCCGAAG TTCATCTCTT CTGCCTTTTG
GTAGACGAGC GTCCTGAGGA AGTCACGGAC ATGGAGCGTT CCATTAAGGG TACTGTCGTT
GCTTCAACTT TTGATATGCC AGCAGAGAAT CACACTGTTG TTTCTGAGCT TGTCATTGAG
CGAGCAAAGC GTCTGGTTGA GATGGGTCAA GACGTTGTAG TTATTCTAGA CTCCATCACA
CGTCTTGCTC GTGCGTATAA CCTTGCCATC CCTGCATCTG GTCGTATTCT GTCTGGCGGT
GTTGACTCCG CTGCTCTGTA TCCACCAAAG AAGTTCTTAG GTGCAGCTCG AAATATCGAA
AGCGGTGGCT CACTCACCAT CATTGCTTCT GCTCTCGTAG ATACGGGCTC CAAGATGGAC
GAGGTTATCT TCGAGGAGTT TAAGGGCACC GGTAATATGG AGCTCAAGCT TGACCGTGAT
CTGGCAGACC GCCGAATCTT TCCAGCTATT GATCCCGTTT CTTCTGGTAC CCGTAATGAG
GATTTGCTTA TTAAGCCAGA GCTTCAGCCT ATGGTTTGGG GCGTTCGTCG TATTCTTGCA
GGCTTTAACA ACACTGAGCG AGCTACTACT GCTCTTATCA GTGGTCTTAA GCAGACCGAC
AATAACCAGG ACTTCTTGAT TAGATCTGCA AAGAAGGCCG CACAGTCTGA CTATCTACAG
AACTAG
 
Protein sequence
MSDETSSVQN DSLNDVEITP APRRRGRPRK SKNINSENTT SENNNSDTTE EKTSNSSAVK 
RRRGRPTKKE TEARRAVEAV MVEALNAAGV AEPPKRRRGR PSKEEMEERA LREQTLEALQ
AEVLAQVQAG GKVSSDVVDT DRLTKIVQDA TQPSVEVTPV PAFTKAEAES TADQNDNAFQ
KSIVEHQDSA EQQESPVQEQ TSGAEADAEV APTGRLGRSR TGVARRRHRN AEPKEAAQEG
EKTQENAHAS NQEHSSRDRR QNNQRNKNRK QGQASKPSLS KEMLQEMKLS ELRAKATELG
IEYAGVHKSD LIELVYAASA AAEGFKTVEG ILQISNEGYG WIRTGNYMEG DNDAFVHQQI
IRNLGLRPGD SVLGMVGPAR VNSKYPPLLQ VTQVNGGEVE GLKDRPRFKD LTPIFPNQPL
TMEHGKDSIT GRAIDLVAPI GKGQRGLIVS PPKAGKTTIL KRICQSITIN NPEVHLFCLL
VDERPEEVTD MERSIKGTVV ASTFDMPAEN HTVVSELVIE RAKRLVEMGQ DVVVILDSIT
RLARAYNLAI PASGRILSGG VDSAALYPPK KFLGAARNIE SGGSLTIIAS ALVDTGSKMD
EVIFEEFKGT GNMELKLDRD LADRRIFPAI DPVSSGTRNE DLLIKPELQP MVWGVRRILA
GFNNTERATT ALISGLKQTD NNQDFLIRSA KKAAQSDYLQ N