Gene Apar_0817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0817 
Symbol 
ID8413682 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp898343 
End bp899371 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content46% 
IMG OID645022399 
Producttranscriptional regulator, LacI family 
Protein accessionYP_003179837 
Protein GI257784620 
COG category[K] Transcription 
COG ID[COG1609] Transcriptional regulators 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.533751 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.134467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTCGCT CGCATAGTTC TAGCTCAAAT AAACGCTCTG TTTCAATGGC TGATGTTGCA 
CAAGTTGCTG GCGTTTCTCA GCAGACCGTC TCTCGTGTTG CTAATGGTGC CCAGAACGTT
AGCAAAGCAA CGCGTGAAAA AGTTCAAGCC GCAATGGAGT CTATGGGCTT TAGGCCAAGC
TTTGCTGGTA GGTCATTGAG GTCTGGCTTG TATCAATCAG TGGGACTTTG TCTGTATGAC
ATTCGCGAGT TTGGTAACTT AGCTACTCTC GACGGCATTG TTTCGGCTGC TCGTGATCAT
GAATATGCAA TTACGATGAT TGAGAAGGGT TCTGGCGACG GCTTATGCCT TCAGGATATT
TCTCATCGCA TGTCTAATCT TCCCGTTGAT GGCATGATTA TTAGTATGAG TCTTATGGCG
TCAGACTTTG AATCTTTTGT ACCACAACCA GGTCTTGGAA CAGTTCTTCT TACCATGCAT
GAGCATCCTT ACTGTACCAC TGTTGATTCT GATCAGTATG GCTGCTCAAA GCTTGTCATT
GACCATCTCT TTGAACTTGG GCATCGCAAA ATCCGTTTTG TAGCAGGTCC CTCATACTCT
ATTGACTCAC AATTTCGCGA GAAGGGCTGG CGAGATGCAA TGTCTGAGTA TGGGTTGGAA
ATTGTCGAGC CATTTGCTGG TGACTGGACT GCTAATAGTG GCTATGAAAT TGGTAAAAAG
TTGCGAGAAA ATCGCGATTA TACGGCAGTG TATGTTGCAA ACGATCAGAT GGCACTTGGT
GTCATTGCGG CATTTGAAGA AGTTGGACTG AGCGTTCCAG ATGATGTCAG CGTTGTTGGT
GTTGACGACT CTCTTGAAAA TTATTTGCCT AACTTCTCAT TAACCACAGT TCGCTTTAAC
CTACTAGAGC GCGGACGTGT TGCACTTGAG CATGCAATTC GTGCATCTGA GCCTGGATAT
AAACCCGAAG CAATCAGAAT TGCTCCAAAG CTCATTGTTC GTACTACCAC AGCAGCACCA
CAGAAGTAG
 
Protein sequence
MTRSHSSSSN KRSVSMADVA QVAGVSQQTV SRVANGAQNV SKATREKVQA AMESMGFRPS 
FAGRSLRSGL YQSVGLCLYD IREFGNLATL DGIVSAARDH EYAITMIEKG SGDGLCLQDI
SHRMSNLPVD GMIISMSLMA SDFESFVPQP GLGTVLLTMH EHPYCTTVDS DQYGCSKLVI
DHLFELGHRK IRFVAGPSYS IDSQFREKGW RDAMSEYGLE IVEPFAGDWT ANSGYEIGKK
LRENRDYTAV YVANDQMALG VIAAFEEVGL SVPDDVSVVG VDDSLENYLP NFSLTTVRFN
LLERGRVALE HAIRASEPGY KPEAIRIAPK LIVRTTTAAP QK