Gene Apar_1220 details

Gene Information

Locus tag: Apar_1220
Symbol:
ID: 8414099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1368700
End bp: 1369830
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 49%
IMG OID: 645022814
Product: transcriptional regulator, LacI family
Protein accession: YP_003180238
Protein GI: 257785021
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
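
The start, end, gene length, and protein length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1368700
end_bp = 1369830
protein_length_aa = 376

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp, remainder, expected_protein_aa)  # 1131 0 376
```

The result agrees with the record: 1131 bp is 377 codons, and dropping the stop codon leaves 376 amino acids.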


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.485348
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAGAAA AAGTCACTCT AAAAACGATT GCGCGCAAGG TGGGGCTATC ACCCGCTACG 
GTTTCGCTGG TGCTGAATGG CCGACCGGTG CGTGTGAGCG ACGAGAACCG CCGTCGTATC
CTTGATGTTG CCCGTCGTGA GCATTACATT CCCAACCAGA TTGCAAGAAG CCTGGTCACG
CAGCATACGC AGACGTTGGG ACTGATTGTA CCGAATATCG AGAGCCGGTT TTTCTCGTCG
TTTGCAAAGA TGCTGGAGAT GAAATGCCGT CGTCGTGGGT ACGCGCTGTT TATCACCAAC
TCAGATAACA ACACGTCAAA CGATGCAGAT CTTGTGCGAC TGCTGGTTAA TCGCGGCGCC
GATGGCATCT TTATTATCAC GTCCGATGAG GTTGAAGCCT CGAAACAGCT CATTGCTGAC
CTAGAGCAGC TGCCTGTGCC GTACGTTATG GTAGACCGAA CTATTGACGC GCTCGACTGC
GATAAAGTTA CATTTAACAA TGAGCTTGGC GGCTATCTTG CCACAAAATA CTTGCTAGAC
CACGGTCATC GCCGAATTGC ATGCATGGTT AATACCGCAT CTAATACCGG GTGTGCGCGC
CTTAACGGCT ATGTTCGTGC ACTTGGCGAG AAGGGCCTGA AGTTTGATCA GTCACTTGTG
TTGACCAGTG ATTATTACAT TCCTGATGCA TATCTTGCGG CTCAGCAGTT GATTCGCATA
GATGCTACTG CTGTTATTGC GACGTCAGAC AACATTGCTT TGGGTTTGTT GCGCTATCTG
TACGAGCGCG GATTGCATGT TCCTCATGAT TATTCTGTTG TTGGATACGA TAACAGCATT
TCAGACGCAT TGTTTGAACC GGCGTTGACT TCGATTGAGC AAAATGTTGA TGAGCTTTCT
GACGCTGCAC TTTCGATTAT GTTCCGTCGT TTGAATGAGC ATGGAGCGGA TGTTGGTGTA
GATACAAGTG ATAGCGTTGG CGCAGATGCT GGTGCAGGTC TTAGTTTTGC AAGGGGTGAC
GCAGCAGGTC AGGAAAGTGC GGATGAACCG GCGGATAACG GGGATTCAAT TCAGATAATT
CTTGAGCCAC GTATGATTGA GAAGAATAGC GTACGTGTTT TGGACTGCTA G
 
Protein sequence
MAEKVTLKTI ARKVGLSPAT VSLVLNGRPV RVSDENRRRI LDVARREHYI PNQIARSLVT 
QHTQTLGLIV PNIESRFFSS FAKMLEMKCR RRGYALFITN SDNNTSNDAD LVRLLVNRGA
DGIFIITSDE VEASKQLIAD LEQLPVPYVM VDRTIDALDC DKVTFNNELG GYLATKYLLD
HGHRRIACMV NTASNTGCAR LNGYVRALGE KGLKFDQSLV LTSDYYIPDA YLAAQQLIRI
DATAVIATSD NIALGLLRYL YERGLHVPHD YSVVGYDNSI SDALFEPALT SIEQNVDELS
DAALSIMFRR LNEHGADVGV DTSDSVGADA GAGLSFARGD AAGQESADEP ADNGDSIQII
LEPRMIEKNS VRVLDC
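
The gene and protein sequences above are related through translation table 11, which uses the same codon-to-amino-acid assignments as the standard code but allows alternative start codons such as GTG. The following is a minimal sketch of that relationship, not the database's own pipeline. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, and the start codon set is an assumption limited to three common bacterial starts.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in the conventional TCAG codon order (last base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(text):
    """Remove the spaces and line breaks that group residues in blocks of ten."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS, reading a recognized start codon as methionine."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence listed above.
first_row = "GTGGCAGAAA AAGTCACTCT AAAAACGATT GCGCGCAAGG TGGGGCTATC ACCCGCTACG"
print(translate_cds(first_row))  # MAEKVTLKTIARKVGLSPAT
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the listed GC content.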