Gene Apar_0139 details

Gene Information

Locus tag: Apar_0139
Symbol:
ID: 8412985
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 158252
End bp: 159334
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 49%
IMG OID: 645021709
Product: transcriptional regulator, MarR family
Protein accession: YP_003179166
Protein GI: 257783949
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0524] Sugar kinases, ribokinase family
TIGRFAM ID:
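
The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the stop codon; the record does not state either convention, but both are consistent with the values shown. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length, assuming the gene length includes one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(158252, 159334)
print(length, protein_length_aa(length))
```

With the values from this record, the result agrees with the listed Gene Length (1083 bp) and Protein Length (360 aa).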


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTACCA ATCGAGAACA GATGATTTTC AACTGGATTA AAGAGCAGCC TTCCATTACT 
CAGAAGGAGA TTGCAGAACG CGCAGGTATT TCACGTTCTT CTGTTTCGGT TCACATCTCT
AACCTCACCG CAAAGGGCGC TATTTTGGGT CGTCGTTACA TTTTGAGCGA ACGTCCCTAT
TTTATTGTGA TTGGCGCTGC CAACATGGAT ATTGCTGGTC GACCAGAGAC TTCTCTTGTG
GCAGGAGATT CAAATCCTGG CAAAGTCACC ATATCCTTTG GTGGCGTTGG CAGAAATATC
GCACACAACC TGGCCCTTCT CGACAGCGAC GTACGTCTGC TTACCGCCTT TGGCGAGGAC
TATCGTGCCC GTGAGCTTAA AGAGGGCTGC CTAGACTGCG GTATTGACAT CGATGCATCC
ATTACTGTTC CAGGCGCTTC CACTTCTACC TATCTCTTCA TTATGAACGA GCACGGTGAG
ATGCAGGAAG CCATCAATGA TATGCAGATC TACGAGTATG TTACCCCTGA ACGTATCGAA
GAGCGCCTAG ATGTTATCCA GCATGCCGCA GCTTGCGTGA TTGACACTAA CCTGCCTCAA
CAGACCATTG AGTTTATTGC CAAGAACGTG ACCTGCCCTA TTTTCTGCGA TCCCGTTTCT
TCTATTAAAG CGCAAAAACT TAAGAAAGTC CTAGGCAAAC TTTACACGCT CAAACCAAAT
CGCCTTGAGG CTGAAATGCT CTCGGGCATC AAGATTACTG ACGACGCTTC CCTTGAAGCA
GCAGCTCATG AGCTACTTGC AACAGGTCTT AAGCGTGTAT TTATCTCTCT TGGCGAGAAG
GGGCTTCTTT GTGCTGATCA CGAGAAGACC ATCCGCTTAC CACTGCTCCC AGCAAAGCTA
ATCAACATGA CAGGCGCGGG AGATGCCATG ATGGCAGGAA TTACCTGGGC GTATACCCAG
GGTATGACGC TTGAGCAAAC TGGACTGGTA GGTATGGCTG CCTCCAGCAT AGCCATTGAA
GGAGAGGAAA CCATCAACAG TCAGCTCAAC GTTGAAGAGG TTATCAAGCG CGCAGGTATT
TAG
 
Protein sequence
MLTNREQMIF NWIKEQPSIT QKEIAERAGI SRSSVSVHIS NLTAKGAILG RRYILSERPY 
FIVIGAANMD IAGRPETSLV AGDSNPGKVT ISFGGVGRNI AHNLALLDSD VRLLTAFGED
YRARELKEGC LDCGIDIDAS ITVPGASTST YLFIMNEHGE MQEAINDMQI YEYVTPERIE
ERLDVIQHAA ACVIDTNLPQ QTIEFIAKNV TCPIFCDPVS SIKAQKLKKV LGKLYTLKPN
RLEAEMLSGI KITDDASLEA AAHELLATGL KRVFISLGEK GLLCADHEKT IRLPLLPAKL
INMTGAGDAM MAGITWAYTQ GMTLEQTGLV GMAASSIAIE GEETINSQLN VEEVIKRAGI
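
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It strips the spaces that separate the 10-character blocks, reads the DNA in codons, and stops at the first stop codon. It uses the standard codon assignments, which translation table 11 shares for elongation; alternative start codons are not handled. The names `CODON_TABLE`, `clean`, and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order (also used by translation table 11
# for elongation; table 11 differs mainly in the start codons it permits).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGCTTACCA ATCGAGAACA GATGATTTTC AACTGGATTA AAGAGCAGCC TTCCATTACT"
print(translate(first_line))
```

Applied to the first 60 bases, this yields the first 20 residues of the listed protein sequence (MLTNREQMIF NWIKEQPSIT).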