Gene Apar_0129 details

Gene Information

Locus tag: Apar_0129
Symbol:
ID: 8412974
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 146099
End bp: 147838
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 48%
IMG OID: 645021698
Product: UbiC transcription regulator-associated domain protein
Protein accession: YP_003179156
Protein GI: 257783939
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3959] Transketolase, N-terminal subunit
TIGRFAM ID:
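
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the gene record above.
start_bp = 146099
end_bp = 147838
gene_length_bp = 1740
protein_length_aa = 579


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def residues_from_cds(length_bp):
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    return length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert residues_from_cds(gene_length_bp) == protein_length_aa
```

Under these assumptions the 1740 bp span corresponds to 580 codons, of which 579 encode amino acids.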


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.625714
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCATTTG AGACAGGAGT TTTTCCGGTT TTAGTTTCTG AGGATGAAGC TGTAGAGTCA 
CTGCCTCCAC AGGTTAGGGG AGGGCTCAAT GCGTCATCTG CTACGCCTCT TCATGTTCAG
CTGTCAGATT TAATGCGCGT TAAAATACTG TCAGAAGACT GGAAGGCAGG AACGTATATT
CCTTCTGAGG CCGAATTTAT GGCGCAGTAT GGTGTTAGTC GAGGCACTAT CAGAAAAGCC
ATTCAATCTC TGGTTAAAGA AGGTTTGCTG CTTACTCAAA AGGGTCGAGC AACACAAGTT
ATTTCTAACA CCGTTCGACA TGCAGCAGGA AATACCGTGC TTTCATTTGC CGCAGCGCTT
AGAGATGGTG GATTTGAATA TCGAACCGAA GTCTTGTTTA AGCAGGTGGT TCCTGCAGAT
CAAGCTGTTG CCGAGCATCT AGAGATTCCC ATCGGCTCTG ACGTGCTCTT TTTGCGTCGT
GTCAGGAGCG TTTCGGATAG ACCTGTGGTA TGCCAGGAAT CATGGTCTAA CCTGCTTGTT
TGTCCACAGC TCGAAGAGGC GGACTTTGAG AATGAGTCGC TCTTTGATGC AGTTGAAAGA
ATATCCAAAA AAGAAATTGC ACGTTCTCGC ATGCGCTATC AGTCGCAAAT TGCGGATAAA
GACCACGCAG ATTACCTGCA GTGTTCTCCA AATGAGGCTT TGCTTGTACT TGAGCAGGTA
ATTGAGCTGT CCGACGGCAA CTGCATTGAG TGGAGCCAGA CTTGGCTTGC TCCACACCAG
AGTGTTGTTG GTGTATCAGA GCAGGTAGAT GGCTCTATTG GACCTCTGGA CATTTCTTCG
GTCCGCCAGT CAGAGCACTC TACCTCCAAT ATTGTTGCTT TAAAAAATGA TATCAAACAG
CGCACGCAGC TTGAACTTGA TTTGCGTCAC GAAGCACTGT CAGTACGTCG AGGCATCGTT
GAGCTGGCGC ATCGCTACAG TTCAACGCCA TTTCACATTG GCGGTGCATG TTCTGTAGCA
GACATAGTCT CCGTACTCTT GAGTAAGGTT ATGCAGGTAG GAAATAGGGA TTGCGAGTGG
GAACTCAGAG ACCGCCTGAT TCTGTCCAAG GCTCATACGT CGCTGGCGCT ATTCCCAGCA
CTTCTACGTG CAGGAATGAT CTCCCAAGAG GATATTGATC GCGGAGTTTT TGGACCTGAT
GCCGTCCTTT TTAAACATCC TCTGCGAGAT TCTCAACGTG GTTTTGAAAT CTCTGGTGGC
AGTCTTGGCA TGGGCCTGGG CTACGCCGCT GGGCTAGGGT TAAGTTTTCG CAGGAAAGAT
CTTCCTTCTC GCGTTTTTTG TATTTTGGGC GACGGTGAGT GCGATGAGGG TTCCATTTGG
GAGAGTGCAG CGTTTATCGG TCACAACCAG CTCTCCAACG TGACGGTGAT TGTTGATCAG
AATCGTATGC AGCTGGATGG CCCTTGCGCG TCCATTTTGG ATACCGGATC AATTGCAAGA
AAGTTTGATG CGTTTGGCTT TGAATCCATT GAGGTAGATG GTCATGATGT GCTTGCATTG
TACGACGCGC TGAAAGAGAA AGCTTCCAAG CCTCGCGTGA TTATTGCACA CACCATTAAA
GGTAAAGGGT TCTCGTTTGC CGAGAATAAC GTCTCGTTCC ATGATGCATG TGTGACGGAT
GACCTCTATG AACAGGCATT GTCTGATCTG AAAGTTGCAG AGGAGGCGTG TTCATGCTAG
 
Protein sequence
MSFETGVFPV LVSEDEAVES LPPQVRGGLN ASSATPLHVQ LSDLMRVKIL SEDWKAGTYI 
PSEAEFMAQY GVSRGTIRKA IQSLVKEGLL LTQKGRATQV ISNTVRHAAG NTVLSFAAAL
RDGGFEYRTE VLFKQVVPAD QAVAEHLEIP IGSDVLFLRR VRSVSDRPVV CQESWSNLLV
CPQLEEADFE NESLFDAVER ISKKEIARSR MRYQSQIADK DHADYLQCSP NEALLVLEQV
IELSDGNCIE WSQTWLAPHQ SVVGVSEQVD GSIGPLDISS VRQSEHSTSN IVALKNDIKQ
RTQLELDLRH EALSVRRGIV ELAHRYSSTP FHIGGACSVA DIVSVLLSKV MQVGNRDCEW
ELRDRLILSK AHTSLALFPA LLRAGMISQE DIDRGVFGPD AVLFKHPLRD SQRGFEISGG
SLGMGLGYAA GLGLSFRRKD LPSRVFCILG DGECDEGSIW ESAAFIGHNQ LSNVTVIVDQ
NRMQLDGPCA SILDTGSIAR KFDAFGFESI EVDGHDVLAL YDALKEKASK PRVIIAHTIK
GKGFSFAENN VSFHDACVTD DLYEQALSDL KVAEEACSC