Gene Apar_1126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1126 
Symbol 
ID8413999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1276292 
End bp1277464 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content46% 
IMG OID645022715 
Productglycosyl transferase family 2 
Protein accessionYP_003180145 
Protein GI257784928 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.463481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCACA CTCAAAATCA CCAACCCTTG GTTTCTCTTG TTGTGCCTAT ATATAACGTT 
GCAGACTATC TGGAGCAGTG CCTGGCAAGC ATTCAATCAC AAAGCTACAC AAACCTAGAG
ATTATCTGCT TAAACGATGG CTCAACCGAC ACATCTTTAG CTCTTTTGGA AGCATACGCA
GCCCATGATG GACGCATTGT CATCATAGAC AAAGAAAACG AAGGTTACGG AGCAACATGT
AATCGTGGTA TTGCCGCAGC TCACGGCATG TGGGTAGGCA TTGTTGAACC TGACGATTAC
CTTGAGCCAA CTATGGTTCA AGAGCTTATT GATCTTATCC AAAAAAACGG CGGAGAAGAC
CAGGTAGATA TTGCACGTTC TGCGTATTGG CGCGTGTTTG GCAATCAGAA AAATGGTCGA
GCAGGAGCAA AGACACAGAT AAAGAATACT GCTGGTTCTG CCGAATACAG GATTGCCTGC
GCTTATAAAG GCCGCGTCAA ACCTAAGTAT CAACCTTGTT CTATTGATCA GATGTCACAG
CTTCTATTAC ACCATCCTGC CATTTGGACA GCACTCTATC GCAAGAAATT CTTGACCCAG
AACAACATCA ACTTCAGAGA AGTTCCTGGT GCAGGCTGGG TGGACAACCC CTTCCTCATT
GCGTCGCATT GCTGTGGTGC TCGTCTGGTG TATACAGACT CAGCACTTTA CAACTATCGC
GAGAATGGCT ATGCAGAAGC TGTCGCTTTT GCGCAGCGTC AGCCCAAAAT TCCGCTTGAA
CGTTGGAATG ACATGATGGA CGTTGTGGAC ACACGCAACA TCACCTCAAA CGTAGTGCTC
AATGCACTCA CGCTGCGTGG CATAAATTAC GCACTGCTTA CAAAAGACGC ACTTATCTGG
CGAGAAAAAC ATGATGCTGC AGGTGAGATT GACTCAGAGG CACACGATCT TCTCGCCAAG
AGCTTTGAAC GTATGGACGC AAAACGTGTT ATTGAAAACC CAGCAATCCC GGGTTCTGGG
AAAGCTTTCT TTGCGCAGAT ACGTGGTATT GCCCTACCAA AAGAAGACAA ATTTGCTCGT
TATGCTTATC TAGCCAAAGA AGGGTTCTTC CGACTCAAAA ATGATGGCAT AGTACAGACG
CTTAAATCGC TTACAGACAG ACGCGAAAGT TAA
 
Protein sequence
MSHTQNHQPL VSLVVPIYNV ADYLEQCLAS IQSQSYTNLE IICLNDGSTD TSLALLEAYA 
AHDGRIVIID KENEGYGATC NRGIAAAHGM WVGIVEPDDY LEPTMVQELI DLIQKNGGED
QVDIARSAYW RVFGNQKNGR AGAKTQIKNT AGSAEYRIAC AYKGRVKPKY QPCSIDQMSQ
LLLHHPAIWT ALYRKKFLTQ NNINFREVPG AGWVDNPFLI ASHCCGARLV YTDSALYNYR
ENGYAEAVAF AQRQPKIPLE RWNDMMDVVD TRNITSNVVL NALTLRGINY ALLTKDALIW
REKHDAAGEI DSEAHDLLAK SFERMDAKRV IENPAIPGSG KAFFAQIRGI ALPKEDKFAR
YAYLAKEGFF RLKNDGIVQT LKSLTDRRES