Gene Apar_0835 details


Gene Information

Locus tag: Apar_0835
Symbol:
ID: 8413701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 924919
End bp: 926226
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 39%
IMG OID: 645022418
Product: glycosyl transferase family 2
Protein accession: YP_003179855
Protein GI: 257784638
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
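
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record: it assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the stated gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 924919
end_bp = 926226
gene_length_bp = 1308
protein_length_aa = 435

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and is not counted as a residue.
expected_protein_aa = codons - 1

print(span_bp, remainder, expected_protein_aa)  # prints: 1308 0 435
```

Under those assumptions, the span, gene length, and protein length reported in the record are mutually consistent.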


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.445477
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00049802
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCATTGC AATTTAACAG ACTCGGTATC ACACCAATCG TTGTTTTTAA TTTTATCATC 
TGGCTTTTCT TTACGCTCGC ATACTTCTAT CAGATTGTGT ATATCCTTCG CGTTATGTTT
AAAGGCGAGG TAAAGCTTCC CGAGGCAAAA AAGCAGCATC GCTATGCTTT CTTTATTGCT
GCCCACAATG AAGAGCCTGT TATTGGCAAT CTTGTCAGAT CTATTCTTTC TCAAGATTAT
CCTCGGGAGC TGATGGATGT CTTTGTTGTT GCAGATGCCT GTACTGATAA AACAGCAGAA
GAGGCAAGAA AAGCTGGAGC AATTACTTGG GAGCGTAATG ACCTTGCTCG TAAGGGTAAG
AGCTGGGTTA TGGATTACGG CTTTGATCGC ATTCTTAATG AGTACGGTGA CAAATACGAG
GCTTTCATTG TTATGGACGC CGATAACCTG GTTTCTCCAA GCTATCTTAA AATTATGAAT
CAAGCTTTTG ATGCAGGGTA TCTTGTGTGC ACCAGCTATA GAAACTCAAA GAATTTTGAT
TCTAGTTGGG TTAGTTCTGC CTATGCTACA TGGTTTATGC GTGAAGCAAA GTTTTTGAAT
AATGCTCGTA TGATGATGGG TACAAGCTGT GCGGTTTCTG GTTCGGGTTG GATGGTTTCT
TCTCGCATTA TTAAAGGCAT GCATGGATGG GATTTTCATA CATTGACTGA AGATATTCAG
TTTTCTACGT TTTGCTGTGC TCACAACATT CAAATTGGTT ATGCTCCAGC AGAATTTTTT
GATGAACAGC CTTTGACATT TAAAGCTTCA TGGACTCAAA GAATGCGCTG GACAAAAGGA
TTTTATCAGG TATTTTTCTC GTATGGCTTT GACCTACTTA AAGGTATTTT CAAGGGTCAG
TTTGCTTCAT ACGATATGCT TATGACAATT GCACCAGGTA TGATTTTGTC GTTGCTTTCT
GCATTTATTA ATGGAACTTA TCTTTTGGTT GGTTACTTGA GCCACGGCTT TGTTGCAACT
GATGCCGAGA TTGCTATGAG TGTGGGTTCT TTGGTTATGA CGGTTTTCTC GATGTATGTT
GTCTTCTTTA TTCTGGCGCT CATCACTACT ATTTCAGAGT ACAAGCATTT CCATGTAAAG
AAAAAGTGGC GTATTTTTAC CAATCTCTTT ACGTTTCCTA TTTTTATGAT GACGTATATT
CCTATTACCG TTGCAGCTTT GTTCAAAAAA GTTGAGTGGG TTCCTACTAA ACATGACATT
GCTGTTAACT TTGAGGATGT TATTGCTTCA AGTGGGAGTT CAAATTAA
 
Protein sequence
MPLQFNRLGI TPIVVFNFII WLFFTLAYFY QIVYILRVMF KGEVKLPEAK KQHRYAFFIA 
AHNEEPVIGN LVRSILSQDY PRELMDVFVV ADACTDKTAE EARKAGAITW ERNDLARKGK
SWVMDYGFDR ILNEYGDKYE AFIVMDADNL VSPSYLKIMN QAFDAGYLVC TSYRNSKNFD
SSWVSSAYAT WFMREAKFLN NARMMMGTSC AVSGSGWMVS SRIIKGMHGW DFHTLTEDIQ
FSTFCCAHNI QIGYAPAEFF DEQPLTFKAS WTQRMRWTKG FYQVFFSYGF DLLKGIFKGQ
FASYDMLMTI APGMILSLLS AFINGTYLLV GYLSHGFVAT DAEIAMSVGS LVMTVFSMYV
VFFILALITT ISEYKHFHVK KKWRIFTNLF TFPIFMMTYI PITVAALFKK VEWVPTKHDI
AVNFEDVIAS SGSSN
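
The gene and protein sequences above can be related by a plain codon-by-codon translation. The following is a minimal sketch, not the database's own method: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and it relies on the fact that NCBI translation table 11 uses the same sense and stop codon assignments as the standard code (the two differ only in which codons may act as starts). Ambiguous bases such as N are not handled and would raise a `KeyError`.

```python
from itertools import product

BASES = "TCAG"
# Codon assignments in TCAG order (first base varies slowest).
# Table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last display lines of the gene sequence in this record.
first_line = "ATGCCATTGC AATTTAACAG ACTCGGTATC ACACCAATCG TTGTTTTTAA TTTTATCATC"
last_line = "GCTGTTAACT TTGAGGATGT TATTGCTTCA AGTGGGAGTT CAAATTAA"

print(translate(first_line))  # prints: MPLQFNRLGITPIVVFNFII
print(translate(last_line))   # prints: AVNFEDVIASSGSSN
```

Both outputs match the corresponding stretches of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.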