Gene Apar_1112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1112 
Symbol 
ID8413985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1257901 
End bp1260945 
Gene Length3045 bp 
Protein Length1014 aa 
Translation table11 
GC content43% 
IMG OID645022701 
Productglycosyl transferase family 8 
Protein accessionYP_003180131 
Protein GI257784914 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1442] Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAAGG TTTCTTTTGT AATTCCTGCA TACAACATTG AATCGTACAT TGGGCGTTGT 
ATTCAAAGTG TAAAGAATCA GACGTTTGGT GATTTTGAAG CAATTATTGT TGACGACGCC
TCAACAGATT CCACTCCAGA GAAAATTGTT ACTGCAGTAG GGGATGACAA AAGGTTCAAA
GTTGTCACTC ATGCAACTAA CCAGGGACTT CATCTTGCAC GTAAGACTGC TGCAGCGTAT
ACAAAAGGAG AGTGGGTCTT TTGTTTAGAC GGTGATGATG AGGTTACTCC TGATTTTCTT
GAGCAGGTGG TTGGTCGTAT TGAACAGAAT CCTGTTGATA TTCTCCACTT GGGTATTACC
GTCATTCCAG AAAACGGTGT AAGTGAGGCT GAGGCAGAAG GTTTTGGTAG CTTCATCAAC
CAGCAGTCTC ACTTTACTCA AGGTGATGAA GTTCTCCGCA CCATTTTTGA TGAGAGCTAT
GGACAGAAGA TTGATTGGCG TACTACGCAG CGTCTGTATC GCGGAGAACT CTTTCGTTCT
GCCTTTGCAG AGATGACTTC GGAGCGCTTG GTAAGAGCAG AAGATGCCTA TGAGGTATTT
GTGCTTTCTG ATAAGGCTCA GACTGCTGAT GGATTTGAGT CTTGTAGAGG TCTTCTTTAC
CACTTTGGTA TTGGTGTAAC AGGAGTTTCT CGCATTTCTT TAGATAAGTT TGGAGAGTTC
TGCTATCAGT TCTTAGACAA TATTGAACAG ACAGAGTTCT ATATTGGCAA AACAGACAAT
GTTGTTTACC TTAGGTCTTT TGAGGGCATG AAGCACAAGT TGATGGAGCT TTTGATTGGT
GACTGGAAGT CTCGTCTTGC TCCTGAAGAT CAAGAAGCAG CTCTTGAGCC GTTTGCTATT
TTATTTGGAC CATCAGTGGC TGCTCGAGAG CTTTATCGCT TTGTAAGAGA TGACGCTTAC
GAGGCCATTA AAACTTGTAC TGAGCTTCCA GAAGACAGCA ATGCATATCT CTATAGATCG
TATGCAAAGA AATATGCTGC TCTCATGCAT CAGGATGAGG GACTGTCGTT TGATAGGGCA
GTTCGCATGA AACAGATTGC CGATGAACAT ATGGAATATC TAGAAAGAAA GCACATGGTA
AAGATGTTTG AGCAGCAGCC TATTCGCATT TTTATTACCT CTCATAAAGA TGTTGATGTT
CCAGAAAGTA ATTATCTTCA GCCTATTCAA GTAGGACCAG GTCAGAAGAC AAATCGCTTT
TCGTATATGC TTCATGATGA TGAGGGCGAT AACATTACCG AAAAAAACCC AATGTACTGT
GAGATGACTA CACAATACTG GGCATGGAAG AATATTACTA ATGAGCGCTA TGTTGGCTTT
GGTCATTATC GTCGTTACTT CAACTTTACC GATACGATTT ATCCAGAAAA TCCTTTTGGT
GAGGTTATGG ATGATTTTAT CGATGAGGAT GCCATCAAGA AATATGGTCT TGATGATCAG
ACCATTGCTC AGTGTATTGA AGGATATGAT CTCATTACCA CTGGGGTAAA AGATATTCGT
AAATTCCCTG GAAGCGCCAA TACACCACTC GAGCAGTACC ATGCTGCTCC ATTGCTGCAT
CCAAAGGATA TGGATACTAT GGCGGCGCTT ATTGTTGAGC GTCATCCAGA GTATGCAGAG
GATGTAAACG CTTTTCTTAA TGGTTATGAA CAGTGTTTCT GTAACATGTA TATCATGCGC
AGGGAGCTTT TTGATCGCTA TGCAGCATGG GTATTCCCAC TTGTTGACGA GTGGACTGCT
CGTACTGATA TGTCAACCTA CAGTAAAGAG GCTCTAAGAA CTCCAGGTCA CCTAACTGAG
CGTCTCTTTA ATATCTGGCG TATGCACATG CTGCGCACAG AGGGTAAAAA CTGGAAGGTA
AAAGAGCTAC AGTGTGTTCA CTTTACTAAT CCAGAGCCTC GTCAGAAGTT TATTCCTCTC
TTTGAAGAGA AGCCTGAGAT TGCAAGTCAG AACGTTGTTC CTGTTGTTTT TGCAGCAGAT
AACAACTACG TTCCAATTCT TACTTGTGCA ATGGGTTCAA TGCTTGAGAA TGCAGATCCT
AACCGGTATT ACGACGTAGT TGTCCTTAAT ACCAATATTG GCGGATCAAA GCAGGAATTG
GTTAAGAAGT TCTTCTCACG CTATAAGAAT GCTCGCATCA CGTTCTATAA CGTGTGGCGT
ATGGTTAAAG ACTATAAATT AGATACCAAT AACGCGCATA TTAGCGTTGA GACATACTTC
CGTTTCTTGG CCCAAGATAT CCTTTCTGCT TACGATAAGG TTGTCTATCT TGACTCTGAC
CTTGTGGTTA ATGGCAATGT TGCTGAACTT TACGATGTAA GAATAGGCAA CAATCTTATT
GCTGCAACGC TTGATATTGA CTATCTAGCA AACCTCAATA TTCGCGGTGG AGACCGCATG
AAGTACAGCC TTGACGTGCT TAACCTCAAA AATCCTTATG CTTATTTCCA GGCGGGAGTT
ATGGTTTTTA ATACCGCTGA ACTGCGCCGT TACCACACTG TTCCAGAGTG GTTGCGTATT
GCATCTAATC CAATCTTTAT TTATAACGAT CAAGATATTC TGAATAGCGA GTGTCAAGGT
CGAGTGCTAT ATCTTCCTGC CGATTGGAAC GTTACGCATA ATATTTTTGG TCGTGCAGAG
GAACTCTATC CAATGGCACC AAACAGTGTT TTTGATGATT ATCAAGCAGC ACGTCGAGCA
CCAAAGATTG TTCACTTTGC TGGCGCCATT AAACCTTGGC AGAATGCCAG CTGTGATATG
GCTTCCTACT TCTGGAAGTA TGCACGCAAT ACCCCGTTCT ATGAGGTCAT TATTCAGGAT
ATGGTTCCAA GCGCCAGAAA TGACGCGGAC GTTACAGAGT TCCATGAGCG TGCACTTTCT
GATGCAAGTC CTCTGCGTAA GATTATTGAC CCTATTGCAC CGTATGGCAG CGCAAGGCGA
GAAGCTCTTA AGGCCCTTGG TAGAACCTTA AGAGGTCGCA AATAA
 
Protein sequence
MPKVSFVIPA YNIESYIGRC IQSVKNQTFG DFEAIIVDDA STDSTPEKIV TAVGDDKRFK 
VVTHATNQGL HLARKTAAAY TKGEWVFCLD GDDEVTPDFL EQVVGRIEQN PVDILHLGIT
VIPENGVSEA EAEGFGSFIN QQSHFTQGDE VLRTIFDESY GQKIDWRTTQ RLYRGELFRS
AFAEMTSERL VRAEDAYEVF VLSDKAQTAD GFESCRGLLY HFGIGVTGVS RISLDKFGEF
CYQFLDNIEQ TEFYIGKTDN VVYLRSFEGM KHKLMELLIG DWKSRLAPED QEAALEPFAI
LFGPSVAARE LYRFVRDDAY EAIKTCTELP EDSNAYLYRS YAKKYAALMH QDEGLSFDRA
VRMKQIADEH MEYLERKHMV KMFEQQPIRI FITSHKDVDV PESNYLQPIQ VGPGQKTNRF
SYMLHDDEGD NITEKNPMYC EMTTQYWAWK NITNERYVGF GHYRRYFNFT DTIYPENPFG
EVMDDFIDED AIKKYGLDDQ TIAQCIEGYD LITTGVKDIR KFPGSANTPL EQYHAAPLLH
PKDMDTMAAL IVERHPEYAE DVNAFLNGYE QCFCNMYIMR RELFDRYAAW VFPLVDEWTA
RTDMSTYSKE ALRTPGHLTE RLFNIWRMHM LRTEGKNWKV KELQCVHFTN PEPRQKFIPL
FEEKPEIASQ NVVPVVFAAD NNYVPILTCA MGSMLENADP NRYYDVVVLN TNIGGSKQEL
VKKFFSRYKN ARITFYNVWR MVKDYKLDTN NAHISVETYF RFLAQDILSA YDKVVYLDSD
LVVNGNVAEL YDVRIGNNLI AATLDIDYLA NLNIRGGDRM KYSLDVLNLK NPYAYFQAGV
MVFNTAELRR YHTVPEWLRI ASNPIFIYND QDILNSECQG RVLYLPADWN VTHNIFGRAE
ELYPMAPNSV FDDYQAARRA PKIVHFAGAI KPWQNASCDM ASYFWKYARN TPFYEVIIQD
MVPSARNDAD VTEFHERALS DASPLRKIID PIAPYGSARR EALKALGRTL RGRK