Gene Apar_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1046 
Symbol 
ID8413919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1182660 
End bp1186097 
Gene Length3438 bp 
Protein Length1145 aa 
Translation table11 
GC content45% 
IMG OID645022635 
Product4-alpha-glucanotransferase 
Protein accessionYP_003180065 
Protein GI257784848 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1640] 4-alpha-glucanotransferase 
TIGRFAM ID[TIGR00217] 4-alpha-glucanotransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGCTT TTCATAACAC CTCAAACATT GTTTGTAGAG ATCCAAAAGG CGCAGCAGAA 
CTAGGCACTT CAGTAACAAT CAGAATATTT GCTTGGGACG ATGATGTTCA ATCAGTAACG
CTTCGTCTAT GGCAAGAATA TGGTCCTGAA TCATCTTTAG AGGCCCCAAC TCTATCTGGC
GAGAAATGCG TTGTGATGCA AAAGAGTGTT TTTGCTGAAA CGCTTCCAGC TGGTGTACCC
GATTATGCAC AGTGCTTTGA GGCTATTATC AAACCTGCTG CTACAGGCCT TATTTGGTAT
CGTTTTGAGC TGCAAGCTTC TGACGGAGCA GTCTGGTTGT ATGGTGCTCA AGAAAATAGG
TGCACAGGTG TTGGTGGTTT TGCCTATGGT GAGCCACCAT CATTTCAGAT TACCTGCTAC
GAGCCTCGTA CCAGCGTTTT TGGTGTTGAA GATCCAAGCT GGTACAAAGG GGGAGTTGTC
TACCAGATTT TTCCTGACAG GTTTGCACGA GATGCAAATT GGCGCGAACG TACCATGCAA
GCACTTGCTG TTCCAAGAAA TGGAGTTTCT CGCCAGTTGG TTGAAGACTG GGAGAAAGTC
CCGGAGTATC AAAGAGACGC TAATGGTCGC GTGACCGAGT GGGATTTCTA CGGCGGATCT
TTGGTTGGCA TTGAAGAGAA ACTGCCCTAT TTGGAAGACC TAGGAATAAC TGCGCTTTAT
TTGAATCCTA TTTACGCTGC TTCTAGTAAT CATCGCTACG ATATTGCGGA TTATTTGGAA
GTTGACCCTG TTTTGGGCAC CGTAGAAGAC TTTGAGCACC TCTGTGTTAA AGCGGCTGAG
CATGGTATTT CTATTATCTT GGATGGCGTA TTTAACCACT GTGGTGCTGA CTCAAAATAT
TTTAATAAAT TTTCTAATTA CCCCGAACCA GGAGCAGTTC AACAGGCAGG TTCGGTCTAT
GATGAGTGGT TTACGCTTCA TGAGGATGGA ACGTATGAAA GTTGGTGGGG TGTTGACGCG
CTTCCAACTA TTGTGTCTGA TAACCCTACT TACCAGGAGT TTATTTGTAG TGAGAATGGT
GTTATTAGAA CATGGCTACG TCGTGGCGCT CGCGGTTGGC GCTTAGACGT TGCAGATGAG
ATTTCTGATT CATTTATTCA GAAGATTCGT GCAGCTGCAC TCGCAGAGCG CAGCGATGCG
GTTATTATTG GAGAAGTCTG GGAGGATGCA AGTAATAAGC GTGCCTATGG AAAACTTAGG
CATTATCTTG AGGGTTTTGA GCTTGACGGC CCTATGAATT ATCCTCTGAG AAAAGCTATT
CTTTCATTTT TGATGAATGA GGCTGGTGCT GAGGCAACTG TTTCAGCTCT TGAGGAGCTT
TGGGAAAACT ATCCTCACGA GGCGTTTTAT TCTTGTCTAA ATGTGTTTGG AACGCATGAT
AAGGAACGTT TAATTAATGT AGTTGCGGAT GCTCCCACGC CTGATTCTCT TTCTGCTCAT
GAGCAGGTGA CGTATAGACT TTCTGCTGAT CAGCGAGGTC TTTCTAAGGC TCGTATGTGG
CAGGCGGCGG TGCTCCAGAT GACACTTCCT GGTGTTCCTT CTATTTATTA CGGCGATGAG
ATGGGTGTTG AGGGATTTGT TGATCCAACA AATCGCGCGA CAATGCCATG GCCTGGAACC
AGTCCTCGTG CAGATTTAGA TTACTTTCAC ATTTATCGCA ATGCTATTGC GCTTAGAAAA
ACGCTACCAC TACTGGTAAA CGGCTCGTTT GAACCGTTTG TTCCAGAAGA TGGATCTAAT
GATGTTTTGG CTTTCTGGCG TAGGCCTCTT CAGAAAGATG ATTGTTTACA GCAGCAGGGT
GAGTTTACGC CTGGCATTTG TGTTCTGGTA AATAAGAGCC GCTCAGATGC TAAAACAGTA
TATGTTTCTG TACCTCAAAA TATGCAGGTT ATTGACGTTA TTAGTGGCCA AGCGGTACCT
GTCAAAGATG GTAAAGCAGA GGTCTTTTTA TGGCAGCTTG GATGTACCGT TTTAAACGTA
CAGCCGACCC ACAGGTTACA AAAGTCTCTT GAGCCTGGTA TGGGCGTGCT CGCTCATATT
ACTTCTCTGC CCGCCAATGA TGGAGATGTG ACTAAACCGG GACAGAGACT TGGTGTTATT
GGTCGTGAAA CCTTCGAGTT CATTGATTTT CTCGCCAAAT CTGGTCAAAA GTACTGGCAG
ATTTTGCCTG TCAGTCCAAC CGATGAATAT GGCTCTCCTT ATGCGGGAAT TTCCGCTTTT
GCAGGAAACA TTAATTTGCT CGATCCAGCT GCAATGGAGA AAGTCATTCT TGAGTGCGAC
GACTCTCAGG GTGAGCGCGC TGCTGAGTAC CAGGCATTTC TGGCGAGAAA CTCGTATTGG
CTTGATTCGT ATGCGGCTTT TCGCGCTATT AAAGATCTTC TTGGTGAGGG GATGTGGCAG
GAGTGGCCTG AACAGTATCG TAGCTGGTCA CAAGAGTTGT TTACACGTCC TGAGCTTGTT
CATGCAATTG AGCTTCATCG GAAGTACCAA TTTGCTTTTG ATGTCATTTG GAGTCAGACT
TTAGCGGAAG CGCATGCTAA GGGTATTCAG ATTATTGGTG ACATGCCCAT GTTTGTTTCG
GAAGATTCCG CCGATGTTTG GGCACATCCA GAGCTTTTTG CTTTAGATGC TACAGGCCAT
ACAGAGCTTC AAGCTGGTGC TCCAGCTGAT GCTTTTTCGC AAGATGGTCA GCTGTGGGGT
AATCCAACAT ATAACTGGCA AGCTCATAAA GACGAGGAAT ATTGTTGGTG GATTGAGCGA
TTCCGCAGGT CTTTCTATCT ATACGACTAT ACAAGGCTCG ATCACTTTAT TGGCTTTACG
TCTTACTATG CTATTGAGCA AGGAAAAACT GCAACTAGAG GTTCATTTAA GTTTGGACCT
GGCTATGAGT TGTTTGATGT TGCTTATAAA CAGCTTGGTC CACTTCCTTT TATTGCCGAG
GATTTGGGAG CAGTTACTCC TGCTGTTAGA GCACTGCTCT CTCAGACGGG ATTTCCTGGC
ATGAGCGTCA TTCAGTTTGC TGACGGAGAC TGCAGATATT CCTTTGCACC AGCTCAGGAG
TCTATTGTGT ACAGTGGTAC ACATGATACC CAAACGCTTA TGGGTTTTGT AGAAGATCGC
TTTACAGGCG GCCAGGCAAC TGATGAATCA CAACAGATTT TTAATCATCT TATGGAGCAG
ATAGTTGATA CCTCTAATGC AGTAGTTATC CTGCCGCTTC AGGATGTTTT GTGCCTTTCC
GACAATGCTC GTATGAACAT TCCTGGTAAG ACTGAAGGCA ATTGGTCTTG GCAGGTTAAA
AAGGATATGA TTACGCCAGA GGTTATTCAA AGGCTACAGA GGTATGTAGA ACTGCATCAG
AACAGACGCA ACGCTTAA
 
Protein sequence
MQAFHNTSNI VCRDPKGAAE LGTSVTIRIF AWDDDVQSVT LRLWQEYGPE SSLEAPTLSG 
EKCVVMQKSV FAETLPAGVP DYAQCFEAII KPAATGLIWY RFELQASDGA VWLYGAQENR
CTGVGGFAYG EPPSFQITCY EPRTSVFGVE DPSWYKGGVV YQIFPDRFAR DANWRERTMQ
ALAVPRNGVS RQLVEDWEKV PEYQRDANGR VTEWDFYGGS LVGIEEKLPY LEDLGITALY
LNPIYAASSN HRYDIADYLE VDPVLGTVED FEHLCVKAAE HGISIILDGV FNHCGADSKY
FNKFSNYPEP GAVQQAGSVY DEWFTLHEDG TYESWWGVDA LPTIVSDNPT YQEFICSENG
VIRTWLRRGA RGWRLDVADE ISDSFIQKIR AAALAERSDA VIIGEVWEDA SNKRAYGKLR
HYLEGFELDG PMNYPLRKAI LSFLMNEAGA EATVSALEEL WENYPHEAFY SCLNVFGTHD
KERLINVVAD APTPDSLSAH EQVTYRLSAD QRGLSKARMW QAAVLQMTLP GVPSIYYGDE
MGVEGFVDPT NRATMPWPGT SPRADLDYFH IYRNAIALRK TLPLLVNGSF EPFVPEDGSN
DVLAFWRRPL QKDDCLQQQG EFTPGICVLV NKSRSDAKTV YVSVPQNMQV IDVISGQAVP
VKDGKAEVFL WQLGCTVLNV QPTHRLQKSL EPGMGVLAHI TSLPANDGDV TKPGQRLGVI
GRETFEFIDF LAKSGQKYWQ ILPVSPTDEY GSPYAGISAF AGNINLLDPA AMEKVILECD
DSQGERAAEY QAFLARNSYW LDSYAAFRAI KDLLGEGMWQ EWPEQYRSWS QELFTRPELV
HAIELHRKYQ FAFDVIWSQT LAEAHAKGIQ IIGDMPMFVS EDSADVWAHP ELFALDATGH
TELQAGAPAD AFSQDGQLWG NPTYNWQAHK DEEYCWWIER FRRSFYLYDY TRLDHFIGFT
SYYAIEQGKT ATRGSFKFGP GYELFDVAYK QLGPLPFIAE DLGAVTPAVR ALLSQTGFPG
MSVIQFADGD CRYSFAPAQE SIVYSGTHDT QTLMGFVEDR FTGGQATDES QQIFNHLMEQ
IVDTSNAVVI LPLQDVLCLS DNARMNIPGK TEGNWSWQVK KDMITPEVIQ RLQRYVELHQ
NRRNA