Gene Apar_1051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1051 
Symbol 
ID8413924 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1192345 
End bp1194357 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content44% 
IMG OID645022640 
Product1,4-alpha-glucan branching enzyme 
Protein accessionYP_003180070 
Protein GI257784853 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0296] 1,4-alpha-glucan branching enzyme 
TIGRFAM ID[TIGR01515] alpha-1,4-glucan:alpha-1,4-glucan 6-glycosyltransferase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGAGT TAAATGCATC GTTAGCAGTG AGTGACCAGG ATGCATACTT GTTTGCGCAA 
GGAACATGGT TTCAGAGCCA TAAAAAACTA GGTGCGCACC CTGCGGTCCA AGAAGATGGA
ATTGAGGGCT ATCATTTTGC TGTTTGGGCA CCCAATGCAG TTTCTGTTTC TGTTGTTGGA
GACTTTAATA ATTGGGATGA TACCGTCAAC GTTCTATCTC GCTCAAAACA CGGTGGAATC
TGGGAAGGTT TTGTTCCAGG AATAACCTCA GAAGTATTGT ATAAATTTCT GATTGTATCC
ACATCTGGCG AGAAGATTTT CAAGGCTGAC CCGTATGCTA CCTATGCAGA GGTTAGACCA
CATACGGCAT CTATTACGTT TGACCCGGAT GTGTACGTTT GGGAAGACGA TGTATGGATG
AAGAAACGTG CAACGCTTAA GTTTCTACAT AGTCCTCTCA ACATTTTTGA AGTTCACCTT
GGTAGCTGGA AGCAGCATAC TGACTCGTCT GCTCAAGAAG ATTCTGCTAA TGCAGAGACT
ACTGAAAAGA TTGAAGAACC TAAAGATTAT TTTGATACAA ACGTAGACGC GTTCTACACC
TATGATGACC TGTCAAAAGA GTTGGTAGCT TACGTTAAAG AGATGGGCTA CACCCATATT
GAGCTTCTCC CCGTTATGGA GCACCCCTTT GATGGATCCT GGGGCTATCA GGTAACTGGC
TATTTTGCTC CCACCTCAAG GTATGGTAAT CCTGCTCAGT TCAAGCACTT TATTGACTCC
TGCCATCAGG CTGGTATCGG CGTCATTTTA GACTGGGTTC CAGGAGGTTT CTGCAAAGAT
GCTCATGGTC TTGCAGAGTT TGATGGCACC AAACTTTTTG AAGAAAAAGA GCATCCTAAC
TGGGGAACTC TTAAGTTTGA CCTTACTCGT GGTGAGGTTA GAAGCTTCCT TGTTTCTAAC
CTGCTTATGT GGTTGAAGGA TTATCACGCT GATGGCATTC GTGTTGACGG TGTCAGTAGC
ATGTTGTATT TGAATTTTGG CATCGATGAC CCTTCTCAGA AGCGCTTTAA TTGCAAGGGT
ACTGAGGAAG ATTTGGATGC AAGCGCTTTC TTGCGTCTCT GCAATGAGAC CGCTCAAAAG
CAGTATCCTG ATATTTTGAT GATTGCAGAA GAGTCCACGG CATGGCCGCT GGTTACCTAT
CCTCCAGATG TTGGGGGACT TGGTTTCAAC CTTAAGTGGG ACATGGGTTG GATGAACGAT
ACGTTGCACT ATTGTCAGAC TGATTTTCCT TATAGACCAG GCAATCATCG ACTTTTAACC
TTCTCCAGCA TGTATCAGTT CAATGAGAAC TTTGTGCTGC CACTGAGCCA TGATGAAGTT
GTTCATGGTA AGTGCAGCCT TATTCAGCGT ATGCCAGGAG ATTGGTGGAG ACAGTTTGCT
GGTATGAGAG CTCTAGCGCT CCATCAGATG ACCCATCCAG GTGCTAAGCT CAACTTTATG
GGCAACGAGA TTGCACAGTT TATTGAGTGG CGCTATTACG AGGGCATTGA GTATTACTTG
ACTGAGGAGT ATCCAACTCA TGCTCACCAG CAGGCATATA TTAAGGCTCT TAATCATTTC
TATAAGAATC ATCCAGGACT TTGGCAGTAC GCCTATGACA ACCGTGGATT TGATTGGATT
GATGCAGATA ATAATGAGCA ATCAATCATT TCGTTTGTTC GCCATGGAAG AAAGCCTTCC
GAGGACCTGG TTATTCTCAT CAATTTTGAC GTTGCAACCC ATCAGAACTT CCGTTTAGGT
ATGCCTTCTG AAGGTGTGTG GAAAGAAGTA TTTAATTCCG ACGCAAAAGA GTTTGGCGGT
TCGGGTGTTG TGAATAGTAA GAAACTTTCC ACTAAACCAG TGGCATGGAA TGGAAGAGAT
TATTCGGTAG AACTGTCTGT TCCTCCGCTT GGTGGCATTG TTTTAGCGTT TGAGAAAGAA
CTTCCAAAGA GGGGGAAGCA TGGCCGTAAA TAA
 
Protein sequence
MAELNASLAV SDQDAYLFAQ GTWFQSHKKL GAHPAVQEDG IEGYHFAVWA PNAVSVSVVG 
DFNNWDDTVN VLSRSKHGGI WEGFVPGITS EVLYKFLIVS TSGEKIFKAD PYATYAEVRP
HTASITFDPD VYVWEDDVWM KKRATLKFLH SPLNIFEVHL GSWKQHTDSS AQEDSANAET
TEKIEEPKDY FDTNVDAFYT YDDLSKELVA YVKEMGYTHI ELLPVMEHPF DGSWGYQVTG
YFAPTSRYGN PAQFKHFIDS CHQAGIGVIL DWVPGGFCKD AHGLAEFDGT KLFEEKEHPN
WGTLKFDLTR GEVRSFLVSN LLMWLKDYHA DGIRVDGVSS MLYLNFGIDD PSQKRFNCKG
TEEDLDASAF LRLCNETAQK QYPDILMIAE ESTAWPLVTY PPDVGGLGFN LKWDMGWMND
TLHYCQTDFP YRPGNHRLLT FSSMYQFNEN FVLPLSHDEV VHGKCSLIQR MPGDWWRQFA
GMRALALHQM THPGAKLNFM GNEIAQFIEW RYYEGIEYYL TEEYPTHAHQ QAYIKALNHF
YKNHPGLWQY AYDNRGFDWI DADNNEQSII SFVRHGRKPS EDLVILINFD VATHQNFRLG
MPSEGVWKEV FNSDAKEFGG SGVVNSKKLS TKPVAWNGRD YSVELSVPPL GGIVLAFEKE
LPKRGKHGRK