Gene Apar_1336 details


Gene Information

Locus tag: Apar_1336
Symbol:
ID: 8414221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1502713
End bp: 1504410
Gene Length: 1698 bp
Protein Length: 565 aa
Translation table: 11
GC content: 52%
IMG OID: 645022933
Product: alpha amylase catalytic region
Protein accession: YP_003180351
Protein GI: 257785134
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID: [TIGR02403] alpha,alpha-phosphotrehalase
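
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length (the gene sequence in this record does end in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for a complete CDS: one codon per residue, minus the stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(1502713, 1504410)
print(length)                     # 1698
print(protein_length_aa(length))  # 565
```

Both values agree with the Gene Length and Protein Length fields of the record.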


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00360472
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCAGGGC AATCTTCACG AGTAACAAGC AATCCCAAGG GCGCTTATAC GCGCGCTGAG 
TTTCTCGGCA CCAAGGTTGT GTATCAAATC TACGTCCGCT CCTTTAACGA TTCAAACGGA
GACGGCATCG GAGACCTTCC CGGCATCACG CAACGTCTTG ACTACCTGCA AAAACTCGGC
GTTGACTATC TGTGGCTTAC TCCGTTTTTT GTTTCACCCC AACATGACAA TGGCTACGAC
GTTGCAGATT ACCGCAACGT CGAGCCTCTC TTTGGCACCA TGGCTGACTT CAACGAACTT
TCAGCCGAAG CAAAAAAGCA CGGTATCAAG CTGATGCTCG ACATGGTCTT TAACCACACA
TCAACCGAGC ATCCATGGTT TCAGCGGGCA CTTAGCGGCG ATCCTCAATA CCTTGCCTAC
TATACCTTTG TCGATGGCAA CCCCAACACG CCGCCAACAA ACTGGAAGTC CAAGTTTGGC
GGAAGCGCAT GGGAGTGGGT TCCCACACTT CATAAGTGGT ATCTCCATCT CTTTGACGCC
TCACAGGCGG ATCTCAACTG GGACAACCCA AGCGTCCGAG CCGAGCTTGC CGACATAGTG
AGTTTCTGGC ACAACAAAGG CGTTGATGGT TTTCGCTTTG ACGTTGTTAA TCTCATTTCC
AAGCCAGATG TCTTTGAGGA CGACGCCATA GGAGACGGCA GGCGCTTTTA CACCGACGGG
CCACATGTCC ACGAATACCT GCAGGAACTC GTCAAGCGCG GTGGTATCGA CGGGCTGATG
ACCGTCGGTG AAATGAGCTC CACCTCAATT GAAAACTGCA TACGCTATTC CAATCCGGCT
GACCATGAGC TTGCCATGAC CTTCTCGTTT CATCACCTTA AGGTTGATTA CCTCAACGGA
GACAAGTGGT CGCTCAAAGA GCCGGATATC GGCAAGCTTC GTGAGCTGTT GAAATCGTGG
CAAGAACAGA TTACGGCAGG TGGAGGCTGG AATGCGCTGT TTTGGGCCAA TCACGATCAG
CCTCGTCCGA ATTCTCGCTT TGGCGATACC GAGCACTACT GGGAAATGTC CAGCAAACTG
CTTGCCGTCA CCGCACATCT TCTGCGCGGA ACTCCCTATA TTTACCAGGG AGAAGAGCTG
GGTATGACCA ATGCGGGATT TACCAATATC ACACAGTATC GAGACGTGGA GTCCCTCAAT
TACTTTAAGA TCCTTCAGGA TCGGGGCTGC TCCCCCAAAG AAGCGCTCCA TATCATCTCC
GAGCGCTCCC GCGACAATGG GCGCACACCC GTTCAATGGG ATGCCTCAAA AACCGCAGGT
TTCACTTCCG GTACACCGTG GATTGGCATT CCCGACAACC ACACCATCAT CAATGCTGCT
GCAGAAGTTG GCGATCCTGA CTCGATATTC TCCTTCTATC AAAAGCTCAT TGTTCTTCGA
AAAACTCATC CCGTTATCAG CGAGGGAGAT GTTTGCTTTA TCGACTCCGC TGGCGAGAAG
GTCATTGCCT ACGAGCGTAC CTTGGATAGC TGTTGCGTGC GCGTCTTTGC AAACTTCTCT
GACCAAAAGG TTCGCTGTGC GCCGAAAGCT GAAATCGATG GGTCTGACGT CCTTATTGGT
AACTATCCCA ACACCGTAAC CGATGCAGAT GCGCTCATAC TTCGTCCTTT TGAGGCACGA
GCCTTTATCT GGGAATGA
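
The GC content field reported for this gene (52%) is the kind of figure that can be recomputed directly from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted in as text with the 10-base grouping and line breaks shown here; `gc_percent` is a hypothetical helper name, and the database may round or count ambiguous bases differently.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string; whitespace is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# Illustration on the first 10-base block of the gene sequence only.
print(gc_percent("ATGTCAGGGC"))  # 60.0
```

Passing the full 1698 bp sequence to the same function gives a value that can be compared with the reported field.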
 
Protein sequence
MSGQSSRVTS NPKGAYTRAE FLGTKVVYQI YVRSFNDSNG DGIGDLPGIT QRLDYLQKLG 
VDYLWLTPFF VSPQHDNGYD VADYRNVEPL FGTMADFNEL SAEAKKHGIK LMLDMVFNHT
STEHPWFQRA LSGDPQYLAY YTFVDGNPNT PPTNWKSKFG GSAWEWVPTL HKWYLHLFDA
SQADLNWDNP SVRAELADIV SFWHNKGVDG FRFDVVNLIS KPDVFEDDAI GDGRRFYTDG
PHVHEYLQEL VKRGGIDGLM TVGEMSSTSI ENCIRYSNPA DHELAMTFSF HHLKVDYLNG
DKWSLKEPDI GKLRELLKSW QEQITAGGGW NALFWANHDQ PRPNSRFGDT EHYWEMSSKL
LAVTAHLLRG TPYIYQGEEL GMTNAGFTNI TQYRDVESLN YFKILQDRGC SPKEALHIIS
ERSRDNGRTP VQWDASKTAG FTSGTPWIGI PDNHTIINAA AEVGDPDSIF SFYQKLIVLR
KTHPVISEGD VCFIDSAGEK VIAYERTLDS CCVRVFANFS DQKVRCAPKA EIDGSDVLIG
NYPNTVTDAD ALILRPFEAR AFIWE
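
The protein sequence corresponds to a translation of the gene sequence under translation table 11. A minimal sketch of that translation follows, assuming the forward-strand reading frame starts at the first base. Table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons; since this gene starts with ATG, no special handling of alternative starts is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence.
print(translate("ATGTCAGGGC AATCTTCACG AGTAACAAGC"))  # MSGQSSRVTS
print(translate("GCCTTTATCT GGGAATGA"))               # AFIWE
```

The two outputs match the first ten and last five residues of the protein sequence listed above.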