Gene Information |
Locus tag | Paes_1356 |
Symbol | |
ID | 6460402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prosthecochloris aestuarii DSM 271 |
Kingdom | Bacteria |
Replicon accession | NC_011059 |
Strand | + |
Start bp | 1476715 |
End bp | 1478643 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642725340 |
Product | alpha amylase catalytic region |
Protein accession | YP_002016025 |
Protein GI | 194334165 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
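The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1476715
END_BP = 1478643
GENE_LENGTH_BP = 1929
PROTEIN_LENGTH_AA = 642


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 1478643 - 1476715 + 1 = 1929 bp,
# and 1929 / 3 = 643 codons, i.e. 642 residues plus one stop codon.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```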
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00000107731 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0276756 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCCAA AAAAACAACA TCTCGAACAG CTTCTCAAAG CACTGGAACA GCTAACGATC GGGGAATGTC ACGAAAGCTA TCACGTCCCC CGTCTCTGGC AGGAAACAGC AAAAGGCCAG CACACGGTCA ATCCCGCACG TTACTACCAC GACATCATCA GTCGTCTTGT CAATAGCCCT GAGCTTCCTT TTGACAACAG CGGTGCCCCG GCCGATGAGT GGAGCAGGAG TGCCATCGTC TATAACTTAT TCCCTCGCCT GACGACAGCA TTCGATCATG ATAACAACAA TATCATCTCT GCCGATCCCA TGGCCAGCGG GTTTCGTGAA ACAGGCACCT TACTGAAATC CATCGCCCTG CTGCCCTACA TCAAGCAATT GGGTGTCAAT ACGCTCTATC TCCTGCCGGT AACAGAACCC GGCTATTATG GAAAAAAAGG CGCACTCGGT TCACCGTATG CCATAAAAAA CCCATTTATC ATCGATCCAC TGCTTTCTGA ACCAGTCCTC GGGCTTAGAG CAGAGATTGA ACTTCAAGCT TTTGTCGAAG CAGCGCACCA TCTCGGCATG AGAGTCGTTC TGGAGTACGT ATTCAGGACC GCATCTGTCG ATAGCGACTG GATCTCAACC CACCCGGACT GGTTCTACTG GCTGAAGGGC AACAGTGAAG AAAACCAACA GTTCGGCCCC CCCCGTTTCG AGCACGACAC GCTCACAAGA ATCTACGAAC AGGTCGATAA ACATGAATTC GACCATCTGC CGGCCCCGTC AGAAGAGTAC CGCAATCAGT TTTCTGAAGC GCCGCTCACC ACAAGGCTCA ATCCTGACGG AAGCATGACC GGCATAGCCG CTGACGGAAA AGAGTGTCGT ATAGCCAGCG CATTTTCCGA CTGGCCTCCG GATGACAGTC AGCCGCCGTG GACGGATGTG ACCTACCTGA AAATGCACAG GGACAGCTCG TTCAACTATA TGGCTTACAA TACAATACGT ATGTACGACG CGGCTCTTGA CCGACCGGAA GTCATCAATA ATGAGCTCTG GTCATCCCTG GAAAGCATCA TTCCTTTCTA TCAGAGCACG TTCAGCATAG ACGGCGCCAT GATCGACATG GGCCATGCCC TGCCTTCTGC ACTCAAGGCA GCTATTGTAG AGAAAGCCCG GCAACAGAAT TCTGACTTTG CTTTCTGGGA TGAAAATTTC GATCCCTCTT CATCGGTTAA AGAAGAAGGC TTCAACGCGG TCTTCGGCTC GTTGCCCTTT GTCATCCAGG ACCCGATTTA CATACGAGGA CTGCTGAACT TTCTCAATAA AACAGGTTCT GCCATTCCCT TCTTCGCTAC AGGAGAAAAT CACAATACCC CGCGGCTCTG CCACAACCAC CCGCGCCAGG AAACCGGAAG AAACCGTTCA CGTTTTATTT TTGCGCTCGG CATCATGCTG CCGGCGCTGC CGTTCATTCA TTCAGGAATG GAACTGTGTG AATGGCATCC GGTCAACCTC GGCCTGAACT TCAGCGATGA AGACCGCAGA AATTTTCCGT CTGAAACCCT GCCGCTCTTC AGCACGGGAA GTTATGACTG GAGCAGAACA AACGACCTGG AGCCGCTGAA CGCTTTTATC TCGAAATTGC TTGTAATCAG GCGGCGCTAT CATGATATTA TAAGCAGCGG AGAGAAAGGT TCGCTGATCG TACCGTTCGT AACCGGACCG GAAATTCTTG CAGTCATGCG TAAGGGCGAA GGGCACAATC TGCTCTTCAT CGGAAACAGC AACCCGCAAG AGCACTCATC GGGCACCATG GAATTCAAAA ACAGTCATTT CACCCTCACT GATCTGATCG GAGGACAAAC GCTGAGAGTA GCGGATCACA AGCTGCATCT TGAACTGCAG CCAGGTGAAT GCCTGATCGT AGAGCTGACC GCAGAATGA
|
Protein sequence | MSPKKQHLEQ LLKALEQLTI GECHESYHVP RLWQETAKGQ HTVNPARYYH DIISRLVNSP ELPFDNSGAP ADEWSRSAIV YNLFPRLTTA FDHDNNNIIS ADPMASGFRE TGTLLKSIAL LPYIKQLGVN TLYLLPVTEP GYYGKKGALG SPYAIKNPFI IDPLLSEPVL GLRAEIELQA FVEAAHHLGM RVVLEYVFRT ASVDSDWIST HPDWFYWLKG NSEENQQFGP PRFEHDTLTR IYEQVDKHEF DHLPAPSEEY RNQFSEAPLT TRLNPDGSMT GIAADGKECR IASAFSDWPP DDSQPPWTDV TYLKMHRDSS FNYMAYNTIR MYDAALDRPE VINNELWSSL ESIIPFYQST FSIDGAMIDM GHALPSALKA AIVEKARQQN SDFAFWDENF DPSSSVKEEG FNAVFGSLPF VIQDPIYIRG LLNFLNKTGS AIPFFATGEN HNTPRLCHNH PRQETGRNRS RFIFALGIML PALPFIHSGM ELCEWHPVNL GLNFSDEDRR NFPSETLPLF STGSYDWSRT NDLEPLNAFI SKLLVIRRRY HDIISSGEKG SLIVPFVTGP EILAVMRKGE GHNLLFIGNS NPQEHSSGTM EFKNSHFTLT DLIGGQTLRV ADHKLHLELQ PGECLIVELT AE
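The gene and protein sequences above can be related by translating the CDS. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 assigns the same amino acids to sense codons as the standard genetic code (the tables differ in which codons may act as start codons), and that the spaces every ten characters are display formatting only. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence, and its last nine bases.
print(translate("ATGAGCCCAA AAAAACAACA TCTCGAACAG"))  # MSPKKQHLEQ
print(translate("GCAGAATGA"))                         # AE
```

Passing the full gene sequence to `translate` should reproduce the protein sequence row, and `gc_fraction` can be compared against the GC content field.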