Gene Plim_2048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2048 
Symbol 
ID9138751 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2661143 
End bp2662294 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content54% 
IMG OID 
ProductPrephenate dehydratase 
Protein accessionYP_003630075 
Protein GI296122297 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCACAA AGCCTCAGCC GACCAAGAAA ACATCGAAAG CCTCCACCAG CGGATCGGCA 
AAAACTGCTG GCAATCCCGT CAAGGCGACA GCGAGTCTTG AAACGGAAAT CGAGAAGCTC
GACGGTGAAA TTGTGAAACT TCTCAACCAG AGAGCCTCTG CCACTTTAAA GCTCCTCGAA
GCAGCCCCCG ACAAAAAGGC TGTGCTCTTT GATCCTAAGG CTGATGATCT CCTCGCATTG
AAGCTTGAAG AAAAGTCTTC CGGCCCTCTG CCTGCCCATG CTTTACGAGG CGTTCTGCGA
CAAGTTCTCA GCTCGATTCG CAACCAGGTT CGCACCCAGC GAGTGGCCTA TCTGGGCCCG
GCGTTCAGCT TTACTCACAT GGCGGCTATT GAGCGTTTTG GTGAGGCCGC CGATCTGATC
CCCGTGAATA CGATTGCCGC AGTGTTTGAA GAGGTTAATC GAGGCCATGC CGACTTCGGA
CTGGTGCCCA TCGAGAATAG TACAGATGGC AGGATCATCG ACACTCTCGA TATGTTCACT
CGCCTGCCAC TGCGTATCTG CGGGGAAGTT CAACTCGCCA TTCATCACAA TCTGCTGGGG
CGATCTGCCC GTGGAGAAAT CACAGAGATC TACAGTAAAC CGCAAGCCCT TTCGCAATGC
CGCGAATGGC TGAGTCGTAA TATGCCTCAA GCCCGGCTGA TTGAAGTCAC CAGCACTTCG
ACAGCCGCTC AACTGGGACG TGATAAACCA GGTGCCGCCG CAGTTGCCAG TCGCCAGGCA
GCCACGGAGT ACGGATTGGA AATTCTGGCA GCGAATATTG AAGACAACTC GCAGAACATC
ACGCGGTTTG CTGTGCTTGG CGACCGTACT ATGGCCCCCA CAGGACAGGA TCGAACCTCG
ATCCTGCTGC AGACAGCCGA CAAGCCAGGG GCTCTAGCCA ATGCTCTGGA AATCTTCAAA
CGGCTCAAAA TCAACCTGAC CTGGATTGAA TCCTTTCCAT TGCGTGGGCC CGAGAGCGGT
TACATCTTCT TCATCGATGC CGAGGGGCAC ATGAAGGACA CTCGTGTCAA GAAAGCACTT
GATGAACTGA CAACTCATGC GGTTCGCCTC GAAGTGCTGG GCTCTTACCC CTGCAGCGAA
CCGATTGACT GA
 
Protein sequence
MATKPQPTKK TSKASTSGSA KTAGNPVKAT ASLETEIEKL DGEIVKLLNQ RASATLKLLE 
AAPDKKAVLF DPKADDLLAL KLEEKSSGPL PAHALRGVLR QVLSSIRNQV RTQRVAYLGP
AFSFTHMAAI ERFGEAADLI PVNTIAAVFE EVNRGHADFG LVPIENSTDG RIIDTLDMFT
RLPLRICGEV QLAIHHNLLG RSARGEITEI YSKPQALSQC REWLSRNMPQ ARLIEVTSTS
TAAQLGRDKP GAAAVASRQA ATEYGLEILA ANIEDNSQNI TRFAVLGDRT MAPTGQDRTS
ILLQTADKPG ALANALEIFK RLKINLTWIE SFPLRGPESG YIFFIDAEGH MKDTRVKKAL
DELTTHAVRL EVLGSYPCSE PID