Gene Plim_2022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2022 
Symbol 
ID9138725 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp2622208 
End bp2623485 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content54% 
IMG OID 
Productprotein of unknown function DUF201 
Protein accessionYP_003630049 
Protein GI296122271 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGACGA CCCCCAAAAT TCTGATCGCA GGGGGAAGTG TGCGTGCTGC AGCGGACTCG 
GCTCAGCGTG GCGGGTGGCA AGTTATCGCC TATGACCAGT ACGGAGATCT GGATCTTCAA
GAATCATCGA TCTGGCAGTC ACTTCCCGCA GAAACCAAGA ATTGGTCTTT TGATCTCGCC
CATGTCGAAC ACGTGCAAGG TTGGACATAC ACAGGCCCAT TTGAGAACTG GCCCGCAGAT
GCATCGAGAT TCTCAGAGCA AGCCGACAAA TGCTCGATTC CGCTGCTTGG AGTTTCACCC
CGGCTTCTCA AACGGATTCG TGATCCCCGC TTTCTGCAGG AAACGTTTGC CAGTATCGGT
GGGAATCCTC TGGCGGTGAT CCTTCCCGGC CCAGTCGTAC CTTCCGAATT ACGACCAGCG
ACTCAAATCG ATCTATCCAA AGTTCAAGAG TTGATTCGTA AACCGCTGCA CTCCGGGGGA
GGGTTCAATG TTCGCGATGT CTCTCAAGCC GACATCATGG AAGAGAGCTT CCCTCATTCG
CTTGATGAGC TTGAATACCT GCAGGAAAAA GTCCATGGGG TTCCGCAAAG CTGCATCTTC
ATGGCGAGTT CAACCCATGT GGAGAGAATC GGTTGGACTG AAGGTCTGAT GGGCACTCCC
TGGGGAAAAT ATGGTTATCG CGGCTCCGTG GGCCCGATCA CTGTTCCCCC ACCGATCGAT
GAGCTGGCCT GTTCACTGGC GAACGCGATT GTCAAAGCGA CGGGCCTGTC GGGAATTCTG
GGAATTGATG GCATTCGCTG CGATGAGTCG TGGCGACCGA TCGAAATCAA CCCGCGCTAC
ACGGCGAGTT GTGAAATTCT GGAAGGAACA GTCGGCCATC CCGCATCGCT GATGGATCTT
CATCTCAGGG CCTGGCAGAG CGAACTGATG CCACCTTGTT TGAGAGATGT GTCCGCAACT
TCCAGGAAAA AAACAACAGG TGGGGATCAT TCGCAGCGCC CCGACTTCGG TTGCAAGCAG
ATCGTCTATG CAGGGGAAGC CTTTAATGCG CCGGATTTGC GTGTTCCCTG GTTGAAATCG
CATTCTGGCA AATTCCTTCC ACAAGCATCA CCTCATCAAG TGATTGCCGA TATCCCCATG
CCTGGAAGTC TCATTTCCAG TGGATACCCG GTTTGCACCC TCTTCGGATG GGGTTCTTCG
ATTGCCGATG CCCGCCGACA ATGCGCAGAG CATTGGTCTC AACTGACGAC TCATTTTCCC
GAGCTGAAGC TTGCCTGA
 
Protein sequence
MQTTPKILIA GGSVRAAADS AQRGGWQVIA YDQYGDLDLQ ESSIWQSLPA ETKNWSFDLA 
HVEHVQGWTY TGPFENWPAD ASRFSEQADK CSIPLLGVSP RLLKRIRDPR FLQETFASIG
GNPLAVILPG PVVPSELRPA TQIDLSKVQE LIRKPLHSGG GFNVRDVSQA DIMEESFPHS
LDELEYLQEK VHGVPQSCIF MASSTHVERI GWTEGLMGTP WGKYGYRGSV GPITVPPPID
ELACSLANAI VKATGLSGIL GIDGIRCDES WRPIEINPRY TASCEILEGT VGHPASLMDL
HLRAWQSELM PPCLRDVSAT SRKKTTGGDH SQRPDFGCKQ IVYAGEAFNA PDLRVPWLKS
HSGKFLPQAS PHQVIADIPM PGSLISSGYP VCTLFGWGSS IADARRQCAE HWSQLTTHFP
ELKLA