Gene Plim_3640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3640 
Symbol 
ID9140358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4681938 
End bp4683500 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content56% 
IMG OID 
Productsulfatase 
Protein accessionYP_003631651 
Protein GI296123873 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTGA AACGATGGCT GTTGACTCTA TGCACTCTGG CTTTATGCGT GGTAAACCAC 
TCGGAAGTTC CTTCAGCTCT GGCTGCTGAA ACCACAGGAA AGCCCCAGGT TTCCCGGCCC
AACATCGTGC TCATCTATGT GGACGATCTC GGTTATGGCG ACATCAGTTG TCATGGCGCC
ACGCTGGTCA AAACGCCCCA TGTCGATCGA TTGGCTCGCG AAGGACTCAA CTTTTCTGAT
GGACACTCAC CGTCGGCCAC TTGCACTCCC TCGCGCTACG CCATGCTCAC CGGCGAATAT
GCCTGGCGGA AAAAAGGGAC TGGAGTTCTT CCGGGCGATG CCAGGCTGAT TATTGAACCG
GGCCGCCGCA CTCTCGCTTC GACACTTCAA AAGGCGGGCT ATCGCACCGG TGTCGTCGGG
AAATGGCATC TGGGTTTAGG AGACGAAAAA CTTAACTGGA ACGGCGTCAT CAAACCCGGC
CCTCTCGAAG TGGGCTTTGA TGAATCGTTC ATCATGGCCG CGACCGGCGA TCGAGTGCCA
TGTGTCTACG TTGAGCAGGA TCGTGTGCTC AACCTTGACC CGAACGACCC AATCAAAGTC
CAGTTCGGCA AACCAATTGA TCCTGCCCTA CCCACAGGGA AATCGCATCC AGAATTGCTC
ACGGTCATGA AGCCGAGCCA CGGTCACGAC ATGACGATTA TCAATGGTGT CAGCCGGATT
GGCTATATGA CAGGTGGCAA GGCCGCTCTC TGGAACGATC AGGAGATGGC CGATGTCTTT
ACCTCAAAAG CACTCAAGTT CATGACCGAT CATTGGGCTC GCCATGCCGA TCAGCCGTTT
TTCTTGTTCT TTTCGCTGCA CGATATTCAC GTTCCCCGCT TGCCTCACCC CCGCTTTGTC
GGCAGCACCA GCATGGGCCC GCGCGGCGAC GTGATTGTCG AAATGGATTG GTGTGTCGGT
CAGGTGCTCG ACAAGCTCGC GGCCTTAGGA ATTGACGACG AGACGATGGT CATCTTTACC
AGCGATAATG GCCCCGTCGT CGATGATGGA TACAAAGATG AAGCCGTCAC GAAGCTGAGT
CATCATCAAC CGGCTGGCCC TTATCGAGGT GGTAAATATA GTGCCTATGA AGGAGGGACT
CGCGTCCCCT TCATTGTCCG CTGGCCAGGT CGCATCCAGC CGGGAACATC GAACGCGTTG
ATGTGTCAGA TCGACCTCAT GGCCTCGCTC GGCAAACTGG TGGGGCAACC TGTCCCACCC
CAGGAAGCGT ATGACAGTAT TGATGTCCTT CCCGCTTTAT TGGGTGAGTC ACAGGCAGGT
CGAGAGCAAC TGGTGGAGCA CTCGGGAGTT CTGGGCCTGC GCGCGGGCCC CTGGAAACTG
ATTGAGCCCG GCAAAGCCCC GCGTGTCTTC CAGCAGACCA ATACCGAAAC CGGCCAGCTC
CCCAGACCTC GCCTGTTTAA TCTCGAAGAA GACCCCGGCG AAACCCGCGA CCTCGCCGAA
GACCAACCCG AAAAAGTCAA AGAACTCCAA GCCCTCCTCG AGAGAATCAA AGGTGAGATC
TAA
 
Protein sequence
MKLKRWLLTL CTLALCVVNH SEVPSALAAE TTGKPQVSRP NIVLIYVDDL GYGDISCHGA 
TLVKTPHVDR LAREGLNFSD GHSPSATCTP SRYAMLTGEY AWRKKGTGVL PGDARLIIEP
GRRTLASTLQ KAGYRTGVVG KWHLGLGDEK LNWNGVIKPG PLEVGFDESF IMAATGDRVP
CVYVEQDRVL NLDPNDPIKV QFGKPIDPAL PTGKSHPELL TVMKPSHGHD MTIINGVSRI
GYMTGGKAAL WNDQEMADVF TSKALKFMTD HWARHADQPF FLFFSLHDIH VPRLPHPRFV
GSTSMGPRGD VIVEMDWCVG QVLDKLAALG IDDETMVIFT SDNGPVVDDG YKDEAVTKLS
HHQPAGPYRG GKYSAYEGGT RVPFIVRWPG RIQPGTSNAL MCQIDLMASL GKLVGQPVPP
QEAYDSIDVL PALLGESQAG REQLVEHSGV LGLRAGPWKL IEPGKAPRVF QQTNTETGQL
PRPRLFNLEE DPGETRDLAE DQPEKVKELQ ALLERIKGEI