Gene Plim_3398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3398 
Symbol 
ID9140114 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4394766 
End bp4396340 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content54% 
IMG OID 
Productsulfatase 
Protein accessionYP_003631410 
Protein GI296123632 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.145395 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATTC ATCGACTGAT TCTGGCGATA TTGCTCCTGT TGACTTGCCG GGGTGCTATG 
GCTGCGGAGC CGGCGCCACC GAATATTCTG TTCGCCATTG CGGATGACTG GGGTTATGGG
CATGCGAGTG CTTATGGCTG CCAGTGGGTG AAGACACCCG CTTTTGATCG TGTCGCCCGC
GAGGGGTTAC TCTTTCAGCA GGCATATACG CCCAATGCCA AGTGCGCACC TTCGAGAGCC
TGTATTCTCA CCGGCCGAAA CTCGTGGCAA CTCAAAGAGG CGGCCAACCA TTTGTGCTAC
TTCCCCACGG AGTTCAAAAC ATGGGGGGAA GCGTTGGCCG AGAAGGGTTG GCATGTGGGC
TATACAACCA AAGGCTGGGG GCCTGGCGAA GCGAAAGACG AGACTGGCAA GCCCCGAGCT
ATGACAGGGC AACCCTTCAA CAAGCAGAAG CTCACACCGC CTGCCCAGGG AATTGGGCCT
AATGATTATG CCTCGAATTT TGGGGATTTT CTGGCGAGTG CACCTGCTGG TAAGCCATGG
TGTTTCTGGT ATGGGAGCAT TGAGCCCCAT CGCGATTATG AGTTTGGTTC GGGGATCAAA
AAAGCGGGCA AAAAACTGAG TGAAATCGAT CATGTGCCGG GTTATTGGCC CGACAATGAA
ACGGTTCGCA CGGATCTGCT TGATTATGCC TATGAGGTGG AACACTTCGA TCAACATCTG
GGGCGGATGC TGAAGGCTCT TGAAGAGAAG GGGCTGCTGG AAAATACACT GGTGATCGTC
ACTTCTGATC ACGGCATGCC GTTTCCCAGG TGCAAAGGAG GCGCCTACGA AGCGTCCAAT
CATGTCCCGC TGGCCATGAT GTGGCCCAAA GGAATTCGAG CACCGGGGCG CTCCATCGAC
GACTTCGTGA GCTTTATTGA TCTGGCACCG ACCATTCTTG ATGTGGCATC AATCCCGTGG
AATGAAACCG GGATGGCACC GGCGACCGGT CGATCACTCA GAGACATCTT CGAATCGAAA
CAATCAGGCC GGGTCGATGC AGCGCGCGAT CATGTGCTGA TTGGTATGGA GCGGCACGAT
ATCGGTCGCC CGCTCGATGT CGGTTATCCC ATTCGTGGCA TCATCACGCA TGAGTCGCTC
TATCTGCACA ATTTTGAGCC TGATCGCTGG CCCGCGTGCA ATCCGGAAAC GGGCTATCTC
AATTGTGATG CGGGTGCGAC GAAAACGGTC ATTCTGGAAG CCCGGCGGAA AGCTGGAAGC
GATCCTTACT GGTCACTCTG TTTTGGCAAG CGGCCACGGG AAGAGTTTTA TGATCTCCAG
CAGGACCGCG ATTGCATTCA AAATCTGGCT GATGACCCGA CTCTGACAGC TCGCATGGAA
GCACTCAAAA ACCGTCTCTT TGCCCAGTTG AAGGAGGAGC AGGATCCTCG CATGGATGGC
AAGGGTTATC TTTTTGATGA ATACCCGCAC TCGAACAAAA TCCATCGCGG CTTTTACGAA
CGATTTATCA AGGGCGAGAA GCTGAATACC GGCTGGGTTG ATCCCACAGA CTATGAGCCA
GCACCCCTTG ATTGA
 
Protein sequence
MRIHRLILAI LLLLTCRGAM AAEPAPPNIL FAIADDWGYG HASAYGCQWV KTPAFDRVAR 
EGLLFQQAYT PNAKCAPSRA CILTGRNSWQ LKEAANHLCY FPTEFKTWGE ALAEKGWHVG
YTTKGWGPGE AKDETGKPRA MTGQPFNKQK LTPPAQGIGP NDYASNFGDF LASAPAGKPW
CFWYGSIEPH RDYEFGSGIK KAGKKLSEID HVPGYWPDNE TVRTDLLDYA YEVEHFDQHL
GRMLKALEEK GLLENTLVIV TSDHGMPFPR CKGGAYEASN HVPLAMMWPK GIRAPGRSID
DFVSFIDLAP TILDVASIPW NETGMAPATG RSLRDIFESK QSGRVDAARD HVLIGMERHD
IGRPLDVGYP IRGIITHESL YLHNFEPDRW PACNPETGYL NCDAGATKTV ILEARRKAGS
DPYWSLCFGK RPREEFYDLQ QDRDCIQNLA DDPTLTARME ALKNRLFAQL KEEQDPRMDG
KGYLFDEYPH SNKIHRGFYE RFIKGEKLNT GWVDPTDYEP APLD