Gene Plim_3419 details


Gene Information

Locus tag: Plim_3419
Symbol:
ID: 9140135
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Planctomyces limnophilus DSM 3776
Kingdom: Bacteria
Replicon accession: NC_014148
Strand:
Start bp: 4426682
End bp: 4428262
Gene Length: 1581 bp
Protein Length: 526 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: sulfatase
Protein accession: YP_003631431
Protein GI: 296123653
COG category:
COG ID:
TIGRFAM ID:
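
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative and not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 4426682
end_bp = 4428262

# An inclusive range spans end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop
# codon, which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.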


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATACCG TCCGATTTGC CTTAATTCTC AGTCTGGGCT GGCTGGCTCT CTTCTCTCCT 
GATGTTCAGA TCTCACCTGC TATGGCGGCA GAAAAGGCGG CACCGCCCAA CATTCTGTTC
ATCTTCAGCG ATGACCTTGC CTATCAGGCC ATCAGTGCCT ATGGAGACGA GCGAAAGCTC
CTCGAAACAC CTCATATCGA CCGTGTGGCG AAAGAGGGGA TCCGCTTTGA TCGCTGTGTC
GTCACGAACT CGATCTGCGG GCCTTGCCGT GCCACCATTC TGACAGGCAA GTACTCGCAT
AAAAACGGGT TCTACAACAA CACCAACTCA CGCTTTGACA GCACCCAAAC GACATTCCCT
AAGCTGCTCA AATCGCAGGG GTACAGCACG GCACTCATTG GCAAATGGCA CCTCATCAGC
GAACCCACAG GCTTTGATCA TTGGGAGATC CTGCCAGGGC AGGGCATCTA TTACAACCCG
CCCATGATCG CCAATGGCCA GAAGGTGCAG CGAGAAGGCT ACGTCACCGA CATCATTACG
GATCGATCGA TCGACTGGCT GAAAAATCGC GACAAATCCA AGCCCTTTCT GCTGATGGCA
CAGCATAAGG CTCCGCATCG CGAATGGTCG CCTGCACTCA GGCATCTGGG GTTCAACAAA
GACAAACCGT TTGCCGAACC CGCGACGCTC TTTGATCAGC ACAAAGATCG CGCTCAGGCA
GTCGTTGATC ACGATATGGG GATCGACCGC ACCTTCACCA AGCTCGATGC CAAACTCGTC
CCGCCTCCCG GCATCAACAG CACTCAACTC GAAGAATGGA ACAAGTACTA CCTGCCGCGT
AATAACGCCT TTGAAGCAGC CCATCTTCAA GGTCAGGATC TCGTGCGCTG GCGCTATCAA
CGGTACATGC ACGATTATCT GGCCTGTGTG AAGGCCGTCG ATGAAAGTGT GGGCCGATTA
CTCCAGACGC TCGATGAAGA AGGCCTTGCC GAAAACACAC TCGTGGTGGT TTCATCCGAT
CAGGGCTTTT ATCTGGGTGA ACATGGCTGG TTCGATAAAC GCTGGATCTT TGAAGAATCT
CTGCGGACAC CTCTGCTCGC GCGCTGGCCA GCCGCTATTC CTGCAGGCCG CACGAATGGA
CAGATTGTCT CGCTGCTCGA TATTGCCCAG ACATTCCTCG ATGTCGCAAA AATCGACGCA
CCAAACGACA TGCAGGGGGC CAGCCTCCTG CCACTGCTGA AAGGGGATAC GCCCGCTGAC
TGGCGAAAAT CGCTCTACTA TCGCTACTAC GAATACCCTT CACCTCACCG CGTCAAGCCG
CATTATGGCG TGGTGACTGA TCGTTACAAA CTCGTGCATT ATGAAGGGAC TGGTGAAGGC
GAATGGGAAC TGCTTGATCG ACAGGTTGAC CCCCAGGAAG TCAAAAGCTT CCATAACGAC
CCGGCCTATG CCCAGACCAT GACAGAACTC AAAGACGAAA TTCGACGTCT CCAGAAAGTG
GTTGACGATC AGACGCCACC TCCCGCTAAG GCTTATGGGA ATGCTCCGCT CGAATGGTCC
CCCTTCGGCC CATTGAAGTA A
 
Protein sequence
MNTVRFALIL SLGWLALFSP DVQISPAMAA EKAAPPNILF IFSDDLAYQA ISAYGDERKL 
LETPHIDRVA KEGIRFDRCV VTNSICGPCR ATILTGKYSH KNGFYNNTNS RFDSTQTTFP
KLLKSQGYST ALIGKWHLIS EPTGFDHWEI LPGQGIYYNP PMIANGQKVQ REGYVTDIIT
DRSIDWLKNR DKSKPFLLMA QHKAPHREWS PALRHLGFNK DKPFAEPATL FDQHKDRAQA
VVDHDMGIDR TFTKLDAKLV PPPGINSTQL EEWNKYYLPR NNAFEAAHLQ GQDLVRWRYQ
RYMHDYLACV KAVDESVGRL LQTLDEEGLA ENTLVVVSSD QGFYLGEHGW FDKRWIFEES
LRTPLLARWP AAIPAGRTNG QIVSLLDIAQ TFLDVAKIDA PNDMQGASLL PLLKGDTPAD
WRKSLYYRYY EYPSPHRVKP HYGVVTDRYK LVHYEGTGEG EWELLDRQVD PQEVKSFHND
PAYAQTMTEL KDEIRRLQKV VDDQTPPPAK AYGNAPLEWS PFGPLK
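
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a rough sketch, not the database's own method: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is built from the standard NCBI ordering of amino acids, whose codon assignments are the same for translation table 11 (table 11 differs from the standard table only in which codons may act as starts).

```python
# Amino acids in NCBI order for codons enumerated over bases T, C, A, G
# (first base varies slowest). '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First and last groups of the gene sequence as printed above.
print(translate("ATGAATACCG TCCGATTTGC CTTAATTCTC"))
print(translate("CCCTTCGGCC CATTGAAGTA A"))
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison with the listed protein sequence and the GC content field.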