Gene Plim_2738 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_2738 
Symbol 
ID9139450 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp3550742 
End bp3552142 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content56% 
IMG OID 
Productsulfatase 
Protein accessionYP_003630760 
Protein GI296122982 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.846304 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACCCG CTCATCACCT TACATCGGCC ACATCCCGTC TCACGTTGTT GCTTTGGAGC 
TTTTGCGTCC TCTGCGGTGC TTCCTTCGCC GCAGATTCCA CGAAGCCCAA CATCGTGGTG
ATCTTTGCTG ATGATCTTGG TTACGGTGAT CTCGGCTGTT ATGGGTCGCC AACAATCCGC
ACGCCGCACC TCGACGAGAT GGCTGCGGAG GGATTACGGT TTACAGATTT TTATTCTGCC
GCCGAGGTGT GTACCCCCAG CCGAGCGGCT TTGCTCACGG GACGATTGCC AATTCGCAGT
GGCATGTGCG GGGCTCGCCG CGTGCTGTTT CCAAACTCCA AAGGAGGGTT GCCCCCGGCA
GAGATCACCA TCGCCGAGGC ACTCAAGGAA AAGGGCTATG CCACTGCACA GATTGGCAAG
TGGCATCTCG GCATTCACCC GGGGTCGCGT CCATTGGATC AGGGCTTCGA TCAAAGCTTT
GGCCTGCCAT ACTCCAATGA TATGGATGCC CGGGCCGATT TGCCGAAAGG CTCGACGGGT
TCACCCAATC CACCGCTCGA CGGCTGGAAT GTGGCGCTGT TGCGCAATGG AGAGGTTGTT
GAACAACCGG CAAACCAGAC CACGTTAACG AAACGTTATA CCGAAGAAGC CATCAAGTTC
ATCACAGAGA AGAAGAACGT TCCATTCTTC CTCTACATGC CTCACACCTT TCCGCATGTG
CCCATGTTTG CCTCGCAGGA TTTCAAGGGC AAAAGCCGTG CGGGCATTTA TGGTGACGCT
GTTGAAGAGC TGGATTGGAG TGTGGGGCAG GTCCTGGGAG CCTTGTGTCG GGAAGGTATC
GCTGAGAATA CGCTTGTTTT CTTCTCCAGT GATAACGGCC CCTGGCTCAT CATGGGCGAT
CAAGGCGGCA GTGCCGGTCT GCTCAAGGAT GGCAAAGGGA GCACCTGGGA AGGCGGCATG
CGTGTACCCG GGATTGCCTG GATGCCGAGC CGGATCAAGC CCGGCGTGAC CAGTCAGCTC
GCCAGTGCGA TGGATGTGTT TCCCACGGCT CTGGCCCTGG CCGGTGCATC GCTCCCGAAG
GATGTTGTGT TCGATGGCGT CGATCTCGCG CCATTACTTT TCGAATCCAG GCCTCTGCCG
GAGCGACCGT TCTTTTATTA TCGAGGCAAT CAACTTTTTG CCTGCCGCCT GGGTGAATGG
AAGGCTCATT TCCAGACTCA AACAGGCTAT GGAGGCTCAA AACCGGAACG GCATGAACCA
GAACTGCTCT TTCATCTCGG TCGCGATCCT TCCGAGAAAC GTAATGTCGC CGCCGCACAT
CCCGAGGTTC TCATTCGTAT TCAGGAAGCT GTGAAGGCTC ACCAATCCCA AGTGATTCCA
GGCCCCCCAC AGCTTCAATA G
 
Protein sequence
MSPAHHLTSA TSRLTLLLWS FCVLCGASFA ADSTKPNIVV IFADDLGYGD LGCYGSPTIR 
TPHLDEMAAE GLRFTDFYSA AEVCTPSRAA LLTGRLPIRS GMCGARRVLF PNSKGGLPPA
EITIAEALKE KGYATAQIGK WHLGIHPGSR PLDQGFDQSF GLPYSNDMDA RADLPKGSTG
SPNPPLDGWN VALLRNGEVV EQPANQTTLT KRYTEEAIKF ITEKKNVPFF LYMPHTFPHV
PMFASQDFKG KSRAGIYGDA VEELDWSVGQ VLGALCREGI AENTLVFFSS DNGPWLIMGD
QGGSAGLLKD GKGSTWEGGM RVPGIAWMPS RIKPGVTSQL ASAMDVFPTA LALAGASLPK
DVVFDGVDLA PLLFESRPLP ERPFFYYRGN QLFACRLGEW KAHFQTQTGY GGSKPERHEP
ELLFHLGRDP SEKRNVAAAH PEVLIRIQEA VKAHQSQVIP GPPQLQ